skip to main content
research-article

Computing Contingent Plan Graphs using Online Planning

Authors Info & Claims
Published:23 January 2022Publication History
Skip Abstract Section

Abstract

In contingent planning under partial observability with sensing actions, agents actively use sensing to discover meaningful facts about the world. Recent successful approaches translate the partially observable contingent problem into a non-deterministic fully observable problem, and then use a planner for non-deterministic planning. However, the translation may become very large, encumbering the task of the non-deterministic planner. We suggest a different approach—using an online contingent solver repeatedly to construct a plan tree. We execute the plan returned by the online solver until the next observation action, and then branch on the possible observed values, and replan for every branch independently. In many cases a plan tree can have an exponential width in the number of state variables, but the tree may have a structure that allows us to compactly represent it using a directed graph. We suggest a mechanism for tailoring such a graph that reduces both the computational effort and the storage space. Our method also handles non-deterministic domains, by identifying cycles in the plans. We present a set of experiments, showing our approach to scale better than state-of-the-art offline planners.

REFERENCES

  1. [1] Albore Alexandre, Palacios Héctor, and Geffner Hector. 2009. A translation-based approach to contingent planning. In Proceedings of the IJCAI. 16231628. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. [2] Bertoli P., Cimatti A., Pistore M., Roveri M., and Traverso P.. 2001. MBP: A model based planner. In Proceedings of the IJCAI’01 Workshop on Planning under Uncertainty and Incomplete Information. Seattle.Google ScholarGoogle Scholar
  3. [3] Blum Avrim L. and Furst Merrick L.. 1997. Fast planning through planning graph analysis. Artificial Intelligence 90, 1–2 (1997), 281300. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. [4] Bonet Blai and Geffner Hector. 2001. GPT: A tool for planning with uncertainty and partial information. In Proceedings of the IJCAI-01 Workshop on Planning with Uncertainty and Partial Information. Citeseer, 8287.Google ScholarGoogle Scholar
  5. [5] Bonet Blai and Geffner Hector. 2011. Planning under partial observability by classical replanning: Theory and experiments. In IJCAI 2011, Proceedings of the 22nd International Joint Conference on Artificial Intelligence, Barcelona, Catalonia, Spain, July 16–22, 2011. 19361941. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. [6] Bonet Blai and Geffner Hector. 2014. Belief tracking for planning with sensing: Width, complexity and approximations. Journal of Artificial Intelligence Research 50 (2014), 923970. DOI: https://doi.org/10.1613/jair.4475 Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. [7] Bonet Blai, Palacios Hector, and Geffner Hector. 2010. Automatic derivation of finite-state machines for behavior control. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 24. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. [8] Brafman Ronen I. and Shani Guy. 2012. A multi-path compilation approach to contingent planning. In Proceedings of the 26th AAAI Conference on Artificial Intelligence. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. [9] Brafman Ronen I. and Shani Guy. 2012. Replanning in domains with partial information and sensing actions. Journal of Artificial Intelligence Research 45, 1 (2012), 565600. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. [10] Brafman Ronen I. and Shani Guy. 2014. On the properties of belief tracking for online contingent planning using regression. In Proceedings of the ECAI 2014-21st European Conference on Artificial Intelligence. 147152. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. [11] Brafman Ronen I. and Shani Guy. 2016. Online belief tracking using regression for contingent planning. Artificial Intelligence 241 (2016), 131152. DOI: https://doi.org/10.1016/j.artint.2016.08.005 Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. [12] Bryce Dan, Kambhampati Subbarao, and Smith David E.. 2006. Planning graph heuristics for belief space search. Journal of AI Research 26, 1 (2006), 3599. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. [13] Cassandra Anthony R.. 1998. A survey of POMDP applications. In Proceedings of the Working Notes of AAAI 1998 Fall Symposium on Planning with Partially Observable Markov Decision Processes, Vol. 1724.Google ScholarGoogle Scholar
  14. [14] Cassandra Anthony R., Littman Michael L., and Zhang Nevin Lianwen. 2013. Incremental pruning: A simple, fast, exact method for partially observable markov decision processes. In Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence (UAI’97). Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 54–61. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. [15] Cimatti Alessandro and Roveri Marco. 2000. Conformant planning via symbolic model checking. Journal of Artificial Intelligence Research 13 (2000), 305338. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. [16] Fang Liangda, Liu Yongmei, and Wen Ximing. 2015. On the progression of knowledge and belief for nondeterministic actions in the situation calculus. In Proceedings of the 24th International Joint Conference on Artificial Intelligence. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. [17] Fox Maria, Gerevini Alfonso, Long Derek, and Serina Ivan. 2006. Plan Stability: Replanning versus Plan Repair. In Proceedings of the ICAPS, Vol. 6. 212221. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. [18] Hansen Eric A.. 2008. Sparse stochastic finite-state controllers for POMDPs. In Proceedings of the UAI. 256263. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. [19] Hoffmann J. and Nebel B.. 2001. The FF planning system: Fast plan generation through heuristic search. JAIR 14 (2001), 253302. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. [20] Hoffmann Jörg and Brafman Ronen. 2005. Contingent planning via heuristic forward search with implicit belief states. In Proceedings of the ICAPS, Vol. 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. [21] Hu Yuxiao and Giacomo Giuseppe De. 2011. A generic framework and solver for synthesizing finite-state controllers. In Proceedings of the AAAI 2011 Workshop on Generalized Planning.Google ScholarGoogle Scholar
  22. [22] Junges Sebastian, Jansen Nils, Wimmer Ralf, Quatmann Tim, Winterer Leonore, Katoen Joost-Pieter, and Becker Bernd. 2018. Finite-state controllers of POMDPs using parameter synthesis. In Proceedings of the 34th Conference on Uncertainty in Artificial Intelligence, UAI 2018, Monterey, California, August 6-10, 2018, Globerson Amir and Silva Ricardo (Eds.), AUAI Press, 519529.Google ScholarGoogle Scholar
  23. [23] Klassen Toryn Q., McIlraith Sheila A., and Levesque Hector J.. 2018. Specifying plausibility levels for iterated belief change in the situation calculus. In Proceedings of the 16th International Conference on Principles of Knowledge Representation and Reasoning.Google ScholarGoogle Scholar
  24. [24] Komarnitsky Radimir and Shani Guy. 2016. Computing contingent plans using online replanning. In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, February 12-17, 2016, Phoenix, Arizona.31593165. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. [25] Kurniawati Hanna, Hsu David, and Lee Wee Sun. 2008. Sarsop: Efficient point-based pomdp planning by approximating optimally reachable belief spaces. In Proceedings of the Robotics: Science and Systems, Vol. 2008. Citeseer.Google ScholarGoogle ScholarCross RefCross Ref
  26. [26] Maliah Shlomi, Brafman Ronen I., Karpas Erez, and Shani Guy. 2014. Partially observable online contingent planning using landmark heuristics. In Proceedings of the 24th International Conference on Automated Planning and Scheduling, ICAPS. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. [27] Muise Christian J., McIlraith Sheila A., and Beck J. Christopher. 2012. Improved non-deterministic planning by exploiting state relevance. In Proceedings of the 22nd International Conference on Automated Planning and Scheduling, ICAPS. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. [28] Muise Christian J., Belle Vaishak, and McIlraith Sheila A.. 2014. Computing contingent plans via fully observable non-deterministic planning. In Proceedings of the 28th AAAI Conference on Artificial Intelligence. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. [29] Palacios Héctor, Albore Alexandre, and Geffner Hector. 2014. Compiling contingent planning into classical planning: New translations and results. In Proceedings of the ICAPS Workshop on Models and Paradigms for Planning under Uncertainty.Google ScholarGoogle Scholar
  30. [30] Palacios Héctor and Geffner Hector. 2009. Compiling uncertainty away in conformant planning problems with bounded width. Journal of Artificial Intelligence Research 35 (2009), 623675. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. [31] Poupart Pascal and Boutilier Craig. 2003. Bounded finite state controllers. Advances in Neural Information Processing Systems 16 (2003), 823830. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. [32] Rintanen Jussi. 2008. Regression for classical and nondeterministic planning. In Proceedings of the ECAI 2008-18th European Conference on Artificial Intelligence, Patras, Greece, July 21–25, 2008, Proceedings. 568572. DOI: https://doi.org/10.3233/978-1-58603-891-5-568 Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. [33] Rintanen Jussi. 2009. Planning and SAT. Handbook of Satisfiability 185 (2009), 483504.Google ScholarGoogle Scholar
  34. [34] Sanner Scott and Kersting Kristian. 2010. Symbolic dynamic programming for first-order POMDPs. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 24. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. [35] Shani Guy and Brafman Ronen I.. 2011. Replanning in domains with partial information and sensing actions. In Proceedings of the IJCAI. 20212026. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. [36] Shani Guy, Poupart Pascal, Brafman Ronen I., and Shimony Solomon Eyal. 2008. Efficient ADD operations for point-based algorithms. In Proceedings of the ICAPS. 330337. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. [37] Shani Guy, Pineau Joelle, and Kaplow Robert. 2013. A survey of point-based POMDP solvers. Autonomous Agents and Multi-Agent Systems 27, 1 (2013), 151. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. [38] Shmaryahu Dorin, Shani Guy, and Hoffmann Jörg. 2019. Comparative criteria for partially observable contingent planning. Autonomous Agents and Multi-Agent Systems 33, 5 (2019), 481517. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. [39] Sim Hyeong Seop, Kim Kee-Eung, Kim Jin Hyung, Chang Du-Seong, and Koo Myoung-Wan. 2008. Symbolic heuristic search value iteration for factored POMDPs. In Proceedings of the AAAI. 10881093. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. [40] Sondik Edward J.. 1978. The optimal control of partially observable Markov processes over the infinite horizon: Discounted costs. Operations Research 26, 2 (1978), 282304. Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. [41] To Son Thanh. 2011. On the impact of belief state representation in planning under uncertainty. In Proceedings of the IJCAI. 28562857. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. [42] To Son Thanh, Pontelli Enrico, and Son Tran Cao. 2011. On the effectiveness of CNF and DNF representations in contingent planning. In Proceedings of the IJCAI. 20332038. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. [43] To Son Thanh, Son Tran Cao, and Pontelli Enrico. 2011. On the effectiveness of belief state representation in contingent planning. In Proceedings of the AAAI. Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. [44] To Son Thanh, Son Tran Cao, and Pontelli Enrico. 2011. Conjunctive representations in contingent planning: Prime implicates versus minimal CNF formula. In Proceedings of the AAAI. Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. [45] Yoon Sung Wook, Fern Alan, and Givan Robert. 2007. FF-replan: A baseline for probabilistic planning. In Proceedings of the ICAPS, Vol. 7. 352359. Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. [46] Zhang Xiaodi, Grastien Alban, and Scala Enrico. 2020. Computing superior counter-examples for conformant planning. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 34. 1001710024.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Computing Contingent Plan Graphs using Online Planning

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Autonomous and Adaptive Systems
      ACM Transactions on Autonomous and Adaptive Systems  Volume 16, Issue 1
      March 2021
      73 pages
      ISSN:1556-4665
      EISSN:1556-4703
      DOI:10.1145/3505218
      Issue’s Table of Contents

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 23 January 2022
      • Revised: 1 September 2021
      • Accepted: 1 September 2021
      • Received: 1 August 2020
      Published in taas Volume 16, Issue 1

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article
      • Refereed

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    View Full Text

    HTML Format

    View this article in HTML Format .

    View HTML Format