research-article

Computing Contingent Plan Graphs using Online Planning

Authors:
Shlomi Maliah
Software and Information Systems Engineering, Ben Gurion University of the Negev, Israel

Radimir Komarnitski
Software and Information Systems Engineering, Ben Gurion University of the Negev, Israel

Guy Shani
Software and Information Systems Engineering, Ben Gurion University of the Negev, Israel
ORCID: 0000-0003-4131-0382

ACM Transactions on Autonomous and Adaptive Systems, Volume 16, Issue 1, Article No. 1, pp. 1–30. https://doi.org/10.1145/3488903

Published: 23 January 2022

Abstract

In contingent planning under partial observability with sensing actions, agents actively use sensing to discover meaningful facts about the world. Recent successful approaches translate the partially observable contingent problem into a non-deterministic fully observable problem, and then use a planner for non-deterministic planning. However, the translation may become very large, encumbering the task of the non-deterministic planner. We suggest a different approach: using an online contingent solver repeatedly to construct a plan tree. We execute the plan returned by the online solver until the next observation action, then branch on the possible observed values and replan for every branch independently. In many cases a plan tree can have a width exponential in the number of state variables, but the tree may have a structure that allows us to represent it compactly as a directed graph. We suggest a mechanism for tailoring such a graph that reduces both the computational effort and the storage space. Our method also handles non-deterministic domains by identifying cycles in the plans. We present a set of experiments showing that our approach scales better than state-of-the-art offline planners.
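The construction loop described in the abstract (execute the online plan up to the next sensing action, branch on each observation, replan per branch) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and not the authors' implementation: the toy "doors" domain, the names `online_plan`, `build_graph`, and `tree_size`, and in particular the merging rule, which simply reuses a node whenever the exact same belief state (set of possible states) is reached again. The article's actual mechanism for building the graph may differ.

```python
from dataclasses import dataclass, field
from itertools import product
from typing import Dict, FrozenSet, List, Optional, Tuple

# Hypothetical toy domain: k stages, each with two doors, exactly one open.
# A state is (stage, open door of every remaining stage); passed stages are dropped.
State = Tuple[int, Tuple[int, ...]]
Belief = FrozenSet[State]

@dataclass
class Node:
    actions: List[str] = field(default_factory=list)   # actions executed before sensing
    sense: Optional[str] = None                         # sensing action that ends the prefix
    children: Dict[int, "Node"] = field(default_factory=dict)  # observation -> successor

def is_goal(s: State) -> bool:
    return not s[1]                      # no stages left to pass

def apply_action(s: State, a: str) -> State:
    stage, doors = s
    d = int(a[-1])                       # action "go-0" or "go-1"
    return (stage + 1, doors[1:]) if doors and doors[0] == d else s

def observe(s: State) -> int:
    return s[1][0]                       # which door of the current stage is open

def online_plan(belief: Belief) -> List[str]:
    """Stand-in for an online contingent solver: plans for one sampled state."""
    _, doors = min(belief)
    known = len({s[1][0] for s in belief}) == 1
    plan: List[str] = []
    for i, d in enumerate(doors):
        if i > 0 or not known:
            plan.append("sense")
        plan.append(f"go-{d}")
    return plan

def build_graph(belief: Belief, cache: Dict[Belief, Node]) -> Node:
    if belief in cache:                  # belief seen before: reuse the node
        return cache[belief]
    node = cache[belief] = Node()
    if all(map(is_goal, belief)):
        return node
    for a in online_plan(belief):
        if a == "sense":                 # stop executing, branch on observations
            node.sense = a
            groups: Dict[int, set] = {}
            for s in belief:
                groups.setdefault(observe(s), set()).add(s)
            for obs, sub in groups.items():
                node.children[obs] = build_graph(frozenset(sub), cache)
            return node
        belief = frozenset(apply_action(s, a) for s in belief)
        node.actions.append(a)
    if not all(map(is_goal, belief)):
        raise ValueError("plan ended before the goal or a sensing action")
    return node

def tree_size(node: Node) -> int:
    """Number of nodes if the graph were unfolded into a plan tree."""
    return 1 + sum(tree_size(c) for c in node.children.values())

def initial_belief(k: int) -> Belief:
    return frozenset((0, doors) for doors in product((0, 1), repeat=k))

if __name__ == "__main__":
    cache: Dict[Belief, Node] = {}
    root = build_graph(initial_belief(3), cache)
    print(len(cache), tree_size(root))
```

In this toy domain with three stages, the graph has 7 nodes while the unfolded plan tree has 15. The gap widens exponentially with the number of stages, which is the kind of saving the abstract attributes to the graph representation. The sketch covers only deterministic actions; the cycle detection the abstract mentions for non-deterministic domains is not modeled here.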

Index Terms

Computing Contingent Plan Graphs using Online Planning
1. Computing methodologies
  1. Artificial intelligence
    1. Planning and scheduling
      1. Planning under uncertainty

Published in
ACM Transactions on Autonomous and Adaptive Systems Volume 16, Issue 1
March 2021
73 pages
ISSN: 1556-4665
EISSN: 1556-4703
DOI: 10.1145/3505218
Editor: Valérie Issarny, Inria, France
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 23 January 2022
- Revised: 1 September 2021
- Accepted: 1 September 2021
- Received: 1 August 2020
Author Tags
Automated planning
contingent planning
partial observability
Qualifiers
- research-article
- Refereed