The recent advances in computer speed and algorithms for probabilistic inference have led to a resurgence of work on planning under uncertainty. The aim is to design AI planners for environments where there may be incomplete or faulty information, where actions may not always have the same results and where there may be tradeoffs between the different possible outcomes of a plan. Addressing uncertainty in AI planning algorithms will greatly increase the range of potential applications but there is plenty of work to be done before we see practical decision-theoretic planning systems. This article outlines some of the challenges that need to be overcome and surveys some of the recent work in the area.
|
4388
|
Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference
– Pearl
- 1988
|
|
1397
|
STRIPS: A new approach in the application of theorem proving to problem solving
– Fikes, Nilsson
- 1971
|
|
740
|
Fast planning through planning graph analysis
– Blum, Furst
- 1995
|
|
548
|
Markov decision processes : discrete stochastic dynamic programming
– Puterman
- 1994
|
|
511
|
A machine program for theorem proving
– Davis, Logemann, et al.
- 1962
|
|
432
|
Pushing the envelope: Planning propositional logic, and stochastic search
– Kautz, Selman
- 1996
|
|
389
|
UCPOP: A sound, complete, partial order planner for ADL
– Penberthy, Weld
- 1992
|
|
353
|
Dynamic Programming and Markov Processes
– Howard
- 1960
|
|
342
|
Generating project networks
– Tate
- 1977
|
|
241
|
An algorithm for probabilistic planning
– Kushmerick, Hanks, et al.
- 1995
|
|
234
|
An Introduction to Least Commitment Planning
– Weld
- 1994
|
|
210
|
Acting optimally in partially observable stochastic domains
– Cassandra, Kaelbling, et al.
- 1994
|
|
201
|
Conditional nonlinear planning
– Peot, Smith
- 1992
|
|
195
|
Probabilistic planning with information gathering and contingent execution
– Draper, S, et al.
- 1994
|
|
187
|
Exploiting structure in policy construction
– Boutilier, Dearden, et al.
- 1995
|
|
183
|
Integrated planning and learning: The PRODIGY architecture
– Veloso, Carbonell, et al.
- 1995
|
|
155
|
GPS, a program that simulates human thought, Computers and Thought
– Newell, Simon
- 1963
|
|
145
|
Planning under time constraints in stochastic domains
– Dean, Kaelbling, et al.
- 1995
|
|
133
|
Planning with deadlines in stochastic domains
– Dean, Kaelbling, et al.
- 1993
|
|
133
|
Extending graphplan to handle uncertainty and sensing actions
– Weld, Anderson, et al.
- 1998
|
|
131
|
Algorithms for Sequential Decision Making
– Littman
- 1996
|
|
118
|
Approximating optimal policies for partially observable stochastic domains. (unpublished manuscript
– Parr, Russell
- 1995
|
|
102
|
Conformant Graphplan
– Smith, Weld
- 1998
|
|
95
|
Planning and reacting in uncertain and dynamic environments
– Wilkins, Myers, et al.
- 1995
|
|
94
|
Decomposition techniques for planning in stochastic domains
– Dean, Lin
- 1995
|
|
88
|
Planning for contingencies: a decision-based approach
– Pryor, Collins
- 1996
|
|
86
|
Utility models for goaldirected, decision-theoretic planners
– Haddawy, Hanks
- 1998
|
|
84
|
Model minimization in Markov decision processes
– Dean, Givan
- 1997
|
|
82
|
An algorithm for probabilistic least-commitment planning
– Kushmerick, Hanks, et al.
- 1994
|
|
81
|
Planning under uncertainty: Structural assumptions and computational leverage. (manuscript
– Boutilier, Dean, et al.
- 1995
|
|
79
|
Formulation of Tradeoffs in Planning Under Uncertainty
– Wellman
- 1990
|
|
77
|
Automatically Generating Abstractions for Problem Solving
– Knoblock
- 1991
|
|
65
|
Using abstractions for decision-theoretic planning with time constraints
– Boutilier, Dearden
- 1994
|
|
61
|
The computational complexity of probabilistic planning
– Littman, Goldsmith, et al.
- 1998
|
|
60
|
Maxplan: A new approach to probabilistic planning
– Majercik, Littman
- 1998
|
|
53
|
The concept and implementation of skeletal plans
– Friedland, Iwasaki
- 1985
|
|
50
|
Planning with external events
– Blythe
- 1994
|
|
48
|
Decision theory and artificial intelligence ii: The hungymonkey. Cognitive Science 1:158-192
– Feldman, Sproull
- 1977
|
|
47
|
Transformational planning of reactive behavior
– McDermott
- 1992
|
|
46
|
Decision-theoretic planning
– Blythe
- 1999
|
|
43
|
A Heuristic Variable Grid Solution Method for POMDPs
– Brafman
- 1997
|
|
43
|
Efficient decision-theoretic planning: Techniques and empirical analysis
– Haddawy, Doan, et al.
- 1995
|
|
42
|
Decision-theoretic refinement planning using inheritance abstraction
– Haddawy, Suwandi
- 1994
|
|
35
|
Control strategies for a stochastic planner
– Tash, Russell
- 1994
|
|
31
|
Epsilon-safe planning
– Goldman, Boddy
- 1994
|
|
30
|
Planning Under Uncertainty in Dynamic Domains
– Blythe
- 1998
|
|
30
|
Prioritized goal Decomposition of Markov decision processes: Toward a synthesis of classical and decision theoretic planning
– Boutilier, Brafman, et al.
- 1997
|
|
29
|
Control knowledge to improve plan quality
– Perez, Carbonell
- 1994
|
|
26
|
Contingency selection in plan generation
– Onder, Pollack
- 1997
|
|
25
|
Structured Reachability Analysis for Markov Decision Processes
– Boutilier, Brafman, et al.
|