We consider decentralized control of Markov decision processes and give complexity bounds on the worst-case running time for algorithms that nd optimal solutions. Generalizations of both the fullyobservable case and the partially-observable case that allow for decentralized control are described. For even two agents, the nite-horizon problems corresponding to both of these models are hard for nondeterministic exponential time. These complexity results illustrate a fundamental dierence between centralized and decentralized control of Markov decision processes. In contrast to the problems involving centralized control, the problems we consider provably do not admit polynomial-time algorithms. Furthermore, assuming EXP
|
1633
|
Computational Complexity
– Papadimitriou
- 1994
|
|
424
|
Planning and Acting in Partially Observable Stochastic Domains
– Kaelbling, Littman, et al.
- 1998
|
|
372
|
Markov Decision Processes
– Puterman
- 1994
|
|
334
|
Collaborative plans for complex group action
– GROSZ, KRAUS
- 1996
|
|
325
|
Non-deterministic exponential time has two-prover interactive proofs
– Babai, Fortnow, et al.
- 1991
|
|
192
|
The complexity of Markov decision processes
– Papadimitriou, Tsitsiklis
- 1987
|
|
135
|
Task Decomposition, Dynamic Role Assignment, and Low-Bandwidth Communication for Real-Time Strategic Teamwork
– Stone, Veloso
- 1999
|
|
126
|
Reinforcement learning algorithm for partially observable Markov decision problems
– Jaakkola, Singh, et al.
- 1994
|
|
115
|
Incremental Pruning: A Simple, Fast, Exact Method for Partially Observable Markov Decision Processes
– Cassandra, Littman, et al.
- 1997
|
|
93
|
Learning to cooperate via policy search
– Peshkin, Kim, et al.
- 2000
|
|
66
|
Solving POMDPs by searching in policy space
– Hansen
- 1998
|
|
63
|
A survey of computational complexity results in systems and control
– Blondel, Tsitsiklis
- 2000
|
|
51
|
On the undecidability of probabilistic planning and infinite-horizon partially observable Markov decision processes
– Madani, Hanks, et al.
- 1999
|
|
50
|
Reflections on the nature of multi-agent coordination and its implications for an agent architecture. Autonomous Agents and Multi-Agent Systems
– Lesser
- 1998
|
|
47
|
Using communication to reduce locality in distributed multi-agent learning
– Matarić
- 1998
|
|
46
|
On the complexity of decentralized decision making and detection problems
– Tsitsiklis, Athans
- 1985
|
|
45
|
Distributed problem solving and planning
– Durfee
- 1999
|
|
42
|
A survey of research in distributed, continual planning
– desJardins, Durfee, et al.
- 1999
|
|
35
|
Solving POMDPs by searching the space of finite policies
– Meuleau, Kim, et al.
- 1999
|
|
34
|
Distributed value functions
– Schneider, Wong, et al.
- 1999
|
|
30
|
Complexity of finite-horizon Markov decision process problems
– Mundhenk, Goldsmith, et al.
- 2000
|
|
26
|
Multiple-person alternation
– Peterson, Reif
- 1979
|
|
25
|
Intractable problems in control theory
– Papadimitriou, Tsitsiklis
- 1986
|
|
18
|
Complexity of solvable cases of the decision problem for the predicate calculus
– Lewis
- 1978
|
|
16
|
On the complexity of designing distributed protocols
– Papadimitriou, Tsitsiklis
- 1982
|
|
15
|
Decentralized control of finite state markov processes
– Hsu, Marcus
- 1982
|
|
11
|
Decentralized control of a multiple access broadcast channel: Performance bounds
– Ooi, Wornell
- 1996
|
|
10
|
My brain is full: When more memory helps
– Lusena, Li, et al.
- 1999
|
|
10
|
Multiagent Systems: Challenges and Opportunities for Decision-Theoretic Planning
– Boutilier
- 1999
|
|
9
|
Decentralized op-timal control of markov chains with a common past information set
– Aicardi, Davoli, et al.
- 1987
|
|
6
|
Applications of Markov Decision Processes in Telecommunications: A Survey
– Altman
- 2001
|
|
6
|
On the undecidability of probabilistic planning and in partially observable markov decision process problems
– Madani, Hanks
- 1999
|
|
2
|
Decentralized control of state Markov processes
– Marcus
- 1982
|
|
1
|
An integrated system for mulit-rover scientific exploration
– Estlin, Gray, et al.
- 1999
|
|
1
|
Planning and actiong in partially observable stochastic domains
– Kaelbling, Littman
- 1998
|
|
1
|
A separation theorem for periodic sharing information patterns in decentralized control
– Ooi, Verbout, et al.
- 1997
|