MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  The complexity of decentralized control of markov decision processes (2000) [100 citations — 13 self]

Download:
Download as a PDF | Download as a PS
by Daniel S. Bernstein, Robert Givan, Neil Immerman, Shlomo Zilberstein
Mathematics of Operations Research
http://dynamo.ecn.purdue.edu/~givan/brown/papers/mor00.ps
Add To MetaCart

Abstract:

We consider decentralized control of Markov decision processes and give complexity bounds on the worst-case running time for algorithms that nd optimal solutions. Generalizations of both the fullyobservable case and the partially-observable case that allow for decentralized control are described. For even two agents, the nite-horizon problems corresponding to both of these models are hard for nondeterministic exponential time. These complexity results illustrate a fundamental dierence between centralized and decentralized control of Markov decision processes. In contrast to the problems involving centralized control, the problems we consider provably do not admit polynomial-time algorithms. Furthermore, assuming EXP

Citations

1633 Computational Complexity – Papadimitriou - 1994
424 Planning and Acting in Partially Observable Stochastic Domains – Kaelbling, Littman, et al. - 1998
372 Markov Decision Processes – Puterman - 1994
334 Collaborative plans for complex group action – GROSZ, KRAUS - 1996
325 Non-deterministic exponential time has two-prover interactive proofs – Babai, Fortnow, et al. - 1991
192 The complexity of Markov decision processes – Papadimitriou, Tsitsiklis - 1987
135 Task Decomposition, Dynamic Role Assignment, and Low-Bandwidth Communication for Real-Time Strategic Teamwork – Stone, Veloso - 1999
126 Reinforcement learning algorithm for partially observable Markov decision problems – Jaakkola, Singh, et al. - 1994
115 Incremental Pruning: A Simple, Fast, Exact Method for Partially Observable Markov Decision Processes – Cassandra, Littman, et al. - 1997
93 Learning to cooperate via policy search – Peshkin, Kim, et al. - 2000
66 Solving POMDPs by searching in policy space – Hansen - 1998
63 A survey of computational complexity results in systems and control – Blondel, Tsitsiklis - 2000
51 On the undecidability of probabilistic planning and infinite-horizon partially observable Markov decision processes – Madani, Hanks, et al. - 1999
50 Reflections on the nature of multi-agent coordination and its implications for an agent architecture. Autonomous Agents and Multi-Agent Systems – Lesser - 1998
47 Using communication to reduce locality in distributed multi-agent learning – Matarić - 1998
46 On the complexity of decentralized decision making and detection problems – Tsitsiklis, Athans - 1985
45 Distributed problem solving and planning – Durfee - 1999
42 A survey of research in distributed, continual planning – desJardins, Durfee, et al. - 1999
35 Solving POMDPs by searching the space of finite policies – Meuleau, Kim, et al. - 1999
34 Distributed value functions – Schneider, Wong, et al. - 1999
30 Complexity of finite-horizon Markov decision process problems – Mundhenk, Goldsmith, et al. - 2000
26 Multiple-person alternation – Peterson, Reif - 1979
25 Intractable problems in control theory – Papadimitriou, Tsitsiklis - 1986
18 Complexity of solvable cases of the decision problem for the predicate calculus – Lewis - 1978
16 On the complexity of designing distributed protocols – Papadimitriou, Tsitsiklis - 1982
15 Decentralized control of finite state markov processes – Hsu, Marcus - 1982
11 Decentralized control of a multiple access broadcast channel: Performance bounds – Ooi, Wornell - 1996
10 My brain is full: When more memory helps – Lusena, Li, et al. - 1999
10 Multiagent Systems: Challenges and Opportunities for Decision-Theoretic Planning – Boutilier - 1999
9 Decentralized op-timal control of markov chains with a common past information set – Aicardi, Davoli, et al. - 1987
6 Applications of Markov Decision Processes in Telecommunications: A Survey – Altman - 2001
6 On the undecidability of probabilistic planning and in partially observable markov decision process problems – Madani, Hanks - 1999
2 Decentralized control of state Markov processes – Marcus - 1982
1 An integrated system for mulit-rover scientific exploration – Estlin, Gray, et al. - 1999
1 Planning and actiong in partially observable stochastic domains – Kaelbling, Littman - 1998
1 A separation theorem for periodic sharing information patterns in decentralized control – Ooi, Verbout, et al. - 1997