• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

DMCA

The Complexity of Decentralized Control of Markov Decision Processes (2000)

Cached

  • Download as a PDF

Download Links

  • [anytime.cs.umass.edu]
  • [anytime.cs.umass.edu]
  • [anytime.cs.umass.edu]
  • [anytime.cs.umass.edu]
  • [rbr.cs.umass.edu]
  • [anytime.cs.umass.edu]
  • [anytime.cs.umass.edu]
  • [anytime.cs.umass.edu]
  • [rbr.cs.umass.edu]
  • [rbr.cs.umass.edu]
  • [www-anw.cs.umass.edu]
  • [dynamo.ecn.purdue.edu]
  • [rbr.cs.umass.edu]
  • [anytime.cs.umass.edu]
  • [www-anw.cs.umass.edu]
  • [rbr.cs.umass.edu]
  • [www.cs.umass.edu]
  • [people.cs.umass.edu]
  • [people.cs.umass.edu]
  • [anytime.cs.umass.edu]
  • [rbr.cs.umass.edu]
  • [anytime.cs.umass.edu]
  • [works.bepress.com]
  • [works.bepress.com]

  • Other Repositories/Bibliography

  • DBLP
  • Save to List
  • Add to Collection
  • Correct Errors
  • Monitor Changes
by Daniel S. Bernstein , Robert Givan , Neil Immerman , Shlomo Zilberstein
Venue:Mathematics of Operations Research
Citations:411 - 46 self
  • Summary
  • Citations
  • Active Bibliography
  • Co-citation
  • Clustered Documents
  • Version History

BibTeX

@INPROCEEDINGS{Bernstein00thecomplexity,
    author = {Daniel S. Bernstein and Robert Givan and Neil Immerman and Shlomo Zilberstein},
    title = {The Complexity of Decentralized Control of Markov Decision Processes},
    booktitle = {Mathematics of Operations Research},
    year = {2000},
    pages = {2002}
}

Share

Facebook Twitter Reddit Bibsonomy

OpenURL

 

Abstract

We consider decentralized control of Markov decision processes and give complexity bounds on the worst-case running time for algorithms that find optimal solutions. Generalizations of both the fullyobservable case and the partially-observable case that allow for decentralized control are described. For even two agents, the finite-horizon problems corresponding to both of these models are hard for nondeterministic exponential time. These complexity results illustrate a fundamental difference between centralized and decentralized control of Markov decision processes. In contrast to the problems involving centralized control, the problems we consider provably do not admit polynomial-time algorithms. Furthermore, assuming EXP NEXP, the problems require super-exponential time to solve in the worst case.

Keyphrases

markov decision process    decentralized control    fundamental difference    finite-horizon problem    optimal solution    markov decision    partially-observable case    polynomial-time algorithm    super-exponential time    complexity bound    nondeterministic exponential time    worst-case running time    complexity result    exp nexp    fullyobservable case   

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University