See this document in CiteSeerX!

Acting Optimally in Partially Observable Stochastic Domains (1994)  (Make Corrections)  (148 citations)
Anthony R. Cassandra, Leslie Pack Kaelbling, Michael L. Littman
Proceedings of the Twelfth National Conference on Artificial Intelligence (AAAI-94)



  Home/Search   Context   Related

 
View or download:
brown.edu/people/lpk/aaai94.ps
duke.edu/~mlittman/doc...pomdpfinal.ps
brown.edu/~lpk/aaai94.ps
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  brown.edu/research/ai/publicat... (more)
From:  duke.edu/~mlittman/docs/refer
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: In this paper, we describe the partially observable Markov decision process (pomdp) approach to finding optimal or near-optimal control strategies for partially observable stochastic environments, given a complete model of the environment. The pomdp approach was originally developed in the operations research community and provides a formal basis for planning problems that have been of interest to the AI community. We found the existing algorithms for computing optimal control strategies to be... (Update)

Cited by:   More
Vector-space Analysis of Belief-state Approximation for.. - Pascal Poupart University (2001)   (Correct)
Value-Directed Sampling Methods for Monitoring POMDPs - Pascal Poupart University (2001)   (Correct)
Approximate Solutions For Partially Observable Stochastic.. - Common Payoffs Rosemary   (Correct)

Similar documents (at the sentence level):
6.4%:   Acting under Uncertainty: Discrete Bayesian Models for .. - Cassandra, Kaelbling, .. (1996)   (Correct)
6.4%:   Discrete Bayesian Uncertainty Models for Mobile-Robot.. - Cassandra, Kaelbling..   (Correct)

Active bibliography (related documents):   More   All
0.5:   A Hybrid Architecture for Situated Learning of Reactive.. - Sun, Peterson, Merrill (1999)   (Correct)
0.5:   Research Summary - Miyazaki   (Correct)
0.2:   Planning in an Imperfect World Using Previous Experiences - Chiu (1995)   (Correct)

Similar documents based on text:   More   All
0.5:   Approximate Planning with Hierarchical Partially.. - Theocharous, Mahadevan (2002)   (Correct)
0.3:   Learning policies for partially observable.. - Littman, Cassandra.. (1995)   (Correct)
0.3:   A Heuristic Variable Grid Solution Method for POMDPs - Brafman (1997)   (Correct)

Related documents from co-citation:   More   All
71:   The optimal control of partially observable markov processes over a finite horiz.. (context) - Smallwood, Sondik - 1973
37:   A survey of partially observable Markov decision processes: Theory (context) - GE - 1982
27:   Approximating optimal policies for partially observable stochastic domains - Parr, Russell - 1995

BibTeX entry:   (Update)

Cassandra, A.; Kaelbling, L.; and Littman, M. 1994. Acting optimally in partially observable stochastic domains. In Proceedings of the National Conference on Artificial Intelligence (AAAI), 1023--1028. http://citeseer.ist.psu.edu/cassandra94acting.html   More

@inproceedings{ cassandra94acting,
    author = "Anthony R. Cassandra and Leslie Pack Kaelbling and Michael L. Littman",
    title = "Acting Optimally in Partially Observable Stochastic Domains",
    booktitle = "Proceedings of the Twelfth National Conference on Artificial Intelligence ({AAAI}-94)",
    volume = "2",
    publisher = "AAAI Press/MIT Press",
    address = "Seattle, Washington, USA",
    isbn = "0-262-51078-2",
    pages = "1023--1028",
    year = "1994",
    url = "citeseer.ist.psu.edu/cassandra94acting.html" }
Citations (may not include all citations):
268   Dynamic Programming and Markov Processes (context) - Howard - 1960
216   The optimal control of partially observable markov processes.. (context) - Smallwood, Sondik - 1973
216   The Optimal Control of Partially Observable Markov Processes (context) - Sondik - 1971
210   A formal theory of knowledge and action (context) - Moore - 1985
138   Integrated architectures for learning (context) - Sutton - 1990
106   A survey of partially observable markov decision processes: .. (context) - Monahan - 1982
103   Reinforcement learning with perceptual aliasing: The percept.. - Chrisman - 1992
96   The complexity of markov decision processes (context) - Papadimitriou, Tsitsiklis - 1987
71   A survey of algorithmic methods for partially observed marko.. (context) - Lovejoy - 1991
42   Optimal control of markov decision processes with incomplete.. (context) - Astrom - 1965
30   Algorithms for Partially Observable Markov Decision Processe.. (context) - Cheng - 1988
8   and Dayan (context) - Watkins - 1992
5   Algorithms for partially observable markov decision processe.. (context) - Cassandra, Kaelbling et al. - 1994
2   Department of Computer and Information Science (context) - learning, asynchronous et al. - 1957
2   Learning to perceive and act by trial and error (context) - Learning, Whitehead et al. - 1991
1   and Singh (context) - Anal, -- et al. - 1991
1   Overcoming incomplete perception with utile distinction memo.. (context) - Operations, -- - 1993



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://www.cs.brown.edu/research/ai/publications/):   More
On the Complexity of Solving Markov Decision Problems - Littman, Dean, Kaelbling (1995)   (Correct)
Using Goals to Find Plans with High Expected Utility - Jak Kirman (1993)   (Correct)
Localized Temporal Reasoning Using Subgoals And Computational.. - Lin, Dean (1996)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC