
Hierarchical Solution of Markov Decision Processes using Macro-actions (1998)  (49 citations)
Milos Hauskrecht, Nicolas Meuleau, Leslie Pack Kaelbling, Thomas Dean Computer ...



View or download:
brown.edu/people/t...echtetalUAI98.pdf

From:  brown.edu/people/t...publications

Abstract: MDP for a four-room example. Grey circles mark peripheral states of the original MDP, i.e. states of the abstract MDP.

Cited by:
Journal of Artificial Intelligence Research 22 (2004).. - Michael Bowling
Existence of Multiagent Equilibria with Limited Agents - Michael Bowling
Journal of Machine Learning Research 7 (2006) 2259-2301.. - Anders Jonsson

Similar documents (at the sentence level):
77.1%:   Hierarchical Solution of Markov Decision Processes .. - Hauskrecht.. (1998)

Active bibliography (related documents):
0.8:   Planning with macro-actions: Effect of initial value function.. - Hauskrecht
0.4:   Stochastic Dynamic Programming with Factored Representations - Boutilier, Dearden et al. (1999)
0.3:   Decision Theoretic Planning: Structural Assumptions and.. - Boutilier, Dean, Hanks (1999)


Related documents from co-citation:
18:   Decomposition techniques for planning in stochastic domains - Dean, Lin - 1995
16:   Decision Theoretic Planning: Structural Assumptions and Computational Leverage - Boutilier, Dean et al.
16:   Reinforcement learning with hierarchies of machines - Parr, Russell - 1997

BibTeX entry:

Hauskrecht, M., Meuleau, N., Boutilier, C., Kaelbling, L. P. & Dean, T. (1998). Hierarchical solution of Markov decision processes using macro-actions. In Proceedings of the Fourteenth Annual Conference on Uncertainty in Artificial Intelligence. http://citeseer.ist.psu.edu/hauskrecht98hierarchical.html

@inproceedings{ hauskrechthierarchical,
    author = "Milos Hauskrecht and Nicolas Meuleau and Leslie Pack Kaelbling and Thomas Dean and Craig Boutilier",
    title = "Hierarchical Solution of {Markov} Decision Processes using Macro-actions",
    booktitle = "Proceedings of the Fourteenth Annual Conference on Uncertainty in Artificial Intelligence",
    year = "1998",
    pages = "220--229",
    url = "http://citeseer.ist.psu.edu/hauskrecht98hierarchical.html" }
Citations (may not include all citations):
413   Neuro-dynamic Programming - Bertsekas, Tsitsiklis - 1996
408   Dynamic Programming (Princeton University Press) - Bellman - 1957
300   SOAR: An architecture for general intelligence - Laird, Newell et al. - 1987
291   Markov Decision Processes: Discrete Stochastic Dynamic Progr.. - Puterman - 1994
268   Dynamic Programming and Markov Processes - Howard - 1960
225   Learning and executing generalized robot plans - Fikes, Hart et al. - 1972
136   Exploiting structure in policy construction - Boutilier, Dearden et al. - 1995
90   Planning under time constraints in stochastic domains - Dean, Kaelbling et al. - 1995
87   Reinforcement learning with hierarchies of machines - Parr, Russell - 1998
71   Macro-operators: A weak method for learning - Korf - 1985
70   Abstraction and approximate decision theoretic planning - Dearden, Boutilier - 1997
63   Decomposition techniques for planning in stochastic domains - Dean, Lin - 1995
51   Model minimization in Markov decision processes - Dean, Givan - 1997
51   Finding structure in reinforcement learning - Thrun, Schwartz - 1995
40   Selectively generalizing plans for problem solving - Minton - 1985
39   Multi-time models for temporally abstract planning - Precup, Sutton - 1998
25   TD models: Modeling the world at a mixture of time scales - Sutton - 1995
22   Flexible Decomposition Algorithms for Weakly Coupled Markov .. - Parr - 1998
15   Multilayer control of large Markov chains - Forestier, Varaiya - 1978
15   Theoretical results on reinforcement learning with temporall.. - Precup, Sutton et al. - 1998
11   Decomposition of systems governed by Markov chains - Kushner, Chen - 1974
5   Planning with temporally abstract actions - Hauskrecht - 1998
3   Hierarchical control and learning with hierarchies of machin.. - Parr - 1998
3   Hierarchical reinforcement learning: Preliminary results - Kaelbling - 1993





Documents on the same site (http://www.cs.brown.edu/people/tld/pages/publications.html):
Equivalence Notions and Model Minimization in Markov Decision Processes
Solving Factored MDPs via Non-Homogeneous Partitioning - Kee-Eung Kim And
