See this document in CiteSeerX!

Reinforcement Learning with Hierarchies of Machines (1997)  (Make Corrections)  (87 citations)
Ronald Parr, Stuart Russell
Advances in Neural Information Processing Systems



  Home/Search   Context   Related

 
View or download:
stanford.edu/~parr/hamnips97.ps.gz
berkeley.edu/~parr/hamnips97.ps.gz
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  stanford.edu/~parr/index (more)
From:  berkeley.edu/~parr/
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: We present a new approach to reinforcement learning in which the policies considered by the learning process are constrained by hierarchies of partially specified machines. This allows for the use of prior knowledge to reduce the search space and provides a framework in which knowledge can be transferred across problems and in which component solutions can be recombined to solve larger and more complicated problems. Our approach can be seen as providing a link between reinforcement learning and ... (Update)

Cited by:   More
TTree: Tree-Based State Generalization with - Tempor Al Ly   (Correct)
Journal of Machine Learning Research 7 (2006) 2259-2301.. - Anders Jonsson Anders   (Correct)
Reinforcement Learning Of Dialogue Strategies - With Hierarchic Al   (Correct)

Similar documents (at the sentence level):
54.6%:   Reinforcement Learning with Hierarchies of Machines - Ron Parr (1997)   (Correct)
12.0%:   Hierarchical Control and Learning for Markov Decision Processes - Parr (1998)   (Correct)

Active bibliography (related documents):   More   All
0.2:   A Unifying Framework for Temporal Abstraction in Stochastic.. - Ronald Parr   (Correct)
0.2:   Programmable Reinforcement Learning Agents - Andre, Russell (2000)   (Correct)
0.1:   Symbolic Performance & Learning In Continuous-valued Environments - Rogers (1997)   (Correct)

Similar documents based on text:   More   All
0.1:   Directed Sonar Sensing for Mobile Robot Navigation - Leonard, Durrant-Whyte (1992)   (Correct)
0.1:   Approximating Optimal Policies for Partially Observable.. - Parr, Russell (1995)   (Correct)
0.0:   Reinforcement Learning Using Approximate Belief States - Rodriguez, Parr, Koller (1999)   (Correct)

Related documents from co-citation:   More   All
36:   Hierarchical reinforcement learning with the MAXQ value function decomposition - Dietterich - 1998
28:   Reinforcement learning: An introduction - Sutton, Barto - 1998
27:   Feudal reinforcement learning - Dayan, Hinton - 1993

BibTeX entry:   (Update)

Parr, R. & Russell, S. (1997). Reinforcement learning with hierarchies of machines. In Proceedings of Advances in Neural Information Processing Systems 10. MIT Press. http://citeseer.ist.psu.edu/parr97reinforcement.html   More

@inproceedings{ parr97reinforcement,
    author = "Ronald Parr and Stuart Russell",
    title = "Reinforcement Learning with Hierarchies of Machines",
    booktitle = "Advances in Neural Information Processing Systems",
    volume = "10",
    publisher = "The {MIT} Press",
    editor = "Michael I. Jordan and Michael J. Kearns and Sara A. Solla",
    year = "1997",
    url = "citeseer.ist.psu.edu/parr97reinforcement.html" }
Citations (may not include all citations):
1364   A robust layered control system for a mobile robot (context) - Brooks - 1986
348   Parallel and Distributed Computation: Numerical Methods (context) - Bertsekas, Tsitsiklis - 1989
190   Plan: the Open Planning Architecture (context) - Currie, Tate - 1991
107   the convergence of stochastic iterative dynamic programming .. - Jaakkola, Jordan et al. - 1994
101   Reinforcement Learning for Robots Using Neural Networks (context) - Lin - 1993
94   Feudal reinforcement learning - Dayan, Hinton - 1993
83   Hierarchical reinforcement learning with the MAXQ value func.. - Dietterich - 1997
73   Transfer of learning by composing solutions of elemental seq.. - Singh - 1992
63   Decomposition techniques for planning in stochastic domains - Dean, Lin - 1995
39   Multi-time models for temporally abstract planning - Precup, Sutton
39   Reinforcement learning with soft state aggregation - Singh, Jaakola et al. - 1995
24   Model reduction techniques for computing approximately optim.. - Dean, Givan et al. - 1997
18   Roles of macro-actions in accelerating reinforcement learnin.. - McGovern, Sutton et al. - 1997
16   Scaling reinforcement learning algorithms by learning variab.. - Singh - 1992
9   planning and learning in an autonomous agent (context) - Benson, Nilsson - 1995
7   Temporal abstraction in reinforcement learning (context) - Sutton - 1995
1   Computer Science Department (context) - Lin, for et al. - 1997
1   Synthesizing efficient agents from partial programs (context) - Hsu - 1991



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://robotics.stanford.edu/~parr/index.html):   More
Hierarchical Control and Learning for Markov Decision Processes - By Ronald   (Correct)
A Unifying Framework for Temporal Abstraction in Stochastic.. - Ronald Parr   (Correct)
Computing factored value functions for policies in structured.. - Koller, Parr (1999)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC