(Enter summary)
Abstract: We present a new approach to reinforcement learning in which the policies considered by the learning process are constrained by hierarchies of partially specified machines. This allows for the use of prior knowledge to reduce the search space and provides a framework in which knowledge can be transferred across problems and in which component solutions can be recombined to solve larger and more complicated problems. Our approach can be seen as providing a link between reinforcement learning and ... (Update)
Cited by: More
TTree: Tree-Based State Generalization with - Tempor Al Ly
(Correct)
Journal of Machine Learning Research 7 (2006) 2259-2301.. - Anders Jonsson Anders
(Correct)
Reinforcement Learning Of Dialogue Strategies - With Hierarchic Al
(Correct)
Similar documents (at the sentence level):
54.6%: Reinforcement Learning with Hierarchies of Machines - Ron Parr (1997)
(Correct)
12.0%: Hierarchical Control and Learning for Markov Decision Processes - Parr (1998)
(Correct)
Active bibliography (related documents): More All
0.2: A Unifying Framework for Temporal Abstraction in Stochastic.. - Ronald Parr
(Correct)
0.2: Programmable Reinforcement Learning Agents - Andre, Russell (2000)
(Correct)
0.1: Symbolic Performance & Learning In Continuous-valued Environments - Rogers (1997)
(Correct)
Similar documents based on text: More All
0.1: Directed Sonar Sensing for Mobile Robot Navigation - Leonard, Durrant-Whyte (1992)
(Correct)
0.1: Approximating Optimal Policies for Partially Observable.. - Parr, Russell (1995)
(Correct)
0.0: Reinforcement Learning Using Approximate Belief States - Rodriguez, Parr, Koller (1999)
(Correct)
Related documents from co-citation: More All
36: Hierarchical reinforcement learning with the MAXQ value function decomposition
- Dietterich - 1998
28: Reinforcement learning: An introduction
- Sutton, Barto - 1998
27: Feudal reinforcement learning
- Dayan, Hinton - 1993
BibTeX entry: (Update)
Parr, R. & Russell, S. (1997). Reinforcement learning with hierarchies of machines. In Proceedings of Advances in Neural Information Processing Systems 10. MIT Press. http://citeseer.ist.psu.edu/parr97reinforcement.html More
@inproceedings{ parr97reinforcement,
author = "Ronald Parr and Stuart Russell",
title = "Reinforcement Learning with Hierarchies of Machines",
booktitle = "Advances in Neural Information Processing Systems",
volume = "10",
publisher = "The {MIT} Press",
editor = "Michael I. Jordan and Michael J. Kearns and Sara A. Solla",
year = "1997",
url = "citeseer.ist.psu.edu/parr97reinforcement.html" }
Citations (may not include all citations):
1364
A robust layered control system for a mobile robot (context) - Brooks - 1986
348
Parallel and Distributed Computation: Numerical Methods (context) - Bertsekas, Tsitsiklis - 1989
190
Plan: the Open Planning Architecture (context) - Currie, Tate - 1991
107
the convergence of stochastic iterative dynamic programming ..
- Jaakkola, Jordan et al. - 1994
101
Reinforcement Learning for Robots Using Neural Networks (context) - Lin - 1993
94
Feudal reinforcement learning
- Dayan, Hinton - 1993
83
Hierarchical reinforcement learning with the MAXQ value func..
- Dietterich - 1997
73
Transfer of learning by composing solutions of elemental seq..
- Singh - 1992
63
Decomposition techniques for planning in stochastic domains
- Dean, Lin - 1995
39
Multi-time models for temporally abstract planning
- Precup, Sutton
39
Reinforcement learning with soft state aggregation
- Singh, Jaakola et al. - 1995
24
Model reduction techniques for computing approximately optim..
- Dean, Givan et al. - 1997
18
Roles of macro-actions in accelerating reinforcement learnin..
- McGovern, Sutton et al. - 1997
16
Scaling reinforcement learning algorithms by learning variab..
- Singh - 1992
9
planning and learning in an autonomous agent (context) - Benson, Nilsson - 1995
7
Temporal abstraction in reinforcement learning (context) - Sutton - 1995
1
Computer Science Department (context) - Lin, for et al. - 1997
1
Synthesizing efficient agents from partial programs (context) - Hsu - 1991
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://robotics.stanford.edu/~parr/index.html): More
Hierarchical Control and Learning for Markov Decision Processes - By Ronald
(Correct)
A Unifying Framework for Temporal Abstraction in Stochastic.. - Ronald Parr
(Correct)
Computing factored value functions for policies in structured.. - Koller, Parr (1999)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC