(Enter summary)
Abstract: This paper introduces an integration of reinforcement learning and behavior-based control designed to produce real-time learning in situated agents. The model layers a distributed and asynchronous reinforcement learning algorithm over a learned topological map and standard behavioral substrate to create a reinforcement learning complex. The topological map creates a small and task-relevant state space that aims to make learning feasible, while the distributed and asynchronous aspects of the... (Update)
Cited by: More
Anticipatory Learning for Focusing Search in Reinforcement.. - Konidaris, Hayes (2004)
(Correct)
Active bibliography (related documents): More All
3.6: Behaviour-Based Reinforcement Learning - Konidaris (2003)
(Correct)
0.5: Estimating Future Reward in Reinforcement Learning Animats.. - Konidaris, Hayes (2004)
(Correct)
0.5: Self-Organized Robotic System Design and Autonomous Odor.. - Hayes
(Correct)
Similar documents based on text: More All
0.1: Learning in a State of Confusion: Perceptual Aliasing in Grid.. - Crook, Hayes (2003)
(Correct)
0.1: Could Active Perception Aid Navigation of Partially Observable .. - Crook, Hayes
(Correct)
0.1: Intrinsically Motivated Reinforcement Learning: A.. - Stout, Konidaris, Barto (2005)
(Correct)
BibTeX entry: (Update)
G.D. Konidaris and G.M. Hayes. An Architecture for Behavior-Based Reinforcement Learning. To appear, Adaptive Behavior, 2004. http://citeseer.ist.psu.edu/konidaris05architecture.html More
@article(konidaris05architecture,
title = "An Architecture for Behavior-Based Reinforcement Learning",
author = "G.D. Konidaris and G.M. Hayes",
journal = "Adaptive Behavior",
volume = 13,
number = 1,
year = 2005,
pages = "5---32"
)
Citations (may not include all citations):
976
Machine learning (context) - Mitchell - 1997
700
Self-organization and associative memory (context) - Kohonen - 1989
614
Reinforcement learning: An introduction
- Sutton, Barto - 1998
477
Intelligence without representation
- Brooks - 1991
348
Parallel and distributed computation: Numerical methods (context) - Bertsekas, Tsitsiklis - 1989
281
Machine Learning (context) - Watkins, Dayan - 1992
257
Learning to act using real-time dynamic programming
- Barto, Bradtke et al. - 1995
183
Automatic programming of behavior-based robots using reinfor..
- Mahadevan, Connell - 1992
125
Learning to coordinate behaviors
- Maes, Brooks - 1990
92
What are plans (context) - Agre, Chapman - 1990
67
Reinforcement learning architectures for animats (context) - Sutton - 1990
59
Reward functions for accelerated learning
- Mataric - 1994
51
Reinforcement learning in the multi-robot domain
- Mataric - 1997
42
Action selection methods using reinforcement learning
- Humphrys - 1996
40
Planning by incremental dynamic programming
- Sutton - 1991
35
An approach to anytime learning
- Grefenstette, Ramsey - 1992
25
Using local information in a non-local way for mapping graph..
- Dudek, Freedman et al. - 1993
22
Practical reinforcement learning in continuous spaces
- Smart, Kaelbling - 2000
20
The artificial evolution of adaptive behaviour
- Harvey - 1995
19
Planning is just a way of avoiding figuring out what to do n.. (context) - Brooks - 1987
17
Evolutionary algorithms for reinforcement learning
- Moriarty, Schultz et al. - 1999
12
Layered learning
- Stone, Veloso - 2000
11
Reward and diversity in multirobot foraging
- Balch - 1999
9
The role of learning in autonomous robots
- Brooks - 1991
9
Clay: Integrating motor schemas and reinforcement learning
- Balch - 1997
9
Learning hierarchical control structures for multiple tasks ..
- Digney - 1998
7
A self-organising network that grows when required (context) - Marsland, Shapiro et al. - 2002
6
Topological simultaneous localization and mapping
- Choset, Nagatani - 2001
6
Khepera user manual (context) - SA - 1999
6
Concurrent layered learning
- Whiteson, Stone - 2003
5
Learning a distributed map representation based on navigatio.. (context) - Mataric, Brooks - 1990
5
Learning in a state of confusion: Perceptual aliasing in gri..
- Crook, Hayes - 2003
3
vision turret user manual (context) - SA - 1999
3
Modularity and specialized learning: Reexamining behavior-ba..
- Bryson - 2002
3
Reinforcement landmark learning (context) - Toombs, Phillips et al. - 1998
2
Polarization compass for robot navigation (context) - Schmolke, Mallot - 2002
2
Behaviour-based reinforcement learning
- Konidaris - 2003
2
Integrating RL and behavior-based control for soccer
- Balch - 1997
2
Applications of the self-organising map to reinforcement lea.. (context) - Smith - 2002
Documents on the same site (http://www-all.cs.umass.edu/~gdk/publications.html): More
Intrinsically Motivated Reinforcement Learning: A.. - Stout, Konidaris, Barto (2005)
(Correct)
Estimating Future Reward in Reinforcement Learning Animats.. - Konidaris, Hayes (2004)
(Correct)
Anticipatory Learning for Focusing Search in Reinforcement.. - Konidaris, Hayes (2004)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC