Developing navigation behavior through self-organizing distinctive state abstraction
Jefferson Provost, Benjamin J....



View or download:
utexas.edu/~pbeeso...ostconnsci06.pdf

From: utexas.edu/~pbeeson/_papers_/

Abstract: A major challenge in reinforcement learning research is to extend methods that have worked well on discrete, short-range, low-dimensional problems to continuous, high-diameter, high-dimensional problems, such as robot navigation using high-resolution sensors. Self-Organizing Distinctive-state Abstraction (SODA) is a new, generic method by which a robot in a continuous world can better learn to navigate by learning a set of high-level features and building temporally-extended actions to...

Active bibliography (related documents):
0.5:   AGILO RoboCuppers 2004 - Stulp, Kirsch, Gedikli, Beetz (2004)
0.4:   Toward Learning the Causal Layer of the Spatial Semantic .. - Provost, Beeson, Kuipers (2001)
0.2:   Recent Advances in Hierarchical Reinforcement Learning - Barto, Mahadevan (2003)

BibTeX entry:

@misc{ state-developing,
  author = "Jefferson Provost and Benjamin J....",
  title = "Developing Navigation Behavior Through Self-Organizing Distinctive State Abstraction",
  url = "citeseer.ist.psu.edu/748580.html" }
Citations (may not include all citations):
1213   Self-Organizing Maps - Kohonen - 1995
614   Reinforcement Learning: An Introduction - Sutton, Barto - 1998
116   A growing neural gas network learns topologies - Fritzke - 1995
76   Reinforcement Learning with Selective Perception and Hidden .. - McCallum - 1995
57   The Spatial Semantic Hierarchy - Kuipers - 2000
39   Continual Learning in Reinforcement Environments - Ring - 1994
27   Three-dimensional neural net for learning visuomotor coordin.. - Martinetz, Ritter et al. - 1990
25   playerstage project Tool multi robot and distributed sensor .. - Vaughan, player et al. - 2003
22   Practical reinforcement learning in continuous spaces - Smart, Kaelbling - 2000
12   Automatic discovery of subgoals in reinforcement learning us.. - McGovern, Barto - 2001
11   Temporal abstraction in reinforcement learning - Precup - 2000
10   Discovering hierarchy in reinforcement learning with hexq - Hengst - 2002
9   Learning hierarchical control structure for multiple tasks a.. - Digney - 1998
7   Mapbuilding using self-organizing networks in really useful .. - Nehmzow, Smithers - 1991
7   Map learning with uninterpreted sensors and effectors - Pierce, Kuipers - 1997
4   Performance comparison of landmark recognition systems for n.. - Duckett, Nehmzow - 2000
2   Using abstract models of behaviours to automatically generat.. - Ryan - 2002
1   Decomposing infants' object representations: A dual-route pr.. - Schlesinger - 2006
1   Developmental robotics - Schmidhuber - 2006
1   Learning acceptable windows of contingency - Gold, Scassellati - 2006
1   From unknown sensors and actuators to actions grounded in se.. - Olsson, Nehaniv et al. - 2006
1   Bootstrap learning of foundational representations - Kuipers, Beeson et al. - 2006
1   The discovery of communication - Oudeyer, Kaplan - 2006
1   Applications of the self-organizing map to reinforcement lea.. - Smith - 2002
1   Learning a world model and planning with a self-organizing - Toussaint - 2004
1   Towards autonomous sensor and actuator model induction on a .. - Stronger, Stone - 2006
1   Introduction to the special issue on developmental robotics - Blank, Meeden - 2006
1   Constructivist learning: A neural implementation of the sche.. - Chaput, Kuipers et al. - 2003
1   Toward learning the causal layer of the spatial semantic hie.. - Provost, Beeson et al. - 2001
1   Between MDPs and SMDPs: A framework for temporal abstraction.. - Sutton, Precup et al. - 1999
