• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Mobile robot learning of delayed response tasks through event extraction: A solution to the road sign problem and beyond (2001)

by F Lin˚aker, H Jacobsson
Venue:In Proc. of IJCAI’2001
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 11
Next 10 →

Hierarchical Reinforcement Learning Based on Subgoal Discovery and Subpolicy Specialization

by Bram Bakker, Jürgen Schmidhuber - Proceedings of the 8-th Conference on Intelligent Autonomous Systems, IAS-8 , 2004
"... We introduce a new method for hierarchical reinforcement learning. Highlevel policies automatically discover subgoals; low-level policies learn to specialize on different subgoals. Subgoals are represented as desired abstract observations which cluster raw input data. High-level value functions c ..."
Abstract - Cited by 23 (1 self) - Add to MetaCart
We introduce a new method for hierarchical reinforcement learning. Highlevel policies automatically discover subgoals; low-level policies learn to specialize on different subgoals. Subgoals are represented as desired abstract observations which cluster raw input data. High-level value functions cover the state space at a coarse level; low-level value functions cover only parts of the state space at a fine-grained level. Experiments show that this method outperforms several flat reinforcement learning methods in a deterministic task and in a stochastic task.

Rule Extraction from Recurrent Neural Networks: a Taxonomy and Review

by Henrik Jacobsson - Neural Computation , 2005
"... this paper, the progress of this development is reviewed and analysed in detail. In order to structure the survey and to evaluate the techniques, a taxonomy, specifically designed for this purpose, has been developed. Moreover, important open research issues are identified, that, if addressed pr ..."
Abstract - Cited by 15 (3 self) - Add to MetaCart
this paper, the progress of this development is reviewed and analysed in detail. In order to structure the survey and to evaluate the techniques, a taxonomy, specifically designed for this purpose, has been developed. Moreover, important open research issues are identified, that, if addressed properly, possibly can give the field a significant push forward

Neuromodulation of Reactive Sensorimotor Mappings as a Short-Term Memory Mechanism

by Tom Ziemke, Mikael Thieme - in Delayed Response Tasks,” Adaptive Behavior , 2002
"... ..."
Abstract - Cited by 12 (0 self) - Add to MetaCart
Abstract not found

A Robot that Reinforcement-Learns to Identify and Memorize Important Previous Observations

by Bram Bakker, Viktor Zhumatiy, Gabriel Gruener, Jürgen Schmidhuber - IN PROCEEDINGS OF THE 2003 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS , 2003
"... It is difficult to apply traditional reinforcement learning algorithms to robots, due to problems with large and continuous domains, partial observability, and limited numbers of learning experiences. This paper deals with these problems by combining: 1. reinforcement learning with memory, implement ..."
Abstract - Cited by 9 (0 self) - Add to MetaCart
It is difficult to apply traditional reinforcement learning algorithms to robots, due to problems with large and continuous domains, partial observability, and limited numbers of learning experiences. This paper deals with these problems by combining: 1. reinforcement learning with memory, implemented using an LSTM recurrent neural network whose inputs are discrete events extracted from raw inputs; 2. online exploration and offline policy learning. An experiment with a real robot demonstrates the methodology's feasibility.

Reinforcement Learning in Partially Observable Mobile Robot Domains Using Unsupervised Event Extraction

by Bram Bakker, Fredrik Linåker, Jürgen Schmidhuber - In Proc. IROS’02 , 2002
"... This paper describes how learning tasks in partially observable mobile robot domains can be solved by combining reinforcement learning with an unsupervised learning \event extraction" mechanism, called ARAVQ. ARAVQ transforms the robot's continuous, noisy, high-dimensional sensory input stream into ..."
Abstract - Cited by 7 (2 self) - Add to MetaCart
This paper describes how learning tasks in partially observable mobile robot domains can be solved by combining reinforcement learning with an unsupervised learning \event extraction" mechanism, called ARAVQ. ARAVQ transforms the robot's continuous, noisy, high-dimensional sensory input stream into a compact sequence of high-level events. The resulting hierarchical control system uses an LSTM recurrent neural network as the reinforcement learning component, which learns high-level actions in response to the history of high-level events. The highlevel actions select low-level behaviors which take care of real-time motor control. Illustrative experiments based on a Khepera mobile robot simulator are presented.

Evolving internal memory for T-maze tasks in noisy environments

by Daeeun Kim - Connection Science , 2004
"... In autonomous agent systems, internal memory can be an important element to overcome the limitations of purely reactive agent behaviour. This paper presents an analysis of memory requirements for T-maze tasks well known as the road sign problem. In these tasks a robot agent should make a decision of ..."
Abstract - Cited by 6 (2 self) - Add to MetaCart
In autonomous agent systems, internal memory can be an important element to overcome the limitations of purely reactive agent behaviour. This paper presents an analysis of memory requirements for T-maze tasks well known as the road sign problem. In these tasks a robot agent should make a decision of turning left or right at the T-junction in the approach corridor, depending on a history of perceptions. The robot agent in simulation can sense the light intensity influenced by light lamps placed on the bank of the wall. We apply the evolutionary multiobjective optimization approach to finite state controllers with two objectives, behaviour performance and memory size. Then the internal memory is quantified by counting internal states needed for the T-maze tasks in noisy environments. In particular, we focused on the influence of noise on internal memory and behaviour performance, and it is shown that state machines with variable thresholds can improve the performance with a hysteresis effect to filter out noise. This paper also provides an analysis of noise effect on perceptions and its relevance on performance degradation in state machines. keywords: T-maze, delayed response task, evolutionary robotics, finite state machines, evolutionary multiobjective optimization, internal memory 1 1

Self-Organized Modulation of a Neural Robot Controller

by Nicklas Bergfeldt, Fredrik Linåker , 2002
"... We show how a simple layered system can selforganize into a set of distinct states and qualitatively different behaviors as a result of learning a robotic delayed response task. Our approach is based on an architecture where higher levels are able to dynamically modulate the lower reactive mapping w ..."
Abstract - Cited by 4 (1 self) - Add to MetaCart
We show how a simple layered system can selforganize into a set of distinct states and qualitatively different behaviors as a result of learning a robotic delayed response task. Our approach is based on an architecture where higher levels are able to dynamically modulate the lower reactive mapping when needed.

D.: Experiments with reservoir computing on the road sign problem

by Eric Aislan Antonelo, Benjamin Schrauwen, Dirk Stroob - In: Brazilian Congress on Neural Networks (CBRN) (2007
"... Abstract — The road sign problem is tackled in this work with Reservoir Computing (RC) networks. These networks are made of a fixed recurrent neural network where only a readout layer is trained. In the road sign problem, an agent has to decide at some point in time which action to take given releva ..."
Abstract - Cited by 3 (3 self) - Add to MetaCart
Abstract — The road sign problem is tackled in this work with Reservoir Computing (RC) networks. These networks are made of a fixed recurrent neural network where only a readout layer is trained. In the road sign problem, an agent has to decide at some point in time which action to take given relevant information gathered in the past. We show that RC can handle simple and complex T-maze tasks (which are a subdomain of the road sign problem). Keywords — Reservoir Computing, road sign problem, T-maze, long-term memory. 1

From Time-Steps To Events And Back

by Fredrik Linåker , 2001
"... We present a design for general layered control systems capable of learning delayed response tasks with arbitrarily long delays. The approach is based on extracting events from the time-step based input stream and reacting to these events using top-down modulation. This modulation can bias the outpu ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
We present a design for general layered control systems capable of learning delayed response tasks with arbitrarily long delays. The approach is based on extracting events from the time-step based input stream and reacting to these events using top-down modulation. This modulation can bias the outputs in some direction and lets the system focus on a subset of the available sensory array. To avoid information distortion as inputs passes through layers, all layers are connected directly to the input, like in the Subsumption Architecture. However, here higher layers also have access to an additional information channel from lower layers, specifying whether the particular input seems relevant. Thereby higher layers can lay out more long-term plans without distractions, only being interrupted or notified when important input conditions arise.

Mobile Robot Control in the Road Sign Problem using Reservoir Computing Networks

by Eric Antonelo, Benjamin Schrauwen, Dirk Stroobandt , 2008
"... In this work we tackle the road sign problem with Reservoir Computing (RC) networks. The T-maze task (a particular form of the road sign problem) consists of a robot in a T-shaped environment that must reach the correct goal (left or right arm of the T-maze) depending on a previously received input ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
In this work we tackle the road sign problem with Reservoir Computing (RC) networks. The T-maze task (a particular form of the road sign problem) consists of a robot in a T-shaped environment that must reach the correct goal (left or right arm of the T-maze) depending on a previously received input sign. It is a control task in which the delay period between the sign received and the required response (e.g., turn right or left) is a crucial factor. Delayed response tasks like this one form a temporal problem that can be handled very well by RC networks. Reservoir Computing is a biologically plausible technique which overcomes the problems of previous algorithms such as Backpropagation Through Time- which exhibits slow (or non-) convergence on training. RC is a new concept that includes a fast and efficient training algorithm. We show that this simple approach can solve the T-maze task efficiently.
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University