(Enter summary)
Abstract: The central thesis of this article is that memory-based methods provide
natural and powerful mechanisms for high-autonomy learning control. This
paper takes the form of a survey of the ways in which memory-based methods
can and have been applied to control tasks, with an emphasis on tasks
in robotics and manufacturing. We explain the various forms that control
tasks can take, and how this impacts on the choice of learning algorithm.
We show a progression of five increasingly more complex... (Update)
Context of citations to this paper: More
...environment. Several methods have been proposed in this direction : Learning by demonstrations or from examples, Memory based learning [19, 1, 13] , Imitation [12, 11, 2, 14, 7] or Supervised Learning [21, 20] Those approaches focus on the learning of complex action sequences...
...experience to the wrong states. In the learning methods above, we can distinguish between representational tools and learning paradigms [23]. The representational tools in VRDP and parti game are kd trees. They are used to partition the state space, approximate the value...
Cited by: More
Reinforcement Learning in the Joint Space: Value Iteration in.. - Monson (2003)
(Correct)
Continual Learning for Mobile Robots - Großmann (2001)
(Correct)
Learning a Navigation Task in Changing Environments by.. - Grossmann, Poli
(Correct)
Similar documents (at the sentence level):
20.7%: Locally Weighted Learning for Control - Atkeson, Moore, Schaal (1996)
(Correct)
5.3%: Robot Learning by Nonparametric Regression - Schaal, Atkeson (1995)
(Correct)
5.1%: Memory-Based Neural Networks For Robot Learning - Atkeson, Schaal (1995)
(Correct)
Active bibliography (related documents): More All
0.6: Efficient Memory-based Learning for Robot Control - Moore (1990)
(Correct)
0.5: Connectionist Adaptive Control - Jervis (1993)
(Correct)
0.4: The Parti-game Algorithm for Variable Resolution.. - Moore, Atkeson (1995)
(Correct)
Similar documents based on text: More All
0.7: Local Dimensionality Reduction - Schaal, Vijayakumar, Atkeson (1998)
(Correct)
0.6: Scalable Techniques from Nonparametric Statistics for.. - Schaal, Atkeson.. (2000)
(Correct)
0.6: Locally Weighted Learning - Christopher G. Atkeson, Andrew W.. (1996)
(Correct)
Related documents from co-citation: More All
9: Learning to predict by the method of temporal differences
- Sutton - 1988
8: The parti-game algorithm for variable resolution reinforcement learning in multi..
- Moore, Atkeson - 1995
8: Incremental multi-step Q-learning
- Peng, Williams - 1996
BibTeX entry: (Update)
Moore, A.W., Atkeson, C.G, and Schaal, S. (1995). Memory-based learning for control. Technical Report CMU-RI-TR-95-18, CMU Robotics Institute, 1995. http://citeseer.ist.psu.edu/article/moore95memorybased.html More
@misc{ moore95memorybased,
author = "A. Moore and C. Atkeson and S. Schaal",
title = "Memory-based learning for control",
text = "Moore, A.W., Atkeson, C.G, and Schaal, S. (1995). Memory-based learning
for control. Technical Report CMU-RI-TR-95-18, CMU Robotics Institute, 1995.",
year = "1995",
url = "citeseer.ist.psu.edu/article/moore95memorybased.html" }
Citations (may not include all citations):
658
Learning from Delayed Rewards (context) - Watkins - 1989
508
Computational Geometry (context) - Preparata, Shamos - 1985
482
Iterative Solution of Nonlinear Equations in Several Variabl.. (context) - Ortega, Rheinboldt - 1970
408
Princeton University Press (context) - Bellman - 1957
175
Parallel and Distributed Computation (context) - Bertsekas, Tsitsiklis - 1989
162
Prioritized Sweeping: Reinforcement Learning with Less Data ..
- Moore, Atkeson - 1993
141
Temporal Credit Assignment in Reinforcement Learning (context) - Sutton - 1984
133
and Reacting Based on Approximating Dynamic Programming (context) - Sutton, for et al. - 1990
133
and Reacting Based on Approximating Dynamic Programming (context) - Sutton, for et al. - 1990
116
Forward Models: Supervised Learning with a Distal Teacher
- Jordan, Rumelhart - 1992
111
Active learning with statistical models
- Cohn, Ghahramani et al. - 1995
84
Real-time Learning and Control using Asynchronous Dynamic Pr.. (context) - Barto, Bradtke et al. - 1994
75
BOXES: An Experiment in Adaptive Control (context) - Michie, Chambers - 1968
75
Combining Instance-Based and Model-Based Learning
- Quinlan - 1993
57
Efficient Algorithms with Neural Network Behaviour (context) - Omohundro - 1987
52
Variable Resolution Dynamic Programming: Efficiently Learnin.. (context) - Moore - 1991
50
Efficient Learning and Planning Within the Dyna Framework
- Peng, Williams - 1993
46
Elementary Numerical Analysis (context) - Conte, De Boor - 1980
40
Bumptrees for Efficient Function (context) - Omohundro - 1991
37
Robot Juggling: An Implementation of Memory-based Learning (context) - Schaal, Atkeson - 1994
35
Multiresolution Instance-based Learning (context) - Deng, Moore - 1995
35
Optimum Systems Control (context) - Sage, White - 1977
32
An Empirical Investigation of Brute Force to choose Features
- Moore, Hill et al. - 1992
31
Using Local Models to Control Movement (context) - Atkeson - 1989
30
Enhancing Transfer in Reinforcement Learning by Building Sto.. (context) - Mahadevan - 1992
28
Real-Time Application of Neural Networks for Sensor-Based Co.. (context) - Miller - 1989
28
Acquisition of Dynamic Control Knowledge for a Robotic Manip.. (context) - Moore - 1990
24
Robust Adaptive Control by Learning only Forward Models (context) - Moore - 1992
18
Department of Computer Science
- Kaelbling, Embedded et al. - 1990
17
Bayesian Model Comparison and Backprop Nets
- MacKay - 1992
16
Application of a General Learning Algorithm to the Control o.. (context) - Miller, Glanz et al. - 1987
16
Behaviour and Robotics (context) - Albus - 1981
15
Using Local Trajectory Optimizers to Speed up Global Optimiz..
- Atkeson - 1994
14
TaskLevel Robot Learning: Juggling a Tennis Ball More Accura.. (context) - Aboaf, Drucker et al. - 1989
9
LOESS: Multivariate Smoothing by Moving Least Squares (context) - Grosse - 1989
7
Reliability estimation for neural network based autonomous d.. (context) - Pomerleau - 1994
6
Knowledge of Knowledge and Intelligent Experimentation for L.. (context) - Moore - 1991
6
COINS Technical Report (context) - Barto, Sutton et al. - 1989
4
MURPHY: A Connectionist Approach to VisionBased Robot Motion.. (context) - Mel - 1989
4
Assessing the Quality of Local Linear Models (context) - Schaal, Atkeson - 1994
1
Lazy learning with Locally weighted regression: A survey of .. (context) - Atkeson, Moore et al. - 1995
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://www.cs.cmu.edu/afs/cs/user/awm/web/papers.html): More
Efficient Locally Weighted Polynomial Regression Predictions - Moore, Schneider, Deng
(Correct)
The Parti-game Algorithm for Variable Resolution.. - Moore, Atkeson (1995)
(Correct)
Prioritized Sweeping: Reinforcement Learning with Less Data.. - Moore, Atkeson (1993)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC