|
1134
|
Reinforcement learning: a survey
– Leslie Pack Kaelbling, Michael L. Littman, Andrew W. Moore
- 1996
|
|
2304
|
Learning internal representation by error propagation
– D E Rumelhart, G E Hinton, R J Williams
- 1996
|
|
26
|
Emergent Control and Planning in an Autonomous Vehicle
– Lisa Meeden, Gary Mcgraw, Douglas Blank
- 1993
|
|
1137
|
Learning from delayed rewards
– C J C H Watkins
- 1989
|
|
1313
|
Finding structure in time
– Jeffrey L. Elman
- 1990
|
|
219
|
Temporal Credit Assignment in Reinforcement Learning
– R S Sutton
- 1984
|
|
2442
|
A robust layered control system for a mobile robot
– R A Brooks
- 1986
|
|
178
|
Reinforcement learning for robots using neural networks
– L-J Lin
- 1992
|
|
1965
|
Dynamic Programming
– R Bellman
- 1957
|
|
280
|
Learning in embedded systems
– L P Kaelbling
- 1993
|
|
6
|
Mini board 2.0 technical reference
– F Martin
- 1992
|
|
226
|
On-Line Q-Learning Using Connectionist Systems
– G. A. Rummery, M. Niranjan
- 1994
|
|
364
|
Vehicles: Experiments in Synthetic Psychology
– V Braitenberg
- 1984
|
|
62
|
A stochastic reinforcement learning algorithm for learning real-valued functions
– V Gullapalli
- 1990
|
|
146
|
Vehicles: Experiments in Synthetic
– V Braitenberg
- 1984
|
|
171
|
Evolving Networks: Using the Genetic Algorithm with Connectionist Learning
– Richard K. Belew, John Mcinerney, Nicol N. Schraudolph
- 1990
|
|
649
|
Reinforcement Learning
– Richard S. Sutton, Presented Pirooz Chubak, Dyna Architecture, Dyna Architecture
- 1998
|
|
1163
|
An introduction to genetic algorithms
– M Mitchell
- 1997
|
|
190
|
Learning to Coordinate Behaviors
– P. Maes, R. Brooks, Pattie Maes, Rodney A. Brooks
- 1990
|