(Enter summary)
Abstract: This paper describes the application of reinforcement learning (RL)
to the difficult real world problem of elevator dispatching. The elevator
domain poses a combination of challenges not seen in most
RL research to date. Elevator systems operate in continuous state
spaces and in continuous time as discrete event dynamic systems.
Their states are not fully observable and they are nonstationary
due to changing passenger arrival rates. In addition, we use a team
of RL agents, each of which is... (Update)
Cited by: More
Planning In Hybrid Structured Stochastic - Domains Comenius University
(Correct)
Journal of Machine Learning Research 7 (2006) 1789--1828 .. - Payoff Propagation Jelle
(Correct)
Co-Evolution in the Successful Learning of Backgammon Strategy - Pollack, Blair (1998)
(Correct)
Similar documents (at the sentence level):
40.3%: Elevator Group Control Using Multiple Reinforcement Learning.. - Crites, Barto (1998)
(Correct)
40.3%: Large-Scale Dynamic Optimization Using Teams of Reinforcement.. - Crites (1996)
(Correct)
Active bibliography (related documents): More All
0.2: Task-Oriented Reinforcement Learning - Kamal (2003)
(Correct)
0.1: Mitsubishi Electric Research Laboratories - Http Www Merl (2004)
(Correct)
0.0: Reinforcement Learning: A Survey - Leslie Pack Kaelbling, Michael L.. (1996)
(Correct)
Similar documents based on text: More All
0.7: Planning and Control Models for Elevators in High-Rise Buildings - Siikonen (1997)
(Correct)
0.6: Intelligent Elevator Control By Ordinal Structure Fuzzy.. - Khiang, Khalid, Yusof
(Correct)
0.4: From Theory to Practice: AI Planning for High Performance.. - Koehler (2001)
(Correct)
Related documents from co-citation: More All
33: Practical Issues in Temporal Difference Learning
- Tesauro - 1992
31: Learning to predict by the method of temporal differences
- Sutton - 1988
29: Learning from Delayed Rewards (context) - CJCH - 1989
BibTeX entry: (Update)
Crites R. H. and A. G. Barto, "Improving Elevator Performance Using Reinforcement Learning", in D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo (Eds.) Advances in Neural Information Processing Systems. http://citeseer.ist.psu.edu/crites96improving.html More
@inproceedings{ crites96improving,
author = "Robert H. Crites and Andrew G. Barto",
title = "Improving Elevator Performance Using Reinforcement Learning",
booktitle = "Advances in Neural Information Processing Systems",
volume = "8",
publisher = "The {MIT} Press",
editor = "David S. Touretzky and Michael C. Mozer and Michael E. Hasselmo",
pages = "1017--1023",
year = "1996",
url = "citeseer.ist.psu.edu/crites96improving.html" }
Citations (may not include all citations):
658
Learning from Delayed Rewards (context) - Watkins - 1989
219
Practical Issues in Temporal Difference Learning
- Tesauro - 1992
110
Temporal Difference Learning and TD--Gammon (context) - Tesauro - 1995
34
Reinforcement Learning Methods for Continuous--Time Markov D..
- Bradtke, Duff - 1995
24
TD--Gammon (context) - Tesauro - 1994
6
Elevator Dispatchers for Down Peak Traffic (context) - Bao, Cassandras et al. - 1994
4
A Dynamic Load Balancing Approach to the Control of Multiser.. (context) - Lewis - 1991
The graph only includes citing articles where the year of publication is known.
Documents on the same site (http://www.cs.cmu.edu/Groups/NIPS/NIPS95/Papers.html): More
A Neural Network Autoassociator for Induction.. - Petsche.. (1996)
(Correct)
Dynamics of Attention as Near Saddle-Node Bifurcation Behavior - Nakahara, Doya (1996)
(Correct)
Empirical Entropy Manipulation for Real-World Problems - Viola, Schraudolph, Sejnowski (1996)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC