See this document in CiteSeerX!

Improving Elevator Performance Using Reinforcement Learning (1996)  (Make Corrections)  (124 citations)
Robert Crites, Andrew Barto
Advances in Neural Information Processing Systems



  Home/Search   Context   Related

 
View or download:
umass.edu/pub/anw/pub/crit...nips8.ps.Z
umass.edu/~mharmon/rltuto...crites.ps.Z
umass.edu/cgibin/getfile/...nips8.ps.Z
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  cmu.edu/Groups/NIPS/NIPS...Papers (more)
From:  purdue.edu/~lrn/projects/Instr...
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: This paper describes the application of reinforcement learning (RL) to the difficult real world problem of elevator dispatching. The elevator domain poses a combination of challenges not seen in most RL research to date. Elevator systems operate in continuous state spaces and in continuous time as discrete event dynamic systems. Their states are not fully observable and they are nonstationary due to changing passenger arrival rates. In addition, we use a team of RL agents, each of which is... (Update)

Cited by:   More
Planning In Hybrid Structured Stochastic - Domains Comenius University   (Correct)
Journal of Machine Learning Research 7 (2006) 1789--1828 .. - Payoff Propagation Jelle   (Correct)
Co-Evolution in the Successful Learning of Backgammon Strategy - Pollack, Blair (1998)   (Correct)

Similar documents (at the sentence level):
40.3%:   Elevator Group Control Using Multiple Reinforcement Learning.. - Crites, Barto (1998)   (Correct)
40.3%:   Large-Scale Dynamic Optimization Using Teams of Reinforcement.. - Crites (1996)   (Correct)

Active bibliography (related documents):   More   All
0.2:   Task-Oriented Reinforcement Learning - Kamal (2003)   (Correct)
0.1:   Mitsubishi Electric Research Laboratories - Http Www Merl (2004)   (Correct)
0.0:   Reinforcement Learning: A Survey - Leslie Pack Kaelbling, Michael L.. (1996)   (Correct)

Similar documents based on text:   More   All
0.7:   Planning and Control Models for Elevators in High-Rise Buildings - Siikonen (1997)   (Correct)
0.6:   Intelligent Elevator Control By Ordinal Structure Fuzzy.. - Khiang, Khalid, Yusof   (Correct)
0.4:   From Theory to Practice: AI Planning for High Performance.. - Koehler (2001)   (Correct)

Related documents from co-citation:   More   All
33:   Practical Issues in Temporal Difference Learning - Tesauro - 1992
31:   Learning to predict by the method of temporal differences - Sutton - 1988
29:   Learning from Delayed Rewards (context) - CJCH - 1989

BibTeX entry:   (Update)

Crites R. H. and A. G. Barto, "Improving Elevator Performance Using Reinforcement Learning", in D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo (Eds.) Advances in Neural Information Processing Systems. http://citeseer.ist.psu.edu/crites96improving.html   More

@inproceedings{ crites96improving,
    author = "Robert H. Crites and Andrew G. Barto",
    title = "Improving Elevator Performance Using Reinforcement Learning",
    booktitle = "Advances in Neural Information Processing Systems",
    volume = "8",
    publisher = "The {MIT} Press",
    editor = "David S. Touretzky and Michael C. Mozer and Michael E. Hasselmo",
    pages = "1017--1023",
    year = "1996",
    url = "citeseer.ist.psu.edu/crites96improving.html" }
Citations (may not include all citations):
658   Learning from Delayed Rewards (context) - Watkins - 1989
219   Practical Issues in Temporal Difference Learning - Tesauro - 1992
110   Temporal Difference Learning and TD--Gammon (context) - Tesauro - 1995
34   Reinforcement Learning Methods for Continuous--Time Markov D.. - Bradtke, Duff - 1995
24   TD--Gammon (context) - Tesauro - 1994
6   Elevator Dispatchers for Down Peak Traffic (context) - Bao, Cassandras et al. - 1994
4   A Dynamic Load Balancing Approach to the Control of Multiser.. (context) - Lewis - 1991



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://www.cs.cmu.edu/Groups/NIPS/NIPS95/Papers.html):   More
A Neural Network Autoassociator for Induction.. - Petsche.. (1996)   (Correct)
Dynamics of Attention as Near Saddle-Node Bifurcation Behavior - Nakahara, Doya (1996)   (Correct)
Empirical Entropy Manipulation for Real-World Problems - Viola, Schraudolph, Sejnowski (1996)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC