• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

DMCA

Reinforcement learning: a survey (1996)

Cached

  • Download as a PDF

Download Links

  • [www.cs.cmu.edu]
  • [www.cs.cmu.edu]
  • [www.cs.cmu.edu]
  • [www2.cs.uh.edu]
  • [www.cs.cmu.edu]
  • [www.cs.cmu.edu]
  • [www.eecs.ucf.edu]
  • [www.eecs.ucf.edu]
  • [www.eecs.ucf.edu]
  • [www.eecs.ucf.edu]
  • [www2.cs.uh.edu]
  • [www.cs.cmu.edu]
  • [opim.wharton.upenn.edu]
  • [grace.wharton.upenn.edu]
  • [www.mil.ufl.edu]
  • [mil.ufl.edu]
  • [synapse.cs.byu.edu]
  • [www.cs.uml.edu]
  • [www.cs.uml.edu]
  • [iridia.ulb.ac.be]
  • [www.ri.cmu.edu]
  • [pecan.srv.cs.cmu.edu]
  • [www.cs.duke.edu]
  • [www.cs.cmu.edu]
  • [www.cse.wustl.edu]
  • [www.nbu.bg]
  • [www.eecs.harvard.edu]
  • [www.eecs.harvard.edu]
  • [csl.anu.edu.au]
  • [www-2.cs.cmu.edu]
  • [www.jair.org]
  • [people.cs.pitt.edu]
  • [users.cecs.anu.edu.au]
  • [www.cc.gatech.edu]
  • [people.cs.pitt.edu]
  • [engr.case.edu]
  • [www.cc.gatech.edu]
  • [jair.org]
  • [people.cs.pitt.edu]
  • [www.cc.gatech.edu]
  • [people.cs.pitt.edu]
  • [www.ri.cmu.edu]
  • [arxiv.org]

  • Other Repositories/Bibliography

  • DBLP
  • Save to List
  • Add to Collection
  • Correct Errors
  • Monitor Changes
by Leslie Pack Kaelbling , Michael L. Littman , Andrew W. Moore
Venue:Journal of Artificial Intelligence Research
Citations:1712 - 25 self
  • Summary
  • Citations
  • Active Bibliography
  • Co-citation
  • Clustered Documents
  • Version History

BibTeX

@ARTICLE{Kaelbling96reinforcementlearning:,
    author = {Leslie Pack Kaelbling and Michael L. Littman and Andrew W. Moore},
    title = {Reinforcement learning: a survey},
    journal = {Journal of Artificial Intelligence Research},
    year = {1996},
    volume = {4},
    pages = {237--285}
}

Share

Facebook Twitter Reddit Bibsonomy

OpenURL

 

Abstract

This paper surveys the field of reinforcement learning from a computer-science perspective. It is written to be accessible to researchers familiar with machine learning. Both the historical basis of the field and a broad selection of current work are summarized. Reinforcement learning is the problem faced by an agent that learns behavior through trial-and-error interactions with a dynamic environment. The work described here has a resemblance to work in psychology, but differs considerably in the details and in the use of the word "reinforcement." The paper discusses central issues of reinforcement learning, including trading off exploration and exploitation, establishing the foundations of the field via Markov decision theory, learning from delayed reinforcement, constructing empirical models to accelerate learning, making use of generalization and hierarchy, and coping with hidden state. It concludes with a survey of some implemented systems and an assessment of the practical utility of current methods for reinforcement learning.

Keyphrases

reinforcement learning    work described    computer-science perspective    trial-and-error interaction    implemented system    broad selection    central issue    historical basis    machine learning    practical utility    word reinforcement    current work    markov decision theory    dynamic environment    empirical model    hidden state    current method   

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University