
Efficient Reinforcement Learning in Factored MDPs (1999)
Michael Kearns (AT&T Labs), Daphne Koller (Stanford...)
IJCAI



View or download:
upenn.edu/~mkearns/papers/dbne3.ps.Z
Abstract: We present a provably efficient and near-optimal algorithm for reinforcement learning in Markov decision processes (MDPs) whose transition model can be factored as a dynamic Bayesian network (DBN).

Similar documents based on text (similarity score: title - authors, year):
0.3:   Policy Iteration for Factored MDPs - Koller, Parr (2000)
0.3:   Using Probabilistic Information in Data Integration - Florescu, Koller, Levy (1997)
0.3:   Max-norm Projections for Factored MDPs - Guestrin, Koller, Parr (2001)

BibTeX entry:

M. Kearns and D. Koller. Efficient reinforcement learning in factored MDPs. In Proc. IJCAI, 1999. http://citeseer.ist.psu.edu/article/kearns99efficient.html

@inproceedings{ kearns99efficient,
    author = "Michael J. Kearns and Daphne Koller",
    title = "Efficient Reinforcement Learning in Factored {MDPs}",
    booktitle = "{IJCAI}",
    pages = "740-747",
    year = "1999",
    url = "citeseer.ist.psu.edu/article/kearns99efficient.html" }
Citations (may not include all citations):
219   A tutorial on learning with Bayesian networks - Heckerman - 1995
188   Decision theoretic planning: Structural assumptions and comp.. - Boutilier, Dean et al. - 1999
130   Influence diagrams - Howard, Matheson - 1984
113   Tractable inference for complex stochastic processes - Boyen, Koller - 1998
61   Polynomial-time approximation algorithms for the Ising model - Jerrum, Sinclair - 1993
59   The BATmobile: Towards a Bayesian automated taxi - Forbes, Huang et al. - 1995
42   Computing factored value functions for policies in structure.. - Koller, Parr - 1999
37   A sparse sampling algorithm for near-optimal planning in lar.. - Kearns, Mansour et al. - 1999
34   Solving very large weakly coupled Markov decision processes - Meuleau, Hauskrecht et al. - 1998
8   Central limit theorem for nonstationary Markov chains - Dobrushin - 1956
8   Near-optimal performance for reinforcement learning in polyn.. - Kearns, Singh - 1998
7   An expert system for control of waste water treatment---a pi.. - Jensen, Kjærulff et al. - 1989
2   Lectures on the Coupling Method - Lindvall - 1992

Documents on the same site (http://www.cis.upenn.edu/~mkearns/):
Graphical Economics - Sham Kakade Michael
Efficient Algorithms for Learning to Play Repeated Games Against.. - al. (1995)
On the Boosting Ability of Top-Down Decision Tree Learning.. - Kearns (1996)
