(Enter summary)
Abstract: We present a provably efficient and near-optimal algorithm
for reinforcement learning in Markov decision
processes (MDPs) whose transition model can
be factored as a dynamic Bayesian network (DBN). (Update)
Similar documents based on text: More All
0.3: Policy Iteration for Factored MDPs - Koller, Parr (2000)
(Correct)
0.3: Using Probabilistic Information in Data Integration - Florescu, Koller, Levy (1997)
(Correct)
0.3: Max-norm Projections for Factored MDPs - Guestrin, Koller, Parr (2001)
(Correct)
BibTeX entry: (Update)
M. Kearns and D. Koller. Efficient reinforcement learning in factored MDPs. In Proc. IJCAI, 1999. http://citeseer.ist.psu.edu/article/kearns99efficient.html More
@inproceedings{ kearns99efficient,
author = "Michael J. Kearns and Daphne Koller",
title = "Efficient Reinforcement Learning in Factored {MDPs}",
booktitle = "{IJCAI}",
pages = "740-747",
year = "1999",
url = "citeseer.ist.psu.edu/article/kearns99efficient.html" }
Citations (may not include all citations):
219
A tutorial on learning with Bayesian networks
- Heckerman - 1995
188
Decision theoretic planning: Structural assumptions and comp..
- Boutilier, Dean et al. - 1999
130
Influence diagrams (context) - Howard, Matheson - 1984
113
Tractable inference for complex stochastic processes
- Boyen, Koller - 1998
61
Polynomialtime approximation algorithms for the Ising model
- Jerrum, Sinclair - 1993
59
The BATmobile: Towards a Bayesian automated taxi
- Forbes, Huang et al. - 1995
42
Computing factored value functions for policies in structure..
- Koller, Parr - 1999
37
A sparse sampling algorithm for near-optimal planning in lar..
- Kearns, Mansour et al. - 1999
34
Solving very large weakly coupled Markov decision processes
- Meuleau, Hauskrecht et al. - 1998
8
Central limit theorem for nonstationary Markov chains (context) - Dobrushin - 1956
8
Near-optimal performance for reinforcement learning in polyn.. (context) - Kearns, Singh - 1998
7
An expert system for control of waste water treatment---a pi.. (context) - Jensen, Kjrulff et al. - 1989
2
Lectureson the Coupling Method (context) - Lindvall - 1992
Documents on the same site (http://www.cis.upenn.edu/~mkearns/): More
Graphical Economics - Sham Kakade Michael
(Correct)
Efficient Algorithms for Learning to Play Repeated Games Against.. - al. (1995)
(Correct)
On the Boosting Ability of Top-Down Decision Tree Learning.. - Kearns (1996)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC