59 citations found. Retrieving documents...
John N. Tsitsiklis and Benjamin Van Roy.
Feature-based methods for large scale dynamic programming
. Machine Learning, 22:59--94, 1996.
Home/Search
Document Details and Download
Context
Related Articles
Check
This paper is cited by the following papers:
First 50 documents
Next 50
Solving Factored POMDPs with Linear Value Functions - Guestrin, Koller, Parr (2001)
(3 citations)
(Correct)
Reinforcement Learning Using Neural Networks, with Applications.. - Coulom (2002)
(Correct)
Kernel-Based Reinforcement Learning in Average-Cost Problems: .. - Ormoneit, Glynn (2000)
(2 citations)
(Correct)
Experiments in Robot Control for an Instance-Based.. - Ribeiro, Hemerly (1999)
(Correct)
Convergent Reinforcement Learning with Value Function.. - Szepesvári (2001)
(1 citation)
(Correct)
Event-Learning And Robust Policy Heuristics - Lörincz, Pólik, Szita (2001)
(Correct)
Solving Factored POMDPs with Linear Value Functions - Guestrin, Koller, Parr (2001)
(3 citations)
(Correct)
Max-norm Projections for Factored MDPs - Guestrin, Koller, Parr (2001)
(21 citations)
(Correct)
Kernel-Based Reinforcement Learning - Ormoneit, Sen (1999)
(7 citations)
(Correct)
A Learning Algorithm For Markov Decision Processes With.. - Baras, Borkar (2000)
(1 citation)
(Correct)
Computing factored value functions for policies in.. - Daphne Koller Computer (1999)
(26 citations)
(Correct)
Max-norm Projections for Factored MDPs - Guestrin, Koller, Parr (2001)
(21 citations)
(Correct)
Model-Based Reinforcement Learning with an Approximate, Learned.. - Kuvayev (1997)
(2 citations)
(Correct)
Model-Based Reinforcement Learning with an Approximate.. - Leonid Kuvayev Rich (1996)
(2 citations)
(Correct)
Reinforcement Learning: A Survey - Kaelbling, Littman, Moore (1996)
(367 citations)
(Correct)
Policy Gradient Methods for Reinforcement Learning.. - Sutton.. (2000)
(60 citations)
(Correct)
Value-Function Approximations for Partially Observable Markov.. - Hauskrecht (2000)
(43 citations)
(Correct)
Simulation-Based Methods for Markov Decision Processes - Marbach (1998)
(11 citations)
(Correct)
Model Minimization in Markov Decision Processes - Dean, Givan (1997)
(35 citations)
(Correct)
Stochastic Dynamic Programming with Factored Representations - Boutilier, Dearden, al. (1999)
(30 citations)
(Correct)
Advantages of Cooperation Between Reinforcement Learning.. - Berenji, Vengerov (2000)
(3 citations)
(Correct)
The Complexity of Model Aggregation - Goldsmith, Sloan (2000)
(1 citation)
(Correct)
Learning Algorithms For Markov Decision Processes With.. - Abounadi, Bertsekas (1998)
(4 citations)
(Correct)
Module Based Reinforcement Learning for a Real Robot - Kalmár, Szepesvári, Lorincz
(1 citation)
(Correct)
Approximate Solutions to Markov Decision Processes - Gordon (1999)
(18 citations)
(Correct)
Reinforcement Learning: Theory and Practice - Szepesvári
(Correct)
Solving Stochastic Planning Problems With Large State and.. - Dean, Kim, Givan (1998)
(3 citations)
(Correct)
Stable Fitted Reinforcement Learning - Geoffrey Gordon (1996)
(6 citations)
(Correct)
Module Based Reinforcement Learning for a Real Robot - Kalmar, Szepesvári..
(1 citation)
(Correct)
Computing factored value functions for policies in structured.. - Koller, Parr (1999)
(26 citations)
(Correct)
Static and Dynamic Aspects of Optimal Sequential Decision Making - Szepesvari (1998)
(1 citation)
(Correct)
Learning Controllers for Complex Behavioral Systems - Crawford, Sastry
(Correct)
Correlated Action Effects in Decision Theoretic Regression - Boutilier (1997)
(3 citations)
(Correct)
Approximation in Model-Based Learning - Leonid Kuvayev (1997)
(2 citations)
(Correct)
Generalized Markov Decision Processes.. - Szepesvári.. (1996)
(1 citation)
(Correct)
Solving Factored MDPs via Non-Homogeneous Partitioning - Kee-Eung Kim And
(Correct)
Max-norm Projections for Factored MDPs - Carlos Guestrin Computer (2001)
(21 citations)
(Correct)
International Joint Conference on Artificial.. - Max-Norm Projections For (2001)
(Correct)
Mitsubishi Electric Research Laboratories - Http Www Merl (2003)
(Correct)
Kernel-Based Reinforcement Learning in Average-Cost Problems - Ormoneit, Glynn (2000)
(2 citations)
(Correct)
Efficient Max-Norm Distance Computation and Reliable .. - Varadhan, Krishnan, .. (2003)
(Correct)
Max-norm Projections for Factored MDPs - Guestrin, Koller, Parr (2001)
(21 citations)
(Correct)
Linear Algebra in Very High-Dimension Vector Spaces.. - Kim, Dean, Hazlehurst (2000)
(Correct)
Title of the Book! - Name Of Author
(Correct)
Thesis Proposal: Learning to make decisions from large data sets - Zadrozny
(Correct)
Piecewise Linear Value Function Approximation for Factored MDPs - Poupart, Boutilier (2002)
(4 citations)
(Correct)
Solution of Delayed Reinforcement Learning Problems Having.. - Ravindran (1996)
(Correct)
Mean-Field Theory for Batched-TD(lambda) - Pineda
(Correct)
An Analysis of Temporal-Difference Learning with Function.. - Tsitsiklis, Van Roy (1996)
(54 citations)
Self-citation (Tsitsiklis Van roy)
(Correct)
An Analysis of Temporal-Difference Learning with Function.. - Tsitsiklis, Van Roy (1996)
(54 citations)
Self-citation (Tsitsiklis Van roy)
(Correct)
First 50 documents
Next 50
Online articles have much greater impact
More about CiteSeer.IST
Add search form to your site
Submit documents
Feedback
CiteSeer.IST - Copyright
Penn State
and
NEC