59 citations found. Retrieving documents...
John N. Tsitsiklis and Benjamin Van Roy. Feature-based methods for large scale dynamic programming. Machine Learning, 22:59--94, 1996.

 Home/Search   Document Details and Download   Context   Related Articles   Check  

This paper is cited by the following papers:

First 50 documents  Next 50

Solving Factored POMDPs with Linear Value Functions - Guestrin, Koller, Parr (2001)   (3 citations)  (Correct)
Reinforcement Learning Using Neural Networks, with Applications.. - Coulom (2002)   (Correct)
Kernel-Based Reinforcement Learning in Average-Cost Problems: .. - Ormoneit, Glynn (2000)   (2 citations)  (Correct)
Experiments in Robot Control for an Instance-Based.. - Ribeiro, Hemerly (1999)   (Correct)
Convergent Reinforcement Learning with Value Function.. - Szepesvári (2001)   (1 citation)  (Correct)
Event-Learning And Robust Policy Heuristics - Lörincz, Pólik, Szita (2001)   (Correct)
Solving Factored POMDPs with Linear Value Functions - Guestrin, Koller, Parr (2001)   (3 citations)  (Correct)
Max-norm Projections for Factored MDPs - Guestrin, Koller, Parr (2001)   (21 citations)  (Correct)
Kernel-Based Reinforcement Learning - Ormoneit, Sen (1999)   (7 citations)  (Correct)
A Learning Algorithm For Markov Decision Processes With.. - Baras, Borkar (2000)   (1 citation)  (Correct)
Computing factored value functions for policies in.. - Daphne Koller Computer (1999)   (26 citations)  (Correct)
Max-norm Projections for Factored MDPs - Guestrin, Koller, Parr (2001)   (21 citations)  (Correct)
Model-Based Reinforcement Learning with an Approximate, Learned.. - Kuvayev (1997)   (2 citations)  (Correct)
Model-Based Reinforcement Learning with an Approximate.. - Leonid Kuvayev Rich (1996)   (2 citations)  (Correct)
Reinforcement Learning: A Survey - Kaelbling, Littman, Moore (1996)   (367 citations)  (Correct)
Policy Gradient Methods for Reinforcement Learning.. - Sutton.. (2000)   (60 citations)  (Correct)
Value-Function Approximations for Partially Observable Markov.. - Hauskrecht (2000)   (43 citations)  (Correct)
Simulation-Based Methods for Markov Decision Processes - Marbach (1998)   (11 citations)  (Correct)
Model Minimization in Markov Decision Processes - Dean, Givan (1997)   (35 citations)  (Correct)
Stochastic Dynamic Programming with Factored Representations - Boutilier, Dearden, al. (1999)   (30 citations)  (Correct)
Advantages of Cooperation Between Reinforcement Learning.. - Berenji, Vengerov (2000)   (3 citations)  (Correct)
The Complexity of Model Aggregation - Goldsmith, Sloan (2000)   (1 citation)  (Correct)
Learning Algorithms For Markov Decision Processes With.. - Abounadi, Bertsekas (1998)   (4 citations)  (Correct)
Module Based Reinforcement Learning for a Real Robot - Kalmár, Szepesvári, Lorincz   (1 citation)  (Correct)
Approximate Solutions to Markov Decision Processes - Gordon (1999)   (18 citations)  (Correct)
Reinforcement Learning: Theory and Practice - Szepesvári   (Correct)
Solving Stochastic Planning Problems With Large State and.. - Dean, Kim, Givan (1998)   (3 citations)  (Correct)
Stable Fitted Reinforcement Learning - Geoffrey Gordon (1996)   (6 citations)  (Correct)
Module Based Reinforcement Learning for a Real Robot - Kalmar, Szepesvári..   (1 citation)  (Correct)
Computing factored value functions for policies in structured.. - Koller, Parr (1999)   (26 citations)  (Correct)
Static and Dynamic Aspects of Optimal Sequential Decision Making - Szepesvari (1998)   (1 citation)  (Correct)
Learning Controllers for Complex Behavioral Systems - Crawford, Sastry   (Correct)
Correlated Action Effects in Decision Theoretic Regression - Boutilier (1997)   (3 citations)  (Correct)
Approximation in Model-Based Learning - Leonid Kuvayev (1997)   (2 citations)  (Correct)
Generalized Markov Decision Processes.. - Szepesvári.. (1996)   (1 citation)  (Correct)
Solving Factored MDPs via Non-Homogeneous Partitioning - Kee-Eung Kim And   (Correct)
Max-norm Projections for Factored MDPs - Carlos Guestrin Computer (2001)   (21 citations)  (Correct)
International Joint Conference on Artificial.. - Max-Norm Projections For (2001)   (Correct)
Mitsubishi Electric Research Laboratories - Http Www Merl (2003)   (Correct)
Kernel-Based Reinforcement Learning in Average-Cost Problems - Ormoneit, Glynn (2000)   (2 citations)  (Correct)
Efficient Max-Norm Distance Computation and Reliable .. - Varadhan, Krishnan, .. (2003)   (Correct)
Max-norm Projections for Factored MDPs - Guestrin, Koller, Parr (2001)   (21 citations)  (Correct)
Linear Algebra in Very High-Dimension Vector Spaces.. - Kim, Dean, Hazlehurst (2000)   (Correct)
Title of the Book! - Name Of Author   (Correct)
Thesis Proposal: Learning to make decisions from large data sets - Zadrozny   (Correct)
Piecewise Linear Value Function Approximation for Factored MDPs - Poupart, Boutilier (2002)   (4 citations)  (Correct)
Solution of Delayed Reinforcement Learning Problems Having.. - Ravindran (1996)   (Correct)
Mean-Field Theory for Batched-TD(lambda) - Pineda   (Correct)
An Analysis of Temporal-Difference Learning with Function.. - Tsitsiklis, Van Roy (1996)   (54 citations)  Self-citation (Tsitsiklis Van roy)   (Correct)
An Analysis of Temporal-Difference Learning with Function.. - Tsitsiklis, Van Roy (1996)   (54 citations)  Self-citation (Tsitsiklis Van roy)   (Correct)

First 50 documents  Next 50

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC