413 citations found. Retrieving documents...
D. Bertsekas and J. Tsitsiklis. Neuro-Dynamic Programming. Athena Scientific, Belmont, MA, 1996.

 Home/Search   Document Not in Database   Summary   Related Articles   Check  

This paper is cited in the following contexts:

First 50 documents  Next 50

Least-Squares Temporal Difference Learning - Justin Boyan Nasa   (Correct)

No context found.

D. Bertsekas and J. Tsitsiklis. Neuro-Dynamic Programming. Athena Scientific, Belmont, MA, 1996.


On the Smoothness of Linear Value Function Approximations - Branislav Kveton Intelligent   (Correct)

No context found.

Bertsekas, D., and Tsitsiklis, J. 1996. Neuro-Dynamic Programming. Belmont, MA: Athena Scientific.


Planning In Hybrid Structured Stochastic - Domains Comenius University   (Correct)

No context found.

Dimitri Bertsekas and John Tsitsiklis. Neuro-Dynamic Programming. Athena Scientific, Belmont, MA, 1996.


Learning to Control an Octopus Arm with - Gaussian Process Temporal   (Correct)

No context found.

D.P. Bertsekas and J.N. Tsitsiklis. Neuro-Dynamic Programming. Athena Scientific, 1996.


Stochastic Reactive Production Scheduling by - Asynchronous (2005)   (Correct)

No context found.

Bertsekas, D. P., Tsitsiklis J. N.: Neuro-Dynamic Programming (1996)


Journal of Machine Learning Research 7 (2006) 1079--1105.. - Multi-Armed Bandit And   (Correct)

No context found.

D. P. Bertsekas and J. N. Tsitsiklis. Neuro-Dynamic Programming. Athena Scientific, Belmont, MA, 1996.


Journal of Machine Learning Research 7 (2006) 1079--1105.. - Multi-Armed Bandit And   (Correct)

No context found.

D. P. Bertsekas and J. N. Tsitsiklis. Neuro-Dynamic Programming. Athena Scientific, 1995.


Algorithms for Planning under Uncertainty in Prediction .. - O'Kane, Tovar, Cheng.. (2005)   (Correct)

No context found.

D. P. Bertsekas and J. Tsitsiklis. Neuro-Dynamic Programming. Athena Scientific, 1996.


A Simulation-Based Algorithm for Ergodic Control of.. - Bhatnagar, Borkar, al. (2006)   (Correct)

No context found.

D. P. Bertsekas and J. Tsitsiklis. Neuro-dynamic Programming. Athena Scientific, Boston, MA, USA, 1996.


Journal of Machine Learning Research 7 (2006) 1789--1828 .. - Payoff Propagation Jelle   (Correct)

No context found.

D. P. Bertsekas and J. N. Tsitsiklis. Neuro-dynamic programming. Athena Scientific, 1996.


Boundedness of Iterates in Q-Learning - Abhijit Gosavi Department   (Correct)

No context found.

D. Bertsekas and J. Tsitsiklis. Neuro-dynamic programming. Athena Scientific, MA, 1996.


Reinforcement Learning for Long-Run Average Cost - Abhijit Gosavi Assistant   (Correct)

No context found.

Dimitri P. Bertsekas and John N. Tsitsiklis. Neuro-Dynamic Programming. Athena Scientific, Belmont, MA, 1996.


Journal of Artificial Intelligence Research 15 (2001).. - Jonathan Baxter Jbaxter   (Correct)

No context found.

Bertsekas, D. P., & Tsitsiklis, J. N. (1996). Neuro-Dynamic Programming. Athena Scientific.


Hierarchical Solution of Markov Decision Processes using .. - Milos Hauskrecht Nicolas (1998)   (36 citations)  (Correct)

No context found.

D. P. Bertsekas and J.. N. Tsitsiklis. Neuro-dynamic Programming. Athena, 1996.


Approximate Solutions to Factored Markov Decision Processes via - Greedy Search In   (Correct)

No context found.

Bertsekas, D. P., and Tsitsiklis, J. N. 1996. NeuroDynamic Programming. Belmont, Massachusetts: Athena Scientific.


Solving Factored MDPs via Non-Homogeneous Partitioning - Kee-Eung Kim And   (Correct)

No context found.

Dimitri P. Bertsekas and John N. Tsitsiklis. Neuro-Dynamic Programming.Athena Scientific, 1996.


Appeared in the Twentieth Conference on Uncertainty in.. - Solving Factored Mdps (2004)   (Correct)

No context found.

D. Bertsekas and J. Tsitsiklis. Neuro-dynamic Programming. Athena, 1996.


Covariant Policy Search - Andrew Bagnell And (2003)   (Correct)

No context found.

D. Bertsekas and J. Tsitsiklis. Neuro-Dynamic Programming. Athena Scientific, 1996.


Intensive Reinforcement Learning - Wawrzyski (2005)   (Correct)

No context found.

D. P. Bertsekas and J. N. Tsitsiklis. Neuro-Dynamic Programming, Athena Scientific, 1997.


Reinforcement Learning for Humanoid Robotics - Peters, Vijayakumar, Schaal (2003)   (Correct)

No context found.

Bertsekas, D. P. & Tsitsiklis, J. N.. Neuro-Dynamic Programming. Athena Scientific, Belmont, MA, 1996


Linear Program Approximations for Factored Continuous-State .. - Hauskrecht, Kveton (2003)   (Correct)

No context found.

D.P. Bertsekas and J.N. Tsitsiklis. Neuro-dynamic Programming. Athena Sc., 1996.


Greedy linear value-approximation for factored Markov.. - Relu Patrascu Rpatrasc (2002)   (2 citations)  (Correct)

No context found.

Bertsekas, D., and Tsitsiklis, J. 1996. Neuro-Dynamic Programming. Athena Scientific.


Mitsubishi Electric Research Laboratories - Http Www Merl (2003)   (Correct)

No context found.

Bertsekas, D. P., and Tsitsiklis, J. N. 1996. Neuro-Dynamic Programming. Belmont, MA: Athena Scientific.


Stochasticsand - Statistics Reinforcement Learning   (Correct)

No context found.

D.P. Bertsekas, J.N. Tsitsiklis, Neuro-Dynamic Programming, Athena Scientific, Belmont, MA, 1996.


Solving Factored MDPs with Continuous and Discrete Variables - Guestrin, Hauskrecht.. (2004)   (Correct)

No context found.

D. Bertsekas and J. Tsitsiklis. Neuro-dynamic Programming. Athena, 1996.

First 50 documents  Next 50

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC