7 citations found. Retrieving documents...
Gordon, G. J. 1996. Stable fitted reinforcement learning. In Touretzky, D. S.; Mozer, M. C.; and Hasselmo, M. E., eds., Advances in Neural Information Processing Systems, volume 8, 1052--1058. The MIT Press.

 Home/Search   Document Details and Download   Summary   Related Articles   Check  

This paper is cited in the following contexts:
Reinforcement Learning with Policy Constraints - Thrun, Schulte   (Correct)

....space. Submitted to ICML 98 1 Introduction In recent years, the field of reinforcement learning [2, 26, 33] has received significant attention in the AI community. Recent research has led to a much improved theoretical understanding of the capabilities and limitations of reinforcement learning [1, 2, 4, 11, 7, 10]. Various researchers have successfully applied reinforcement learning techniques to challenging real world problems [5, 25, 29, 34] The vast majority of the work, however, has focused on situations where a reinforcement learner faces a single task. In this paper we consider the problem where a ....

G. J. Gordon. Stable fitted reinforcement learning. In D. Touretzky, M. Mozer, and M.E. Hasselmo, editors, Advances in Neural Information Processing Systems 8, Cambridge, MA, 1996. MIT Press.


Chattering in SARSA(lambda) - A CMU Learning Lab Internal Report - Gordon (1996)   Self-citation (Gordon)   (Correct)

.... including Sutton, Dayan, and Sejnowski have proven the convergence of TD( in various situations; the most comprehensive proof is [TV96] A more complete description of SARSA, together with examples, is in [Sut96] The example of figure 1 is a simplified version of an example presented in [Gor96]. The examples attributed to Bertsekas were presented in his talk at the NSF workshop on reinforcement learning [Ber96] Acknowledgements This material is based on work supported under a National Science Foundation Graduate Research Fellowship and by ARPA grant number F33615 93 1 1330. Any ....

G. J. Gordon. Stable fitted reinforcement learning. In D. Touretzky, M. Mozer, and M. Hasselmo, editors, Advances in Neural Information Processing Systems, volume 8. MIT Press, 1996.


Mitsubishi Electric Research Laboratories - Http Www Merl (2003)   (Correct)

No context found.

Gordon, G. J. 1996. Stable fitted reinforcement learning. In Touretzky, D. S.; Mozer, M. C.; and Hasselmo, M. E., eds., Advances in Neural Information Processing Systems, volume 8, 1052--1058. The MIT Press.


Logical Markov Decision Programs and the Convergence of.. - Kersting, De Raedt (2004)   (1 citation)  (Correct)

No context found.

G. J. Gordon. Stable fitted reinforcement learning. In Advances in Neural Information Processing, pages 1052--1058. MIT Press, 1996.


A Symbol's Role In Learning Low Level Control Functions - Drummond (1999)   (1 citation)  (Correct)

No context found.

. Stable Function Approximation in Dynamic Programming. In Proceedings of the Twelfth International Conference of Machine Learning, pp.


Reinforcement Learning Through Gradient Descent - Baird, III (1999)   (7 citations)  (Correct)

No context found.

Gordon, G. (1996). "Stable fitted reinforcement learning". In G. Tesauro, M. Mozer, and M. Hasselmo (eds.), Advances in Neural Information Processing Systems 8, pp. 10521058.


Gradient Descent for General Reinforcement Learning - Baird, Moore (1998)   (53 citations)  (Correct)

No context found.

Gordon, G. (1996). "Stable fitted reinforcement learning". In G. Tesauro, M. Mozer, and M.

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC