Gordon, G. J. 1996. Stable fitted reinforcement learning. In Touretzky, D. S.; Mozer, M. C.; and Hasselmo, M. E., eds., Advances in Neural Information Processing Systems, volume 8, 1052--1058. The MIT Press.
....space. Submitted to ICML 98. 1 Introduction. In recent years, the field of reinforcement learning [2, 26, 33] has received significant attention in the AI community. Recent research has led to a much improved theoretical understanding of the capabilities and limitations of reinforcement learning [1, 2, 4, 11, 7, 10]. Various researchers have successfully applied reinforcement learning techniques to challenging real-world problems [5, 25, 29, 34]. The vast majority of the work, however, has focused on situations where a reinforcement learner faces a single task. In this paper we consider the problem where a ....
G. J. Gordon. Stable fitted reinforcement learning. In D. Touretzky, M. Mozer, and M.E. Hasselmo, editors, Advances in Neural Information Processing Systems 8, Cambridge, MA, 1996. MIT Press.
.... including Sutton, Dayan, and Sejnowski have proven the convergence of TD(λ) in various situations; the most comprehensive proof is [TV96]. A more complete description of SARSA, together with examples, is in [Sut96]. The example of figure 1 is a simplified version of an example presented in [Gor96]. The examples attributed to Bertsekas were presented in his talk at the NSF workshop on reinforcement learning [Ber96]. Acknowledgements: This material is based on work supported under a National Science Foundation Graduate Research Fellowship and by ARPA grant number F33615-93-1-1330. Any ....
G. J. Gordon. Stable fitted reinforcement learning. In D. Touretzky, M. Mozer, and M. Hasselmo, editors, Advances in Neural Information Processing Systems, volume 8. MIT Press, 1996.
No context found.
Gordon, G. J. 1996. Stable fitted reinforcement learning. In Touretzky, D. S.; Mozer, M. C.; and Hasselmo, M. E., eds., Advances in Neural Information Processing Systems, volume 8, 1052--1058. The MIT Press.
No context found.
G. J. Gordon. Stable fitted reinforcement learning. In Advances in Neural Information Processing, pages 1052--1058. MIT Press, 1996.
No context found.
. Stable Function Approximation in Dynamic Programming. In Proceedings of the Twelfth International Conference of Machine Learning, pp.
No context found.
Gordon, G. (1996). "Stable fitted reinforcement learning". In G. Tesauro, M. Mozer, and M. Hasselmo (eds.), Advances in Neural Information Processing Systems 8, pp. 1052--1058.
No context found.
Gordon, G. (1996). "Stable fitted reinforcement learning". In G. Tesauro, M. Mozer, and M.