An on-line algorithm for dynamic reinforcement learning and planning in reactive environments. (1990)

by J Schmidhuber
Venue: In International Joint Conference on Neural Networks. IEEE