Residual algorithms: Reinforcement learning with function approximation (1995)

by L. C. Baird
Venue: In Proceedings of the Twelfth International Conference on Machine Learning (ICML