Near-optimal reinforcement learning in polynomial time (1998)

by M Kearns, S Singh
Venue: In Proceedings of the Fifteenth International Conference on Machine Learning