Near-optimal reinforcement learning in polynomial time. (1999)

by M J Kearns, S Singh
Venue:Machine Learning,