Near-optimal reinforcement learning in polynomial time (1998)

by Michael Kearns
Venue:Machine Learning