Asynchronous Stochastic Approximation and Q-Learning. (1994)

by J. N. Tsitsiklis
Venue: Machine Learning.