Learning to predict by the methods of temporal di®erences (1988)

by R S Sutton
Venue:Machine Learning