Hybrid least-squares algorithms for approximate policy evaluation (2009)

by J Johns, M Petrik, S Mahadevan
Venue: Machine Learning