Least-Squares Policy Iteration (2003)

by Michail G. Lagoudakis, Ronald Parr, L. Bartlett
Venue: Journal of Machine Learning Research
Citations: 214 - 6 self