Least-squares policy iteration (2003)

by M Lagoudakis, R Parr