Policy search by dynamic programming. (2003)

by J Andrew Bagnell, Sham M Kakade, Jeff G Schneider, Andrew Y Ng
Venue: In Neural Information Processing Systems (NIPS)