Error bounds for approximate policy iteration (2003)

by R Munos
Venue:In International Conference on Machine Learning