Safe policy iteration (2013)

by Matteo Pirotta, Marcello Restelli, Alessio Pecorino, Daniele Calandriello
Venue:In Proceedings of The 30th International Conference on Machine Learning