Trust region policy optimization. (2015)

by John Schulman, Sergey Levine, Pieter Abbeel, Michael I Jordan, Philipp Moritz
Venue:In Proceedings of the 32nd International Conference on Machine Learning, ICML 2015,