Policy gradient methods for robotics (2006)

by J Peters, S Schaal
Venue: Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)