An extended policy gradient algorithm for robot task learning (2007)

by A Cherubini, F Giannone, L Iocchi, P F Palamara
Venue: Proc. of the IEEE/RSJ International Conference on Intelligent Robots and Systems