Using Policy Gradient Reinforcement Learning on Autonomous Robot Controllers (2000)

by Gregory Z Grudic, Vijay Kumar
Venue: IROS03, Las Vegas, US, October 2003

[11] Richard S. Sutton et al., "Policy Gradient Methods for Reinforcement Learning with Function Approximation", Advances in Neural Information Processing Systems