Adaptation of an interactive robot’s behavior using policy gradient reinforcement learning (2005)

by Christian Smith, Noriaki Mitsunaga, Takayuki Kanda, Hiroshi Ishiguro, and Norihiko Hagita
Venue: In Proceedings of the tenth Robotics Symposia