Hiroshi Ishiguro, and Norihiko Hagita. Robot behavior adaptation for human-robot interaction based on policy gradient reinforcement learning (2005)

by Noriaki Mitsunaga, Christian Smith, Takayuki Kanda
Venue:In IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS2005