Fast reinforcement learning of dialogue policies using stable function approximation (2005)

by Matthias Denecke, Kohji Dohsaka, Mikio Nakano
Venue:In