Reinforcement learning for dialog management using least-squares policy iteration and fast feature selection. (2009)

by Lihong Li, Jason D Williams, Suhrid Balakrishnan
Venue:In Proc. of Interspeech,