Nonparametric Bayesian policy priors for reinforcement learning (2010)

by F Doshi-Velez
Venue: In Advances in Neural Information Processing Systems 23