Optimal learning with non-Gaussian rewards (2013)

by Z Ding, I O Ryzhov
Venue: Proceedings of the 2013 Winter Simulation Conference