Stochastic convex optimization with bandit feedback. (2011)

by Alekh Agarwal, Dean P Foster, Daniel J Hsu, Sham M Kakade, Alexander Rakhlin
Venue:In Advances in Neural Information Processing Systems,