Deviations of stochastic bandit regret. (2011)

by Antoine Salomon, Jean-Yves Audibert
Venue:In Algorithmic Learning Theory,