lil’UCB: An Optimal Exploration Algorithm for Multi-Armed Bandits. (2014)

by Kevin Jamieson, Matthew Malloy, Robert Nowak, Sebastien Bubeck
Venue:In Conference on Learning Theory,