Finite-time analysis of the multiarmed bandit problem (2002)

by Peter Auer, Nicolò Cesa-Bianchi, Paul Fischer