Action elimination and stopping conditions for the multi-armed bandit and reinforcement learning problems (0)

by E Even-Dar, S Mannor, Y Mansour
Venue:The Journal of Machine Learning Research