Bandit based Monte-Carlo planning (2006)

by L Kocsis, C Szepesvari
Venue:in Proceedings of European Conference on Machine Learning