Regret bounds for reinforcement learning with policy advice. (2013)

by Mohammad Gheshlaghi Azar, Alessandro Lazaric, Emma Brunskill
Venue: In Machine Learning and Knowledge Discovery in Databases,