A reduction of imitation learning and structured prediction to no-regret online learning (2011)

by Stephane Ross, Geoff J Gordon, J Andrew Bagnell
Venue:In AI-Stats