Online regret bounds for Markov decision processes with deterministic transitions (2008)

by Ronald Ortner
Venue: Proc. of the 19th International Conference on Algorithmic Learning Theory (ALT 2008), volume 5254 of Lecture Notes in Computer Science
Citations: 10 (1 self)