Markov decision processes with arbitrary reward processes (2009)

by Jia Yuan Yu, Shie Mannor, Nahum Shimkin
Venue:Mathematics of Operations Research