An expectation maximization algorithm for continuous Markov decision processes with arbitrary rewards (2009)

by Matt Hoffman, Nando de Freitas, Arnaud Doucet, Jan Peters
Venue: IN TWELFTH INT. CONF. ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS
Citations: 13 - 2 self