Bias correction and confidence intervals for fitted Q-iteration (2008)

by B Chakraborty, V Strecher, S Murphy
Venue:In Workshop on model uncertainty and risk in reinforcement learning (NIPS