Hierarchical reinforcement learning with the MAXQ value function decomposition. (2000)

by T G Dietterich
Venue: Journal of Artificial Intelligence Research