Learning and Sequential Decision Making," (1989)

by A G Barto, R S Sutton, C J C H Watkins