A natural policy gradient

by S Kakade
Venue: Advances in Neural Information Processing Systems