Derivatives of Logarithmic Stationary Distributions for Policy Gradient Reinforcement Learning

by Tetsuro Morimura, Eiji Uchibe, Junichiro Yoshimoto, Jan Peters, Kenji Doya