Dynamic potential-based reward shaping (2012)

by S Devlin, D Kudenko
Venue:In Proceedings of The Eleventh Annual International Conference on Autonomous Agents and Multiagent Systems