Combining manual feedback with subsequent MDP reward signals for reinforcement learning (0)

by W B Knox, P Stone
Venue:in: Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems