Approximate dynamic programming for two-player zero-sum markov games (2015)

by Julien Perolat, Bilal Piot, Bruno Scherrer, Olivier Pietquin
Venue:In Proc. of ICML