STD(λ): learning state differences with TD(λ)

Cached

Download Links

by Lex Weaver , Jonathan Baxter