Linear least-squares algorithms for temporal difference learning (1996)
by
Steven J. Bradtke
,
Andrew G. Barto
,
Pack Kaelbling
| Venue: | Machine Learning |
| Citations: | 139 - 0 self |







