Learning policies for partially observable environments: Scaling up (1995)

by M L Littman, A R Cassandra, L Pack Kaelbling
Venue: International Conference on Machine Learning