• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 1 of 1

Table 1: A summary of the model-based algorithms described in this paper. The ! column contains d if the internal state update is deterministic and s if the update is stochastic. Similarly, the column indicates if the choice of action is deterministic of stochastic. Uppercase D indicates that the function is xed instead of learnt. The last column describes how parameterises !(hj ; g; y) and parameterises (uj ; h; y).

in A (Revised) Survey of Approximate Methods for Solving Partially Observable Markov Decision Processes
by Douglas Aberdeen 2003
"... In PAGE 29: ...cales [Precup, 2000]. Ghavamzadeh and Mahadevan [2001] and Makar et al. [2001] extend MAXQ to continuous time and demonstrate the algorithm in a multi-agent automated guided vehicle setting. 9 Summary Table1 summarises the model-based algorithms described in this paper. Table 2 correspondingly summarises the model-free algorithms.... In PAGE 31: ...column has the same meaning as Table1 . The tables are not a complete sum- mary of all POMDP algorithms.... ..."
Cited by 22
Results 1 - 1 of 1
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University