Results 1 -
1 of
1
Table 1: A summary of the model-based algorithms described in this paper. The ! column contains d if the internal state update is deterministic and s if the update is stochastic. Similarly, the column indicates if the choice of action is deterministic of stochastic. Uppercase D indicates that the function is xed instead of learnt. The last column describes how parameterises !(hj ; g; y) and parameterises (uj ; h; y).
in A (Revised) Survey of Approximate Methods for Solving Partially Observable Markov Decision Processes
2003
"... In PAGE 29: ...cales [Precup, 2000]. Ghavamzadeh and Mahadevan [2001] and Makar et al. [2001] extend MAXQ to continuous time and demonstrate the algorithm in a multi-agent automated guided vehicle setting. 9 Summary Table1 summarises the model-based algorithms described in this paper. Table 2 correspondingly summarises the model-free algorithms.... In PAGE 31: ...column has the same meaning as Table1 . The tables are not a complete sum- mary of all POMDP algorithms.... ..."
Cited by 22