| H. Geffner and B. Bonet, "Solving large POMDPs using real time dynamic programming," in Working Notes Fall AAAI Symposium on POMDPs, pp. 61--68, 1998. |
.... the values of the leaves as 0 (due to discounting) An alternative is to use an admissible heuristic (lower bound) for the value of the leaves (e.g. V MDP (b) and to update the values of the intermediate nodes as in Real Time Dynamic Programming [BBS95] This is the approach taken in [GB98], who discretizes b, so V can be represented as a table. Unfortunately, receeding horizon control only works for discrete (or discretized) actions (except for special cases, like linear quadratic Gaussian) 4 Direct policy search The most widely used approach to solving MDPs is an indirect one, ....
H. Gener and B. Bonet. Solving large POMDPs using real time dynamic programming. In Fall AAAI Symp. on POMDPs, 1998.
.... (MDPs) and Partially Observable MDPs (POMDPs) borrows from the work in Dynamic Programming [51, 6] Finally, the use heuristic search algorithms has long tradition in AI (e.g. 46] even though the use of such algorithms for solving Strips planning problems, MDPs, and POMDPs is more recent [3, 13, 11, 32]. In this paper we provide a coherent integration of these various elements and argue that the result provides a natural framework for modeling and solving planning problems under a variety of conditions, including deterministic and probabilistic actions, and complete, partial or null sensing. We ....
.... S, is infinite and continuous (unlike the finite state mdps we have considered so far) The known optimal pomdp algorithms can thus solve very small problems only (e.g. see [17] Non optimal methods, on the other hand, scale up better and often produce reasonable results for non trivial problems [33, 31, 59, 11]. Here we present an algorithm that is the adaptation of rtdp for solving belief mdps. We call such algorithm rtdp bel and show it in Fig. 6. Details on rtdp bel can be found in [11, 9, 10] where the algorithm is used to solve a variety of problems: planning problems with sensing, robot ....
[Article contains additional citation context not shown here]
B. Bonet and H. Geffner. Solving large POMDPs using real time dynamic programming. In Proc. AAAI Fall Symp. on POMDPs, 1998. 24
No context found.
Bonet, B., and Geffner, H. 1998b. Solving large POMDPs using real time dynamic programming. In Proc. AAAI Fall Symp. on POMDPs.
No context found.
Geffner, H., and Bonet, B. 1998b. Solving large POMDPs using real time dynamic programming.
No context found.
H. Geffner and B. Bonet, "Solving large POMDPs using real time dynamic programming," in Working Notes Fall AAAI Symposium on POMDPs, pp. 61--68, 1998.
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC