Hierarchical Control and Learning for Markov Decision Processes (1998)
| Citations: | 98 - 2 self |
BibTeX
@MISC{Parr98hierarchicalcontrol,
author = {Ronald Edward Parr},
title = {Hierarchical Control and Learning for Markov Decision Processes},
year = {1998}
}
Years of Citing Articles
OpenURL
Abstract
This dissertation investigates the use of hierarchy and problem decomposition as a means of solving large, stochastic, sequential decision problems. These problems are framed as Markov decision problems (MDPs). The new technical content of this dissertation begins with a discussion of the concept of temporal abstraction. Temporal abstraction is shown to be equivalent to the transformation of a policy defined over a region of an MDP to an action in a semi-Markov decision problem (SMDP). Several algorithms are presented for performing this transformation efficiently. This dissertation introduces the HAM method for generating hierarchical, temporally abstract actions. This method permits the partial specification of abstract actions in a way that corresponds to an abstract plan or strategy. Abstr...







