MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  Planning and control in stochastic domains with imperfect information (1997) [35 citations — 6 self]

Download:
pdf | ps
by Milos Hauskrecht, Milos Hauskrecht
http://medg.lcs.mit.edu/people/milos/thesis/./report.ps.gz
Add To MetaCart

Abstract:

Information by

Citations

4388 Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference – Pearl - 1988
2141 Learning Internal Representations by Error Propagation – Rumelhart, Hinton, et al. - 1986
1877 Artificial Intelligence: A Modern Approach – Russell, Norvig - 2002
1399 Dynamic Programming – Bellman - 1957
939 Learning from Delayed Rewards – Watkins - 1989
887 Reinforcement learning: A survey – Kaelbling, Littman, et al. - 1996
583 An Introduction to Bayesian Networks – Jensen - 1996
548 Markov decision processes : discrete stochastic dynamic programming – Puterman - 1994
545 An introduction to hidden markov models – Rabiner, Juang - 1986
539 Graphical models – Lauritzen - 1996
377 Neuronlike adaptive elements that can solve difficult learning control problems – Barto, Sutton, et al. - 1983
375 Integrated Architectures for Learning, Planning and Reacting based on Approximate Dynamic Programming. Appeared – Sutton - 1990
353 Dynamic Programming and Markov Processes – Howard - 1960
292 Neuro-Dynamic Programming – Bertsekas, Tsitsiklis - 1996
291 Planning and Control – Dean, Wellman - 1991
235 uence diagrams – Howard, Matheson - 1981
221 The optimal control of Partially Observable Markov Processe – Sondik - 1971
212 Introduction to applied mathematics – Strang - 1986
210 Acting optimally in partially observable stochastic domains – Cassandra, Kaelbling, et al. - 1994
189 The complexity of Markov decision processes – Papadimitriou, Tsitsiklis - 1987
187 Exploiting structure in policy construction – Boutilier, Dearden, et al. - 1995
182 Operations for learning with graphical models – Buntine - 1994
182 Acting under uncertainty: Discrete Bayesian models for mobile-robot navigation – Cassandra, Kaelbling, et al. - 1996
175 Learning Policies for Partially Observable Environments: Scaling Up – Littman, Cassandra, et al. - 1995
164 Bayesian analysis in expert systems – Spiegelhalter, Dawid, et al. - 1993
155 Reinforcement learning with perceptual aliasing: The Perceptual Distinctions Approach – Chrisman - 1992
155 The EM algorithm for graphical association models with missing data – Lauritzen - 1995
147 Learning to predict by the methods of temporal di erences – Sutton - 1988
143 A survey of algorithmic methods for partially observed Markov decision processes”, Annals of Operations Research – Lovejoy - 1991
142 Adaptive Control Processes – Bellman - 1961
137 Residual algorithms: Reinforcement learning with function approximation – Baird - 1995
135 Principles of metareasoning – Russell, Wefald - 1991
134 Stochastic simulation algorithms for dynamic probabilistic networks – Kanazawa, Koller, et al. - 1995
133 Planning with deadlines in stochastic domains – Dean, Kaelbling, et al. - 1993
132 A Survey of Partially Observable Markov Decision Processes: Theory, Models, and Algorithms – Monahan - 1982
131 Algorithms for Sequential Decision Making – Littman - 1996
128 Dynamic Programming and Optimal Control – Bertsekas - 1995
125 Stable function approximation in dynamic programming – Gordon - 1995
125 Simulation approaches to general probabilistic inference on belief networks – Shachter, Peot - 1989
118 Approximating optimal policies for partially observable stochastic domains. (unpublished manuscript – Parr, Russell - 1995
113 Expert Systems and Probabilistic Network Models. Springer-Verlag – Castillo, Hadi - 1996
109 Real-time learning and control using asynchronous dynamic programming – Barto, Bradtke, et al. - 1991
94 Decomposition techniques for planning in stochastic domains – Dean, Lin - 1995
94 Overcoming incomplete perception with utile distinction memory – McCallum
93 Learning without stateestimation in partially observable Markovian decision problems – Singh, Jaakkola, et al. - 1994
84 A theory of cerebellar function – Albus - 1971
84 Model minimization in Markov decision processes – Dean, Givan - 1997
84 On the Complexity of Solving Markov Decision Problems – Littman, Dean, et al. - 1995
84 Feature-based methods for large scale dynamic programming – Tsitsiklis, Roy - 1996
82 An algorithm for probabilistic least-commitment planning – Kushmerick, Hanks, et al. - 1994