|
4388
|
Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference
– Pearl
- 1988
|
|
2141
|
Learning Internal Representations by Error Propagation
– Rumelhart, Hinton, et al.
- 1986
|
|
1877
|
Artificial Intelligence: A Modern Approach
– Russell, Norvig
- 2002
|
|
1399
|
Dynamic Programming
– Bellman
- 1957
|
|
939
|
Learning from Delayed Rewards
– Watkins
- 1989
|
|
887
|
Reinforcement learning: A survey
– Kaelbling, Littman, et al.
- 1996
|
|
583
|
An Introduction to Bayesian Networks
– Jensen
- 1996
|
|
548
|
Markov decision processes : discrete stochastic dynamic programming
– Puterman
- 1994
|
|
545
|
An introduction to hidden markov models
– Rabiner, Juang
- 1986
|
|
539
|
Graphical models
– Lauritzen
- 1996
|
|
377
|
Neuronlike adaptive elements that can solve difficult learning control problems
– Barto, Sutton, et al.
- 1983
|
|
375
|
Integrated Architectures for Learning, Planning and Reacting based on Approximate Dynamic Programming. Appeared
– Sutton
- 1990
|
|
353
|
Dynamic Programming and Markov Processes
– Howard
- 1960
|
|
292
|
Neuro-Dynamic Programming
– Bertsekas, Tsitsiklis
- 1996
|
|
291
|
Planning and Control
– Dean, Wellman
- 1991
|
|
235
|
uence diagrams
– Howard, Matheson
- 1981
|
|
221
|
The optimal control of Partially Observable Markov Processe
– Sondik
- 1971
|
|
212
|
Introduction to applied mathematics
– Strang
- 1986
|
|
210
|
Acting optimally in partially observable stochastic domains
– Cassandra, Kaelbling, et al.
- 1994
|
|
189
|
The complexity of Markov decision processes
– Papadimitriou, Tsitsiklis
- 1987
|
|
187
|
Exploiting structure in policy construction
– Boutilier, Dearden, et al.
- 1995
|
|
182
|
Operations for learning with graphical models
– Buntine
- 1994
|
|
182
|
Acting under uncertainty: Discrete Bayesian models for mobile-robot navigation
– Cassandra, Kaelbling, et al.
- 1996
|
|
175
|
Learning Policies for Partially Observable Environments: Scaling Up
– Littman, Cassandra, et al.
- 1995
|
|
164
|
Bayesian analysis in expert systems
– Spiegelhalter, Dawid, et al.
- 1993
|
|
155
|
Reinforcement learning with perceptual aliasing: The Perceptual Distinctions Approach
– Chrisman
- 1992
|
|
155
|
The EM algorithm for graphical association models with missing data
– Lauritzen
- 1995
|
|
147
|
Learning to predict by the methods of temporal di erences
– Sutton
- 1988
|
|
143
|
A survey of algorithmic methods for partially observed Markov decision processes”, Annals of Operations Research
– Lovejoy
- 1991
|
|
142
|
Adaptive Control Processes
– Bellman
- 1961
|
|
137
|
Residual algorithms: Reinforcement learning with function approximation
– Baird
- 1995
|
|
135
|
Principles of metareasoning
– Russell, Wefald
- 1991
|
|
134
|
Stochastic simulation algorithms for dynamic probabilistic networks
– Kanazawa, Koller, et al.
- 1995
|
|
133
|
Planning with deadlines in stochastic domains
– Dean, Kaelbling, et al.
- 1993
|
|
132
|
A Survey of Partially Observable Markov Decision Processes: Theory, Models, and Algorithms
– Monahan
- 1982
|
|
131
|
Algorithms for Sequential Decision Making
– Littman
- 1996
|
|
128
|
Dynamic Programming and Optimal Control
– Bertsekas
- 1995
|
|
125
|
Stable function approximation in dynamic programming
– Gordon
- 1995
|
|
125
|
Simulation approaches to general probabilistic inference on belief networks
– Shachter, Peot
- 1989
|
|
118
|
Approximating optimal policies for partially observable stochastic domains. (unpublished manuscript
– Parr, Russell
- 1995
|
|
113
|
Expert Systems and Probabilistic Network Models. Springer-Verlag
– Castillo, Hadi
- 1996
|
|
109
|
Real-time learning and control using asynchronous dynamic programming
– Barto, Bradtke, et al.
- 1991
|
|
94
|
Decomposition techniques for planning in stochastic domains
– Dean, Lin
- 1995
|
|
94
|
Overcoming incomplete perception with utile distinction memory
– McCallum
|
|
93
|
Learning without stateestimation in partially observable Markovian decision problems
– Singh, Jaakkola, et al.
- 1994
|
|
84
|
A theory of cerebellar function
– Albus
- 1971
|
|
84
|
Model minimization in Markov decision processes
– Dean, Givan
- 1997
|
|
84
|
On the Complexity of Solving Markov Decision Problems
– Littman, Dean, et al.
- 1995
|
|
84
|
Feature-based methods for large scale dynamic programming
– Tsitsiklis, Roy
- 1996
|
|
82
|
An algorithm for probabilistic least-commitment planning
– Kushmerick, Hanks, et al.
- 1994
|