Abstract (table-of-contents excerpt):
...ion
2.3.7 Learning
2.3.8 Policy improvement
3 Plexus
3.1 Overview
3.2 Example domain
3.3 Definitions
3.3.1 Envelope ...
Active bibliography (related documents):
0.9: PEST overview - Kirman (1995)
0.7: Exploiting Structure for Planning and Control - Lin (1997)
0.4: Decision-Theoretic Planning and Markov Decision Processes - Dean (1994)
Similar documents based on text:
0.2: JTS: Tools for Implementing Domain-Specific Languages - Batory, Lofaso, Smaragdakis
0.2: Using Goals to Find Plans with High Expected Utility - Kirman, Nicholson, Lejter.. (1993)
0.2: Planning Under Time Constraints in Stochastic Domains - Dean, Kaelbling, Kirman.. (1995)
BibTeX entry:
@techreport{ kirman94predicting,
author = "Jak Kirman",
title = "Predicting Real-Time Planner Performance by Domain Characterization",
number = "CS-94-44",
year = "1994",
url = "citeseer.ist.psu.edu/kirman94predicting.html" }
Citations (may not include all citations):
1364: A Robust Layered Control System for a Mobile Robot - Brooks - 1985
658: Learning from Delayed Rewards - Watkins - 1989
402: An analysis of time-dependent planning - Dean, Boddy - 1988
365: Pengi: An implementation of a theory of activity - Agre, Chapman - 1987
276: Finite Markov Chains - Kemeny, Snell - 1960
276: Finite Markov Chains - Kemeny, Snell - 1976
268: Dynamic Programming and Markov Processes - Howard - 1966
257: Learning to act using real-time dynamic programming - Barto, Bradtke et al. - 1993
246: Markov Decision Processes - Puterman - 1994
213: Universal plans for reactive robots in unpredictable environ.. - Schoppers - 1987
148: Acting optimally in partially observable stochastic domains - Cassandra, Kaelbling et al. - 1994
148: A First Course in Stochastic Processes - Karlin, Taylor - 1974
146: An algorithm for probabilistic planning - Kushmerick, Hanks et al. - 1994
144: Econometric analysis - Greene - 1993
140: Probabilistic planning with information gathering and contin.. - Draper, Hanks et al. - 1994
133: Integrated architectures for learning, planning, and reacting based on approximating dynamic programming - Sutton - 1990
120: Planning with deadlines in stochastic domains - Dean, Kaelbling et al. - 1993
101: Applied Probability Models with Optimization Applications - Ross - 1970
90: Planning under time constraints in stochastic domains - Dean, Kaelbling et al. - 1994
87: Anytime synthetic projection: Maximizing the probability of .. - Drummond, Bresina - 1990
84: Dynamic Programming (Princeton University Press) - Bellman - 1957
81: Circa: A cooperative intelligent real-time control architect.. - Musliner, Durfee et al. - 1993
72: Using abstractions for decision theoretic planning with time.. - Boutilier, Dearden - 1994
65: Stochastic Models in Operations Research - Heyman, Sobel - 1982
56: Conductance and the rapid mixing property for Markov chains:.. - Jerrum, Sinclair - 1988
53: Tight performance bounds on greedy policies based on imperfe.. - Williams, Baird - 1993
52: A reinforcement learning method for maximizing undiscounted .. - Schwartz - 1993
51: World modeling for the dynamic construction of real-time con.. - Musliner, Durfee et al. - 1993
50: Efficient learning and planning within the Dyna framework - Peng, Williams - 1993
48: Dynamic Programming and Stochastic Control - Bertsekas - 1976
47: Search reduction in hierarchical problem solving - Knoblock - 1991
45: Conductance and convergence of Markov chains --- a combinato.. - Mihail - 1989
29: Scientific American - Gardner - 1973
23: Real-time heuristic search: First results - Korf - 1987
22: Adaptive aggregation for infinite horizon dynamic programmin.. - Bertsekas, Castanon - 1989
19: A model of reaction for planning in dynamic environments - Sanborn - 1988
17: Memory-based reinforcement learning: Efficient computation w.. - Moore, Atkeson - 1993
16: Consideration of risk in reinforcement learning - Heger - 1994
15: Toward approximate planning in very large stochastic domains - Nicholson, Kaelbling - 1994
13: Deliberation scheduling for time-critical sequential decisio.. - Dean, Kaelbling et al. - 1993
13: Exploiting locality in temporal reasoning - Lin, Dean - 1994
12: Sensor abstractions for control of navigation - Kirman, Basye et al. - 1991
11: Mathematical programming and the control of Markov chains - Kushner, Kleinman - 1971
8: Sequential decision making for active perception - Dean, Camus et al. - 1990
7: Recovering from execution errors in SIPE - Wilkins - 1985
5: Algorithms for partially observable Markov decision processe.. - Cassandra, Kaelbling et al. - 1994
5: A framework for map construction - Basye - 1992
4: A Diary on Information Theory - Rényi - 1984
3: Lebesgue Integration and Measure - Weir - 1973
3: Economic Forecasts and Policy - Theil - 1961
3: S-PLUS Programmer's Manual - Statistical Sciences - 1993
2: Active sensor control for mobile robotics - Leonard, Durrant-Whyte - 1989
1: Finite State Markovian Decision Processes - Derman - 1970
1: Model-free reinforcement learning for non-Markovian decision.. - Singh et al. - 1994
1: Computationally efficient algorithms for on-line optimizatio.. - Jalali, Ferguson - 1992
Documents on the same site (http://www.math.jussieu.fr/~fermigie/fermivista/ftp/ftp.cs.brown.edu.html):
Indexing for Data Models with Constraints and Classes - Kanellakis, Ramaswamy.. (1993)
Reinforcement Learning for Planning and Control - Dean, Basye, Shewchuk (1993)
An Interactive 3D Toolkit for Constructing 3D Widgets - Zeleznik, Herndon..