Predicting Real-Time Planner Performance by Domain Characterization (1994)
Jak Kirman



View or download:
brown.edu/pub/techreport...cs9444.ps.Z

From: brown.edu

Abstract (table-of-contents excerpt):
...ion
2.3.7 Learning
2.3.8 Policy improvement
3 Plexus
3.1 Overview
3.2 Example domain
3.3 Definitions
3.3.1 Envelope ...

Active bibliography (related documents):
0.9:   PEST overview - Kirman (1995)
0.7:   Exploiting Structure for Planning and Control - Lin (1997)
0.4:   Decision-Theoretic Planning and Markov Decision Processes - Dean (1994)

Similar documents based on text:
0.2:   JTS: Tools for Implementing Domain-Specific Languages - Batory, Lofaso, Smaragdakis
0.2:   Using Goals to Find Plans with High Expected Utility - Kirman, Nicholson, Lejter.. (1993)
0.2:   Planning Under Time Constraints in Stochastic Domains - Dean, Kaelbling, Kirman.. (1995)

BibTeX entry:

@techreport{ kirman94predicting,
    author = "Jak Kirman",
    title = "Predicting Real-Time Planner Performance by Domain Characterization",
    number = "CS-94-44",
    year = "1994",
    url = "citeseer.ist.psu.edu/kirman94predicting.html" }
Citations (may not include all citations):
1364   A Robust Layered Control System for a Mobile Robot - Brooks - 1985
658   Learning from Delayed Rewards - Watkins - 1989
402   An analysis of time-dependent planning - Dean, Boddy - 1988
365   Pengi: An implementation of a theory of activity - Agre, Chapman - 1987
276   Finite Markov Chains - Kemeny, Snell - 1960
276   Finite Markov Chains - Kemeny, Snell - 1976
268   Dynamic Programming and Markov Processes - Howard - 1966
257   Learning to act using real-time dynamic programming - Barto, Bradtke et al. - 1993
246   Markov Decision Processes - Puterman - 1994
213   Universal plans for reactive robots in unpredictable environ.. - Schoppers - 1987
148   Acting optimally in partially observable stochastic domains - Cassandra, Kaelbling et al. - 1994
148   A First Course in Stochastic Processes - Karlin, Taylor - 1974
146   An algorithm for probabilistic planning - Kushmerick, Hanks et al. - 1994
144   Econometric analysis - Greene - 1993
140   Probabilistic planning with information gathering and contin.. - Draper, Hanks et al. - 1994
133   and reacting based on approximating dynamic programming - Sutton, for et al. - 1990
120   Planning with deadlines in stochastic domains - Dean, Kaelbling et al. - 1993
101   Applied Probability Models with Optimization Applications - Ross - 1970
90   Planning under time constraints in stochastic domains - Dean, Kaelbling et al. - 1994
87   Anytime synthetic projection: Maximizing the probability of .. - Drummond, Bresina - 1990
84   Princeton University Press - Bellman - 1957
81   Circa: A cooperative intelligent real-time control architect.. - Musliner, Durfee et al. - 1993
72   Using abstractions for decision theoretic planning with time.. - Boutilier, Dearden - 1994
65   Stochastic Models in Operations Research - Heyman, Sobel - 1982
56   Conductance and the rapid mixing property for markov chains:.. - Jerrum, Sinclair - 1988
53   Tight performance bounds on greedy policies based on imperfe.. - Williams, Baird - 1993
52   A reinforcement learning method for maximizing undiscounted .. - Schwartz - 1993
51   World modeling for the dynamic construction of real-time con.. - Musliner, Durfee et al. - 1993
50   Efficient learning and planning within the dyna framework - Peng, Williams - 1993
48   Dynamic Programming and Stochastic Control - Bertsekas - 1976
47   Search reduction in hierarchical problem solving - Knoblock - 1991
45   Conductance and convergence of markov chains --- a combinato.. - Mihail - 1989
29   Scientific American - Gardner - 1973
23   Real-time heuristic search: First results - Korf - 1987
22   Adaptive aggregation for infinite horizon dynamic programmin.. - Bertsekas, Castanon - 1989
19   A model of reaction for planning in dynamic environments - Sanborn - 1988
17   Memory-based reinforcement learning: Efficient computation w.. - Moore, Atkeson - 1993
16   Consideration of risk in reinforcement learning - Heger - 1994
15   Toward approximate planning in very large stochastic domains - Nicholson, Kaelbling - 1994
13   Deliberation scheduling for time-critical sequential decisio.. - Dean, Kaelbling et al. - 1993
13   Exploiting locality in temporal reasoning - Lin, Dean - 1994
12   Sensor abstractions for control of navigation - Kirman, Basye et al. - 1991
11   Mathematical programming and the control of markov chains - Kushner, Kleinman - 1971
8   Sequential decision making for active perception - Dean, Camus et al. - 1990
7   Recovering from execution errors in sipe - Wilkins - 1985
5   Algorithms for partially observable markov decision processe.. - Cassandra, Kaelbling et al. - 1994
5   A framework for map construction - Basye - 1992
4   A Diary on Information Theory - Rényi - 1984
3   Lebesgue Integration and Measure - Weir - 1973
3   Economic Forecasts and Policy - Theil - 1961
3   Plus Programmer's manual - Sciences - 1993
2   Active sensor control for mobile robotics - Leonard, Durrant-Whyte - 1989
1   Finite State Markovian Decision Processes - Derman - 1970
1   Model-free reinforcement learning for non-markovian decision.. - Satinder, Singh et al. - 1994
1   Computationally efficient algorithms for on-line optimizatio.. - Jalali, Ferguson - 1992

Documents on the same site (http://www.math.jussieu.fr/~fermigie/fermivista/ftp/ftp.cs.brown.edu.html):
Indexing for Data Models with Constraints and Classes - Kanellakis, Ramaswamy.. (1993)
Reinforcement Learning for Planning and Control - Dean, Basye, Shewchuk (1993)
An Interactive 3D Toolkit for Constructing 3D Widgets - Zeleznik, Herndon..

CiteSeer.IST - Copyright Penn State and NEC