Find:
Searching for
PHRASE
sutton rich
.
Restrict to:
Header
Title
Order by:
Expected citations
Hubs
Usage
Date
Try:
Google (CiteSeer)
Google (Web)
Yahoo!
MSN
CSB
DBLP
39 documents found.
Order: number of citations.
Learning to Predict by the Methods of Temporal Differences - Sutton (1988)
(Correct)
(543 citations)
the Methods of Temporal Differences RICHARD S.
SUTTON (RICH
@GTE.COM) GTE Laboratories Incorporated, 40
ftp.cs.umass.edu/pub/anw/pub/sutton/sutton-88.ps.gz
Learning to Act using Real-Time Dynamic Programming - Barto, Bradtke, Singh (1993)
(Correct)
(230 citations)
between heuristic search and control. We thank
Rich Sutton
, Chris Watkins, Paul Werbos, and Ron Williams
numerous discussions, and we further thank
Rich Sutton
for first making us aware of Korf's research
ftp.cis.ohio-state.edu/pub/neuroprose/barto.realtime-dp.ps.Z
Reinforcement Learning with Replacing Eligibility Traces - Singh (1996)
(Correct)
(68 citations)
Technology, Cambridge, Mass. 02139 RICHARD S.
SUTTON rich
@cs.umass.edu Dept. of Computer Science
www.cs.jhu.edu/~sheppard/cs.605.754/papers/paper11b.ps.gz
Algorithms for Sequential Decision Making - Littman (1996)
(Correct)
(62 citations)
and an impressive role model. I believe that
Rich Sutton
, more than anyone, makes the field of
ftp.cs.brown.edu/pub/techreports/96/cs96-09.ps.Z
Learning and Problem Solving with Multilayer Connectionist Systems - Anderson (1986)
(Correct)
(48 citations)
Through the years of working with Andy Barto and
Rich Sutton
, I have observed many instances of "flu#
is based on graphics tools provided by
Rich Sutton
and Andy Cromarty. Most importantly, I thank
www.cs.colostate.edu/~anderson/res/rl/chuck-diss.pdf
Learning To Solve Markovian Decision Processes - Singh (1994)
(Correct)
(26 citations)
he gave me to pursue my own interests. Thanks to
Rich Sutton
for many inspiring conversations, for his
ftp.cs.colorado.edu/users/baveja/Papers/Thesis.ps.gz
Problem Solving With Reinforcement Learning - Rummery (1995)
(Correct)
(24 citations)
been Chris Watkins and Tim Jervis. I also owe
Rich Sutton
an apology for continuing to use the name
these bounds is an open question. 3Though
Rich Sutton
suggests SARSA, as you need to knov
svr-www.eng.cam.ac.uk/reports/svr-ftp/rummery_thesis.ps.Z
Reinforcement Learning And Its Application To Control - Gullapalli (1992)
(Correct)
(22 citations)
interactions with Chuck Anderson, Judy Franklin,
Rich Sutton
, and others at GTE Labs. Kamal Souccar's
ftp.cs.umass.edu/pub/techrept/techreport/1992/UM-CS-1992-010.ps
Hierarchical Learning with Procedural Abstraction Mechanisms - Rosca (1997)
(Correct)
(21 citations)
cats while having me for long GP conversations
Rich Sutton
for long discussions and patience to listen
ftp.cs.rochester.edu:21/pub/u/rosca/gp/jrphdd.ps.gz
Symbiotic Evolution of Neural Networks in Sequential Decision Tasks - Moriarty (1997)
(Correct)
(20 citations)
Mitch Potter, Alan Schultz, Jude Shavlik, and
Rich Sutton
. And finally, I must give thanks to the people
ftp.cs.utexas.edu/pub/neural-nets/papers/moriarty.diss.tr257.ps.Z
Analysis of Some Incremental Variants of Policy Iteration: .. - Williams, Baird, III (1993)
(Correct)
(19 citations)
impact on this work. Special thanks also to
Rich Sutton
, who has influenced our thinking on this
www.cs.cmu.edu/afs/cs.cmu.edu/user/leemon/www/papers/actorc/actorc.ps
Scaling Reinforcement Learning toward RoboCup Soccer - Stone (2001)
(Correct)
(14 citations)
Austin, Texas 78712-1188 U.S.A. Richard S.
Sutton rich
@
richsutton
.com Stow Research Abstract
Texas 78712-1188 U.S.A. Richard S.
Sutton rich
@
richsutton
.com Stow Research Abstract RoboCup simulated
www.cs.utexas.edu/users/pstone/Papers/2002jair/keepaway.ps.gz
Large-Scale Dynamic Optimization Using Teams of Reinforcement.. - Crites (1996)
(Correct)
(9 citations)
John McNulty, Satinder Singh, Rich
Sutton, and Rich
ard Yee. Christos Cassandras kindly provided
Guzman-Lara, John McNulty, Satinder Singh, Rich
Sutton, and Rich
ard Yee. Christos Cassandras kindly
ftp.cs.umass.edu/pub/anw/pub/crites/root.ps.Z
Reinforcement Learning by Policy Search - Peshkin (2001)
(Correct)
(7 citations)
Ehud Shapiro, Christian Shelton, Bill Smart,
Rich Sutton
, Mike Szydlo, John Tsitsiklis, Shimon Ullman,
www.ai.mit.edu/~pesha/Public/aitr.ps
Learning From Instruction And Experience: Methods For.. - Maclin (1995)
(Correct)
(6 citations)
Jim Stewart, Nick Street, Michael Streibel,
Rich Sutton
, Scott Swanson, Sebastian Thrun, Geoff Towell,
ftp.cs.wisc.edu/machine-learning/shavlik-group/maclin.thesis.firsthalf.ps.Z
A Mathematical Analysis Of Actor-Critic Architectures For.. - Williams (1990)
(Correct)
(5 citations)
have helped us greatly. Special thanks also to
Rich Sutton
, who has influenced our thinking on this
www.cs.cmu.edu/afs/cs.cmu.edu/user/leemon/www/papers/yale90/yale90.ps
Model-Based Reinforcement Learning with an Approximate.. - Leonid Kuvayev Rich (1996)
(Correct)
(2 citations)
an Approximate, Learned Model Leonid Kuvayev
Rich Sutton
Department of Computer Science University of
ftp.cs.umass.edu/pub/anw/pub/kuvayev/kuvayev-sutton-ml97.ps
Between MDPs and Semi-MDPs: Learning, Planning, and.. - Sutton, Precup, Singh (1998)
(Correct)
(2 citations)
Knowledge at Multiple Temporal Scales Richard S.
Sutton rich
@cs.umass.edu Doina Precup
ftp.cs.umass.edu/pub/anw/pub/sutton/SPS-98.ps.gz
First 20 documents
Next 20
Try your query at:
Google (CiteSeer)
Google (Web)
Yahoo!
MSN
CSB
DBLP
CiteSeer.IST - Copyright
Penn State
and
NEC