Find:
Searching for PHRASE sutton rich.
Restrict to:   Header   Title   Order by:   Expected citations   Hubs   Usage   Date   Try:   Google (CiteSeer)   Google (Web)   Yahoo!   MSN   CSB   DBLP
39 documents found. Order: number of citations.

Learning to Predict by the Methods of Temporal Differences - Sutton (1988)   (Correct)   (543 citations)
the Methods of Temporal Differences RICHARD S. SUTTON (RICH@GTE.COM) GTE Laboratories Incorporated, 40
ftp.cs.umass.edu/pub/anw/pub/sutton/sutton-88.ps.gz

Learning to Act using Real-Time Dynamic Programming - Barto, Bradtke, Singh (1993)   (Correct)   (230 citations)
between heuristic search and control. We thank Rich Sutton, Chris Watkins, Paul Werbos, and Ron Williams
numerous discussions, and we further thank Rich Sutton for first making us aware of Korf's research

ftp.cis.ohio-state.edu/pub/neuroprose/barto.realtime-dp.ps.Z

Reinforcement Learning with Replacing Eligibility Traces - Singh (1996)   (Correct)   (68 citations)
Technology, Cambridge, Mass. 02139 RICHARD S. SUTTON rich@cs.umass.edu Dept. of Computer Science
www.cs.jhu.edu/~sheppard/cs.605.754/papers/paper11b.ps.gz

Algorithms for Sequential Decision Making - Littman (1996)   (Correct)   (62 citations)
and an impressive role model. I believe that Rich Sutton, more than anyone, makes the field of
ftp.cs.brown.edu/pub/techreports/96/cs96-09.ps.Z

Learning and Problem Solving with Multilayer Connectionist Systems - Anderson (1986)   (Correct)   (48 citations)
Through the years of working with Andy Barto and Rich Sutton, I have observed many instances of "flu#
is based on graphics tools provided by Rich Sutton and Andy Cromarty. Most importantly, I thank

www.cs.colostate.edu/~anderson/res/rl/chuck-diss.pdf

Learning To Solve Markovian Decision Processes - Singh (1994)   (Correct)   (26 citations)
he gave me to pursue my own interests. Thanks to Rich Sutton for many inspiring conversations, for his
ftp.cs.colorado.edu/users/baveja/Papers/Thesis.ps.gz

Problem Solving With Reinforcement Learning - Rummery (1995)   (Correct)   (24 citations)
been Chris Watkins and Tim Jervis. I also owe Rich Sutton an apology for continuing to use the name
these bounds is an open question. 3Though Rich Sutton suggests SARSA, as you need to knov

svr-www.eng.cam.ac.uk/reports/svr-ftp/rummery_thesis.ps.Z

Reinforcement Learning And Its Application To Control - Gullapalli (1992)   (Correct)   (22 citations)
interactions with Chuck Anderson, Judy Franklin, Rich Sutton, and others at GTE Labs. Kamal Souccar's
ftp.cs.umass.edu/pub/techrept/techreport/1992/UM-CS-1992-010.ps

Hierarchical Learning with Procedural Abstraction Mechanisms - Rosca (1997)   (Correct)   (21 citations)
cats while having me for long GP conversations Rich Sutton for long discussions and patience to listen
ftp.cs.rochester.edu:21/pub/u/rosca/gp/jrphdd.ps.gz

Symbiotic Evolution of Neural Networks in Sequential Decision Tasks - Moriarty (1997)   (Correct)   (20 citations)
Mitch Potter, Alan Schultz, Jude Shavlik, and Rich Sutton. And finally, I must give thanks to the people
ftp.cs.utexas.edu/pub/neural-nets/papers/moriarty.diss.tr257.ps.Z

Analysis of Some Incremental Variants of Policy Iteration: .. - Williams, Baird, III (1993)   (Correct)   (19 citations)
impact on this work. Special thanks also to Rich Sutton, who has influenced our thinking on this
www.cs.cmu.edu/afs/cs.cmu.edu/user/leemon/www/papers/actorc/actorc.ps

Scaling Reinforcement Learning toward RoboCup Soccer - Stone (2001)   (Correct)   (14 citations)
Austin, Texas 78712-1188 U.S.A. Richard S. Sutton rich@richsutton.com Stow Research Abstract
Texas 78712-1188 U.S.A. Richard S. Sutton rich@richsutton.com Stow Research Abstract RoboCup simulated

www.cs.utexas.edu/users/pstone/Papers/2002jair/keepaway.ps.gz

Large-Scale Dynamic Optimization Using Teams of Reinforcement.. - Crites (1996)   (Correct)   (9 citations)
John McNulty, Satinder Singh, Rich Sutton, and Richard Yee. Christos Cassandras kindly provided
Guzman-Lara, John McNulty, Satinder Singh, Rich Sutton, and Richard Yee. Christos Cassandras kindly

ftp.cs.umass.edu/pub/anw/pub/crites/root.ps.Z

Reinforcement Learning by Policy Search - Peshkin (2001)   (Correct)   (7 citations)
Ehud Shapiro, Christian Shelton, Bill Smart, Rich Sutton, Mike Szydlo, John Tsitsiklis, Shimon Ullman,
www.ai.mit.edu/~pesha/Public/aitr.ps

Learning From Instruction And Experience: Methods For.. - Maclin (1995)   (Correct)   (6 citations)
Jim Stewart, Nick Street, Michael Streibel, Rich Sutton, Scott Swanson, Sebastian Thrun, Geoff Towell,
ftp.cs.wisc.edu/machine-learning/shavlik-group/maclin.thesis.firsthalf.ps.Z

A Mathematical Analysis Of Actor-Critic Architectures For.. - Williams (1990)   (Correct)   (5 citations)
have helped us greatly. Special thanks also to Rich Sutton, who has influenced our thinking on this
www.cs.cmu.edu/afs/cs.cmu.edu/user/leemon/www/papers/yale90/yale90.ps

Model-Based Reinforcement Learning with an Approximate.. - Leonid Kuvayev Rich (1996)   (Correct)   (2 citations)
an Approximate, Learned Model Leonid Kuvayev Rich Sutton Department of Computer Science University of
ftp.cs.umass.edu/pub/anw/pub/kuvayev/kuvayev-sutton-ml97.ps

Between MDPs and Semi-MDPs: Learning, Planning, and.. - Sutton, Precup, Singh (1998)   (Correct)   (2 citations)
Knowledge at Multiple Temporal Scales Richard S. Sutton rich@cs.umass.edu Doina Precup
ftp.cs.umass.edu/pub/anw/pub/sutton/SPS-98.ps.gz

First 20 documents  Next 20

Try your query at:   Google (CiteSeer)   Google (Web)   Yahoo!   MSN   CSB   DBLP

CiteSeer.IST - Copyright Penn State and NEC