• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 10

Empirical Comparison of Gradient Descent and Exponentiated Gradient Descent in Supervised and Reinforcement Learning Doina Precup, Rich Sutton September 19,

by Mb Er, Doina Precup, Rich Sutton , 1996
"... This report describes a series of results using the exponentiated gradient descent (EG) method recently proposed by Kivinen and Warmuth. Prior work is extended by comparing speed of learning on a nonstationary problem and on an extension to backpropagation networks. Most significantly, we present an ..."
Abstract - Add to MetaCart
This report describes a series of results using the exponentiated gradient descent (EG) method recently proposed by Kivinen and Warmuth. Prior work is extended by comparing speed of learning on a nonstationary problem and on an extension to backpropagation networks. Most significantly, we present an extension of the EG method to temporal-difference and reinforcement learning. This extension is compared to conventional reinforcement learning methods on two test problems using CMAC function approximators and replace traces. On the larger of the two problems, the average loss was approximately 25% smaller for the EG method. The relative computational complexity and parameter sensitivity of the two methods is also discussed. 1. Introduction This report presents the results of several experiments designed to compare two different methods for learning the weights of neural networks: gradient descent (GD) and exponentiated gradient descent (EG), in the framework of supervised and reinforce...

Multi-time Models for Temporally Abstract Planning

by Doina Precup, R. Sutton - In Advances in Neural Information Processing Systems 10 , 1997
"... Planning Doina Precup, Richard S. Sutton University of Massachusetts Amherst, MA 01003 fdprecupjrichg@cs.umass.edu Abstract Planning and learning at multiple levels of temporal abstraction is a key problem for artificial intelligence. In this paper we summarize an approach to this problem ba ..."
Abstract - Cited by 85 (9 self) - Add to MetaCart
Planning Doina Precup, Richard S. Sutton University of Massachusetts Amherst, MA 01003 fdprecupjrichg@cs.umass.edu Abstract Planning and learning at multiple levels of temporal abstraction is a key problem for artificial intelligence. In this paper we summarize an approach to this problem

Autonomous Discovery Of Temporal Abstractions From Interaction With An Environment

by Elizabeth Amy Mcgovern, Neil E. Berthier, Roderic A. Grupen, J. Eliot, B. Moss, Elizabeth Amy Mcgovern, W. Bruce Croft, Department Chair , 2002
"... This dissertation is dedicated to my parents, Bill and Gaye, who have always loved and believed in me and to my husband, Andy, whose love and support made it possible. ACKNOWLEDGMENTS Andrew Barto has been a great thesis advisor. He has helped me to become a better researcher by shaping my critical ..."
Abstract - Cited by 51 (2 self) - Add to MetaCart
. Doina Precup and Kiri Wagstaff have been wonderful friends and supporters of my re-search. It is very helpful to have such smart women friends in CS. They provided support when I needed it and they pushed me when I needed that. I feel privileged to know Doina both as a mentor and as a friend. I thank

Regularized Reinforcement Learning with Performance Guarantees

by Mahdi Milani Fard , 2014
"... To my wife, parents and supporting friends. ii ACKNOWLEDGEMENTS I would like to thank all members of McGill’s Reasoning and Learning lab who provided me with useful thoughts and ideas throughout my graduate studies. I am particularly thankful to my supervisor, Joelle Pineau, for her relentless help ..."
Abstract - Add to MetaCart
and support over this time. I would also like to thank Doina Precup for her invaluable contributions to my research and studies at McGill University. Special thanks goes to Yuri Grinberg, Amir-massoud Farahmand and Csaba Szepesvári for their contributions to my research and publications. I am also thankful

Compressed Predictive State Representation: An Efficient Moment-Method for Sequence Prediction and Sequential Decision-Making

by William L Hamilton , 2014
"... iDedication This thesis is dedicated to my sister Julianna. iAcknowledgements I would like to thank everyone who helped and encouraged me, who con-structively challenged my ideas or even just took the time to listen to them. I am deeply grateful to my supervisor Joelle Pineau for her continual, con- ..."
Abstract - Add to MetaCart
-structive guidance, to Doina Precup for providing invaluable advice throughout my BSc and MSc, to Mahdi Milani Fard for helping me to get the theory of CPSRs off the ground, and to Borja Balle for showing me a whole new perspec-tive on my own work. Special thanks to lab members Clement Gehring, Ouais Alsharif

DEDICATION

by Arthur Guez, Yuri Grinberg, Gheorge Comanici, Monica Dinculescu, Kamal Al-marhoobi, Dorna Kashef, Guillaume Saulnier, Ryan Faulkner, Fabian Kaelin, Shao-wei Png, Phil Bachman, Pablo Samuel Castro
"... To my family and Andrée-Anne. ii ACKNOWLEDGEMENTS First and foremost, I thank my supervisor, Joelle Pineau, for the invaluable guidance, support, and insightful feedback she has provided me for the last three years, but also for trusting me to work on such a fascinating project. Additionally, I ben ..."
Abstract - Add to MetaCart
to every member of the Reasoning and Learning lab, past and present, that I had the chance to inter-act with—this includes, among others, Doina Precup, Prakash Panangaden, Robert

Extracting semantic information from Wikipedia using human computation and dimensionality reduction

by Robert West, Für Opa , 2010
"... To my beloved late grandfather Heinrich Fleischer, who loved playing with words and, a typesetter by trade, would approve of my using Fraktur letters for variable names. And to his daughter. And her husband. And their son. And his sister. ii ACKNOWLEDGEMENTS This work would not have been possible wi ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
Precup, for always encouraging ‘cool ideas’ and advising me in much of the work presented in this thesis. I am thankful to both Joelle and Doina for fostering the friendly lab atmosphere that was a key component in making my time in Montreal as enjoyable as it has been, and to all members of the Mc

ACKNOWLEDGEMENTS

by Mark Mercer , 2007
"... I would first like to thank my supervisor Denis Thérien for his patience, support, and guidance over the years. His intuition and depth of knowledge have touched every part of this thesis. Denis has provided me the funding and the opportunity to work and study at McGill. I have thoroughly enjoyed my ..."
Abstract - Add to MetaCart
would like to thank my program committee members Claude Crépeau and Doina Precup for their advice and support. I also thank my external examiner, Carlo Mereghetti, for many helpful corrections and comments. I wish to thank Patrick Hayden, Alexei Miasnokov, and John Liede for participating on my defence

Memory Based Learning in Partially Observable Markov Decision Processes with Variable Length Histories

by Yang Li, Supervisor Doina Precup
"... In the field of artificial intelligence, many are interested in finding new algorithms that enable an agent to act intelligently in a world. Planning how to act in a stochastic world is a major problem in the field. An intelligent agent must usually rely on an imperfect model of the world to plan it ..."
Abstract - Add to MetaCart
was realized during Summer 2007 and funded by a NSERC USRA supervised by Dr. Doina Precup. 1

Acknowledgments

by Masoumeh Tabaeh Izadi , 2007
"... A journey is always easier when traveled with others. I have been accompanied and sup-ported by many people throughout this work. I am pleased to have the opportunity to express my gratitude to all of them. First and foremost, I would like to acknowledge a great debt of gratitude to my thesis adviso ..."
Abstract - Add to MetaCart
advisor, professor Doina Precup whose inspiring and thoughtful guidance, and supervi-sion made my thesis work possible. Discussions and regular meetings with Doina always helped shaping my thoughts and motivated me to work hard. During the past several years Doina has imparted innumerable lessons from
Results 1 - 10 of 10
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University