Doina Precup

Affiliation School of Computer Science, McGill University
Publications 119
H-index 18
Sorted by:

Publications

#Cited
568 Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning - Artificial Intelligence - 1999
85 Multi-time Models for Temporally Abstract Planning - In Advances in Neural Information Processing Systems 10 - 1997
77 Automatic basis function construction for approximate dynamic programming and reinforcement learning - Learning. Proceedings of the 23rd International Conference on Machine Learning - 2006
70 Eligibility Traces for Off-Policy Policy Evaluation - Proceedings of the Seventeenth International Conference on Machine Learning - 2000
69 Fast gradient-descent methods for temporal-difference learning with linear function approximation - In Danyluk et - 2009
63 Between MDPs and semi-MDPs: Learning, planning, and representing knowledge at multiple temporal scales - Journal of Artificial Intelligence Research - 1998
60 Off-policy temporal-difference learning with function approximation - Proceedings of the Eighteenth International Conference on Machine Learning - 2001
59 Intra-option learning about temporally abstract actions - In Proceedings of the Fifteenth International Conference on Machine Learning - 1998
58 Theoretical Results on Reinforcement Learning with Temporally Abstract Behaviors - - 1998
49 Horde: a scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction - In Proceedings of 10th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), May 2– 6 - 2011
33 Learning Options in Reinforcement Learning - Lecture Notes in Computer Science - 2002
33 A convergent form of approximate policy iteration - Adv. Neural Information Proc. Systems - 2003
29 Active learning in partially observable Markov decision processes - In ECML - 2005
24 Activity and Gait Recognition with Time-Delay Embeddings - in Proceedings of AAAI - 2010
23 Improved Switching among Temporally Abstract Actions - Advances in Neural Information Processing Systems 11 - 1999
23 Using Options for Knowledge Transfer in Reinforcement Learning - - 1999
21 Planning with Closed-Loop Macro Actions - In Working notes of the 1997 AAAI Fall Symposium on Model-directed Autonomous Systems - 1997
17 Constructive Function Approximation - - 1997
16 Bounding Performance Loss in Approximate MDP Homomorphisms -
15 Combining TD-learning with cascade-correlation networks - In Proceedings of the Twentieth International Conference on Machine Learning - 2003

View completed publications >>