Home     Top: Machine Learning: Reinforcement Learning    [Case-based Learning   Fuzzy Systems   Genetic Algorithms   Neural Networks   Pattern Recognition   Reinforcement Learning   Rule Based Systems   Vision]

Change ordering:   Authority   Hubs (tutorials)   Date   Expected authority       Show abstracts
Tutorials/surveys/introductory articles (ordered by the degree of citation of authoritative articles)

This directory is created automatically and some papers may be mislabeled. Only document within the CiteSeer database are listed. The directory is intended to provide entry points for browsing the database and is not intended to be authoritative. Papers may not appear in all relevant categories. For example, papers in a sub-category may not appear in higher level categories.

12028.9   Evolving Artificial Neural Networks - Yao (1999)   (Correct)
11624.3   Interaction and Intelligent Behavior - Mataric (1994)   (Correct)
9054.1   The Artificial Evolution of Adaptive Behaviour - Harvey (1995)   (Correct)
8889.2   Emotional Agents - Wright (1997)   (Correct)
8785.6   Reinforcement Learning: A Survey - Kaelbling, Littman, Moore (1996)   (Correct)
8326.2   Learning To Solve Markovian Decision Processes - Singh (1994)   (Correct)
8079.1   Machine Learning Research: Four Current Directions - Dietterich (1997)   (Correct)
7613.9   Statistical Physics of Clustering Algorithms - Graepel (1998)   (Correct)
7387.5   A Framework for Programming Embedded Systems: Initial Design and.. - Thrun (1998)   (Correct)
7090.0   Machine Learning and Natural Language Processing - Marquez (2000)   (Correct)
7067.2   Fuzzy Logic and Soft Computing: Technology Development and.. - Bonissone (1997)   (Correct)
6774.0   Symbiotic Evolution of Neural Networks in Sequential Decision Tasks - Moriarty (1997)   (Correct)
6604.4   Incremental Dynamic Programming for On-Line Adaptive Optimal Control - Steven J. Bradtke (1994)   (Correct)
6578.8   Representation and Management Issues for Case-Based Reasoning Systems - Jurisica (1993)   (Correct)
6430.8   Reinforcement Learning And Its Application To Control - Gullapalli (1992)   (Correct)
6425.2   Large-Scale Dynamic Optimization Using Teams of Reinforcement.. - Crites (1996)   (Correct)
6366.0   Representing and Learning Routine Activities - Hexmoor (1995)   (Correct)
6143.6   Cooperative Mobile Robotics: Antecedents and Directions - Cao, Fukunaga, Kahng, Meng (1995)   (Correct)
5904.5   Evolutionary Artificial Neural Networks - Yao (1993)   (Correct)
5738.6   A consideration of the biological and psychological foundations of.. - Sharkey, Ziemke (1998)   (Correct)
5311.6   Learning to Act using Real-Time Dynamic Programming - Barto, Bradtke, Singh (1995)   (Correct)
5247.9   Learning to Take Actions - Khardon (1996)   (Correct)
5147.2   A Tutorial Survey of Reinforcement Learning - Keerthi, Ravindran (1995)   (Correct)
5143.0   Exploration and Inference in Learning from Reinforcement - Wyatt (1997)   (Correct)
5000.3   The Hippocampus And Cerebellum In Adaptively Timed Learning.. - Grossberg, Merrill (1995)   (Correct)
4900.6   A Hybrid Architecture for Situated Learning of Reactive Sequential.. - Sun, Peterson, Merrill (1999)   (Correct)
4713.5   Learning Control of Complex Skills - Crawford (1998)   (Correct)
4673.2   Learning Algorithms in Neural Networks - Francisco A. Camargo (1990)   (Correct)
4636.3   Learning Procedural Planning Knowledge In Complex Environments - Pearson (1996)   (Correct)
4563.9   Unsupervised Neural Network Learning Procedures For Feature.. - Suzanna Becker, Mark Plumbley (1996)   (Correct)
4520.0   Handbook of Perception and Cognition, Vol.14 Chapter 4: Machine.. - Stuart Russell   (Correct)
4481.4   Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in.. - Sutton, Precup, Singh (1999)   (Correct)
4439.9   Stochastic Dynamic Programming with Factored Representations - Boutilier, Dearden, al. (1999)   (Correct)
4393.8   Formal Approaches to Innate and Learned Communication: Laying the.. - Oliphant (1997)   (Correct)
4215.2   The Developmental Approach to Artificial Intelligence: Concepts.. - Weng, Evans, Hwang, Lee (1999)   (Correct)
4015.4   Planning, Learning and Coordination in Multiagent Decision Processes - Boutilier   (Correct)
4009.1   Hierarchical Learning with Procedural Abstraction Mechanisms - Rosca (1997)   (Correct)
4005.8   On the Reduction of Costs for Robot Controller Synthesis - Attilio Giordana, Michael Kaiser.. (1994)   (Correct)
3970.5   Lifelong Robot Learning - Thrun, Mitchell (1993)   (Correct)
3898.1   Connectionist Adaptive Control - Jervis (1993)   (Correct)
3827.7   Continuous Case-Based Reasoning - Ram, Santamaría (1996)   (Correct)
3810.3   Approximate Solutions to Markov Decision Processes - Gordon (1999)   (Correct)
3754.6   Varieties of Helmholtz Machine - Dayan, Hinton (1996)   (Correct)
3665.5   Fuzzy Finite-state Automata Can Be Deterministically Encoded into.. - Omlin, Thornber, Giles (1998)   (Correct)
3627.2   Learning Action Strategies for Planning Domains - Khardon (1997)   (Correct)
3614.1   Multi-Agent Reinforcement Learning: Weighting and Partitioning - Sun, Peterson (1999)   (Correct)
3598.1   The Role Of Exploration In Learning Control - Thrun (1992)   (Correct)
3530.2   Learning Evaluation Functions - Justin A. Boyan (1996)   (Correct)
3526.6   Generalized Markov Decision Processes: Dynamic-programming and.. - Szepesvári, Littman (1996)   (Correct)
3500.8   Hidden State and Reinforcement Learning with Instance-Based State.. - Andrew Mccallum   (Correct)
3403.6   Towards Planning: Incremental Investigations into Adaptive Robot.. - Meeden (1994)   (Correct)
3345.7   Learning Reactive and Planning Rules in a Motivationally Autonomous.. - Donnart, Meyer (1995)   (Correct)
3341.6   Learning Models of Environments with Manifest Causal Structure - Bergman (1995)   (Correct)
3340.2   Toward a Model of Intelligence as an Economy of Idiots - Baum (1997)   (Correct)
3282.2   Living in a partially structured environment: How to bypass the.. - Gaussier, Revel, Joulain, Zrehen (1996)   (Correct)
3244.6   Operations Research Meets Constraint Programming: Some Achievements.. - Tsang, al. (1999)   (Correct)
3224.3   Instructable and Adaptive Web-Agents which Learn to Categorize and.. - Eliassi-Rad (1999)   (Correct)
3149.5   Autonomous Learning of Sequential Tasks: Experiments and Analyses - Ron Sun (1998)   (Correct)
3096.4   Learning to Solve Multiple Goals - Jonas Karlsson (1997)   (Correct)
3088.0   Creating Advice-Taking Reinforcement Learners - Maclin, Shavlik (1996)   (Correct)
3006.6   HQ-Learning - Wiering, Schmidhuber (1997)   (Correct)
2988.5   Hybrid Neural Systems - Wermter, Sun (2000)   (Correct)
2973.4   A Family of Stochastic Methods For Constraint Satisfaction and.. - Tsang, Wang, Davenport, Voudouris.. (1999)   (Correct)
2946.4   Robot Localization and Exploration with Agent-Centered Search - Sven Koenig   (Correct)
2917.7   A General Method For Incremental Self-Improvement And Multi-Agent.. - Jürgen Schmidhuber (1998)   (Correct)
2884.1   Adaptive Retrieval Agents: Internalizing Local Context and Scaling up .. - Menczer, Belew (1999)   (Correct)
2863.9   Evolutionary Algorithms for Reinforcement Learning - Moriarty, Schultz, Grefenstette (1999)   (Correct)
2843.2   Co-Learning in Differential Games - John W. Sheppard   (Correct)
2839.0   The New Wave in Robot Learning - Noel Sharkey (1997)   (Correct)
2800.6   Model-Based Learning for Mobile Robot Navigation from the Dynamical.. - Tani (1996)   (Correct)
2793.4   Learning, Action, and Consciousness: A Hybrid Approach toward.. - Sun (1996)   (Correct)
2790.9   Individual Learning of Coordination Knowledge - Sen, Sekaran (1998)   (Correct)
2769.6   Making the World Differentiable: On Using Self-Supervised Fully.. - Jürgen Schmidhuber (1990)   (Correct)
2766.9   A Teaching Strategy for Memory-Based Control - John Sheppard (1997)   (Correct)
2760.6   An Approach to Learning Mobile Robot Navigation - Thrun (1995)   (Correct)
2736.8   Learning Controllers for Industrial Robots - Baroglio, al. (1996)   (Correct)
2718.4   A Hybrid Agent Architecture For Reactive Sequential Decision Making - Sun, Peterson (1997)   (Correct)
2717.3   Experience-Based Creativity - Levinson (1991)   (Correct)
2661.7   Shifting Inductive Bias with Success-Story Algorithm, Adaptive Levin.. - Jürgen Schmidhuber, Jieyu Zhao.. (1997)   (Correct)
2657.1   A Study of Reinforcement Learning in the Continuous Case by the Means .. - Munos (1999)   (Correct)
2654.7   A Distributed User Adaptive Neuro-Fuzzy Controller Application for.. - Schildt, Zainzinger (1998)   (Correct)
2608.6   Some Studies in Distributed Machine Learning and Organizational Design - Weiss (1994)   (Correct)
2578.0   Robot Shaping: Developing Situated Agents through Learning - Dorigo, Colombetti (1993)   (Correct)
2572.9   Reinforcement Learning With Self-Modifying Policies - Jürgen Schmidhuber, Jieyu Zhao.. (1997)   (Correct)
2569.2   A Unified Analysis of Value-Function-Based Reinforcement-Learning.. - Szepesvári, Littman (1998)   (Correct)
2562.4   A Constructive Connectionist Approach Towards Continual Robot Learning - Großmann, Poli (1997)   (Correct)
2561.0   Generalization - Wah (1998)   (Correct)
2545.3   Complexity Analysis of Real-Time Reinforcement Learning Applied to.. - Koenig, Simmons (1997)   (Correct)
2543.9   Evolutionary Robotics: Exploiting the full power of self-organization - Nolfi (1998)   (Correct)
2539.1   Case-based reactive navigation: A case-based method for on-line.. - Ram, Arkin, Moorman, Clark   (Correct)
2516.2   Average Reward Reinforcement Learning: Foundations, Algorithms, and.. - Mahadevan (1996)   (Correct)
2506.3   Life, Mind and Robots. The Ins and Outs of Embodied Cognition - Sharkey, Ziemke (1999)   (Correct)
2504.2   Module-Based Reinforcement Learning: Experiments with a Real Robot - Kalmár, al. (1998)   (Correct)
2490.8   Learning to Control Dynamic Systems Via Associative Reinforcement.. - Vijaykumar Gullapalli Computer   (Correct)
2489.5   Modular Neural Networks for Learning Context-Dependent Game Strategies - Boyan (1992)   (Correct)
2486.8   Probabilistic Knowledge Base Validation - Gleason (1995)   (Correct)
2463.6   Learning With Uninterpreted Sensors And Effectors - Pierce (1995)   (Correct)
2438.2   Large-Scale Planning Under Uncertainty: A Survey - Littman, Majercik   (Correct)
2419.9   An Application of Reinforcement Learning to Dialogue Strategy.. - Walker (2000)   (Correct)
2405.6   Parallel Cooperative Classifier Systems: A proposal for a unifying.. - Antonella Giani (1997)   (Correct)
2390.9   Incorporating Advice into Agents that Learn from Reinforcements - Richard Maclin (1994)   (Correct)
2382.6   Building Agent Teams Using an Explicit Teamwork Model and Learning - Tambe, Adibi, Al-Onaizan, Kaminka.. (1998)   (Correct)
2359.8   Learning Situation-Specific Coordination in Cooperative Multi-agent.. - Prasad, Lesser (1999)   (Correct)
2353.8   Efficient Reinforcement Learning through Symbiotic Evolution - Moriarty, Miikkulainen (1996)   (Correct)
2353.4   Neural vehicles - Kröse, van Dam (1997)   (Correct)
2322.7   Symbiotic Evolution of Neural Networks - Vogiatzis (1994)   (Correct)
2303.3   Multistrategy Learning of Adaptive Reactive Controllers - Santamaria, Ram   (Correct)
2298.9   Some Experiments with a Hybrid Model for Learning Sequential Decision .. - Ron Sun (1998)   (Correct)
2298.0   Simple Principles Of Metalearning - Jürgen Schmidhuber, Jieyu Zhao.. (1996)   (Correct)
2266.4   Reinforcement Learning Through Gradient Descent - Baird, III (1999)   (Correct)
2241.8   Modelling Intelligent Behaviour: The Markov Decision Process Approach - Geffner   (Correct)
2235.1   First Results with Instance-Based State Identification for.. - McCallum (1994)   (Correct)
2234.2   Reactive Search: Toward Self-Tuning Heuristics - Battiti (1996)   (Correct)
2233.7   Memory Based Learning of Pursuit Games - Sheppard, Salzberg   (Correct)
2211.6   Efficient Exploration In Reinforcement Learning - Sebastian B. Thrun (1992)   (Correct)
2195.2   Reactive Search, a history-based heuristic for MAX-SAT - Battiti, Protasi (1996)   (Correct)
2190.3   W-learning: Competition among selfish Q-learners - Mark Humphrys (1995)   (Correct)
2189.1   BISMARC: A Biologically Inspired System for Map-based Autonomous.. - Huntsberger, Rose (1998)   (Correct)
2179.5   Exploration Strategies for Model-based Learning in Multi-agent Systems - Carmel, Markovitch (1997)   (Correct)
2177.2   Investigating Fault Tolerance in Artificial Neural Networks - Bolt (1991)   (Correct)
2177.2   Learning Navigational Behaviors using a Predictive Sparse Distributed .. - Rao, Fuentes (1996)   (Correct)
2176.3   A Generalized Reinforcement-Learning Model: Convergence and.. - Littman, Szepesvári (1996)   (Correct)
2155.2   An incremental approach to developing intelligent neural network.. - Meeden (1995)   (Correct)
2134.5   HQ-Learning: Discovering Markovian Subgoals For Non-Markovian.. - Wiering, Schmidhuber (1996)   (Correct)
2131.7   Self-Segmentation of Sequences: Automatic Formation of Hierarchies of .. - Sun (2000)   (Correct)
2131.3   Hierarchical Control and Learning for Markov Decision Processes - Parr (1998)   (Correct)
2118.9   Reinforcement Learning Soccer Teams with Incomplete World Models - Marco Wiering, Rafal P. Salustowicz, .. (1999)   (Correct)
2113.8   Adaptive Load Balancing: A Study in Multi-Agent Learning - Schaerf, Shoham, Tennenholtz (1995)   (Correct)
2107.3   Explanation-Based Learning and Reinforcement Learning: A Unified View - Dietterich, al. (1997)   (Correct)
2105.9   The learning barrier: Moving from innate to learned systems of.. - Oliphant (1998)   (Correct)
2099.8   Learning with Mixtures of Trees - Meila-Predoviciu (1999)   (Correct)
2099.6   On the Complexity of Solving Markov Decision Problems - Littman, Dean, Kaelbling (1995)   (Correct)
2092.0   Adaptive Critic Designs - Prokhorov, Wunsch (1997)   (Correct)
2088.4   Computational Design Principles for Multiple Autonomous Vehicle.. - Wan, Braspenning (1995)   (Correct)
2086.4   An Overview of Planning Under Uncertainty - Blythe (1999)   (Correct)
2071.3   Conjectural Equilibrium in Multiagent Learning - Wellman, Hu (1998)   (Correct)
2063.4   Reinforcement Learning with Replacing Eligibility Traces - Singh (1996)   (Correct)
2059.6   Convergence Results for Single-Step On-Policy Reinforcement-Learning.. - Singh, Jaakkola, al. (1998)   (Correct)
2043.9   Machine Learning for Robots: A Comparison of Different Paradigms - Mahadevan (1996)   (Correct)
2037.1   Attentional Network Streams of Synchronized 40Hz Activity in a.. - Baird, Troyer, Eeckman (1997)   (Correct)
2022.4   Machine Learning Techniques for Adaptive Logic-Based Multi-Agent.. - Alonso, Kudenko (1999)   (Correct)
2019.1   Discovery of Subroutines in Genetic Programming - Rosca, Ballard (1996)   (Correct)
2017.2   Anytime Learning and Adaptation of Structured Fuzzy Behaviors - Andrea Bonarini   (Correct)
2008.0   Learning Hierarchical Behaviors - Andre (1998)   (Correct)
2001.6   Reinforcement Learning and Animat Emotions - Wright (1996)   (Correct)
1998.8   Hierarchical learning of efficient skill application for autonomous.. - Kaiser, Dillmann (1995)   (Correct)
1998.7   What's Interesting? - Schmidhuber (1997)   (Correct)
1991.3   Automating the Construction of Internet Portals with Machine Learning - McCallum, Nigam, Rennie, Seymore   (Correct)
1991.1   Reinforcement Learning for Autonomous Three-Dimensional Object.. - Paletta, Prantl, Pinz (1998)   (Correct)
1985.4   Learning Social Behaviors - Mataric (1997)   (Correct)
1972.1   On Learning How to Learn Learning Strategies - Schmidhuber (1995)   (Correct)
1969.1   Learning Algorithms for Networks with Internal and External Feedback - Jürgen Schmidhuber (1990)   (Correct)
1956.5   Model-based Learning of Interaction Strategies in Multi-agent Systems - Carmel, Markovitch (1997)   (Correct)
1937.2   On Training Automated Agents - Clouse (1995)   (Correct)
1927.9   Active object recognition by view integration and reinforcement.. - Paletta, Pinz (1998)   (Correct)
1925.2   Characterizing the Benefits of Model-Based Vs. Direct-Control.. - Schuurmans, Greenwald   (Correct)
1922.6   ALECSYS and the AutonoMouse: Learning to Control a Real Robot by.. - Dorigo (1995)   (Correct)
1918.6   Dynamic Non-Bayesian Decision Making - Dov Monderer (1997)   (Correct)
1897.2   Learning Team Strategies: Soccer Case Studies - Rafal P. Salustowicz, Marco A.. (1998)   (Correct)
1895.7   An On-Line Method to Evolve Behavior and to Control a Miniature Robot .. - Nordin, Banzhaf (1997)   (Correct)
1879.2   Reinforcement Learning for Planning and Control - Dean, Basye, Shewchuk (1993)   (Correct)
1876.1   Solving Semi-Markov Decision Problems using Average Reward.. - Das, Gosavi, Mahadevan, Marchalleck. (1999)   (Correct)
1870.1   PANIC: A Parallel Evolutionary Rule Based System - Antonella Giani (1995)   (Correct)
1865.1   Neural Sequence Chunkers - Jürgen Schmidhuber (1991)   (Correct)
1864.4   Instance-Based Utile Distinctions for Reinforcement Learning with.. - Andrew Mccallum (1995)   (Correct)
1863.0   Robot Shaping: Experiment In Behavior Engineering - Dorigo, Colombetti (1997)   (Correct)
1862.3   Reinforcement Learning in Non-Markov Environments - Whitehead, Lin (1992)   (Correct)
1853.4   Induction of decision trees using RELIEFF - Kononenko, Simec (1995)   (Correct)
1851.9   A Multiagent Framework for Planning, Reacting, and Learning - Weiss (1999)   (Correct)
1848.9   Prioritized Sweeping: Reinforcement Learning with Less Data and Less.. - Moore, Atkeson (1993)   (Correct)
1841.7   Combining Genetic Algorithms with Memory Based Reasoning - Sheppard, Salzberg (1995)   (Correct)
1832.5   Representation of behavioral history for learning in nonstationary.. - Mataric (1999)   (Correct)
1828.2   On the Computational Economics of Reinforcement Learning - Barto, Singh (1990)   (Correct)
1825.6   Probabilistic Incremental Program Evolution - Rafal Salustowicz, Jürgen Schmidhuber (1997)   (Correct)
1808.7   Learning Decision Strategies with Genetic Algorithms - John Grefenstette (1992)   (Correct)
1807.6   Inductive Learning of Reactive Action Models - Benson (1995)   (Correct)
1805.6   A Lifelong Learning Perspective for Mobile Robot Control - Thrun (1994)   (Correct)
1799.4   Evolutionary Robotics in Behavior Engineering and Artificial Life - Floreano   (Correct)
1793.1   Reinforcement Learning & Artificial Neural Networks - The Optimal.. - Kavehercy (1996)   (Correct)
1793.0   Online Learning with Random Representations - Sutton, Whitehead (1993)   (Correct)
1772.6   W-learning: A simple RL-based Society of Mind - Mark Humphrys University   (Correct)
1771.8   Reward Functions for Accelerated Learning - Maja Mataric (1994)   (Correct)
1766.3   Module Based Reinforcement Learning for a Real Robot - Kalmár, Szepesvári, Lorincz   (Correct)

CiteSeer - citeseer.org - Terms of Service - Privacy Policy - Copyright © 1997-2002 NEC Research Institute