See this document in CiteSeerX!

Faster Temporal Credit Assignment in Learning Classifier Systems  (Make Corrections)  
Pawel Cichosz, Jan J. Mulawka
Proceedings of the First Polish Conference on Evolutionary Algorithms (KAE-96)



  Home/Search   Context   Related

 
View or download:
pw.edu.pl/pub/pawel/bblambda.ps.gz
cs.bham.ac.uk/pub/...Cichosz1996a.ps.gz
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  pw.edu.pl/~pawel/pubs (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Classifier systems are genetics-based learning systems using the paradigm of reinforcement learning. In the most challenging case of delayed reinforcement, it involves a difficult temporal credit assignment problem. Standard classifier systems solve this problem using the bucket brigade algorithm. In this paper we show how to make the temporal credit assignment process faster by augmenting this algorithm by some refinements borrowed from a related field of reinforcement learning algorithms... (Update)

Active bibliography (related documents):   More   All
0.3:   GBQL: A Novel Genetics-Based Reinforcement Learning Architecture - Cichosz, Mulawka   (Correct)
0.2:   Classifier Fitness Based on Accuracy - Wilson (1995)   (Correct)
0.2:   Q-Learning and Redundancy Reduction in Classifier Systems.. - Antonella Giani   (Correct)

Similar documents based on text:   More   All
0.7:   Lookahead and Latent Learning in ZCS - Bull   (Correct)
0.6:   Connectionist Learning Classifier System - Vasilyev (2002)   (Correct)
0.5:   Classifier Systems Learning In Dynamic Environment - Vasilyev (2001)   (Correct)

BibTeX entry:   (Update)

@inproceedings{ cichosz96faster,
    author = "Pawel Cichosz and Jan J. Mulawka",
    title = "Faster temporal credit assignment in learning classifier systems",
    booktitle = "Proceedings of the First Polish Conference on Evolutionary Algorithms ({KAE}-96)",
    year = "1996",
    url = "citeseer.ist.psu.edu/82697.html" }
Citations (may not include all citations):
658   Learning from Delayed Rewards (context) - Watkins - 1989
563   Learning to predict by the methods of temporal differences - Sutton - 1988
141   Temporal Credit Assignment in Reinforcement Learning (context) - Sutton - 1984
101   Reinforcement Learning for Robots Using Neural Networks (context) - Lin - 1993
95   Classifier systems and genetic algorithms (context) - Booker, Goldberg et al. - 1989
23   Truncating temporal differences: On the efficient implementa.. - Cichosz - 1995
21   learning and classifier systems (context) - Dorigo, Bersini - 1994
8   Fast and efficient reinforcement learning with truncated tem.. - Cichosz, Mulawka - 1995
7   Reinforcement learning algorithms based on the methods of te.. (context) - Cichosz - 1994
5   ZCS: A zeroth order classifier system (context) - Wilson - 1994

Documents on the same site (http://tichy.ipe.pw.edu.pl/~pawel/pubs.html):   More
Integrated Learning and Planning Based on Truncating Temporal.. - Cichosz   (Correct)
Truncated Temporal Differences with Function Approximation.. - Cichosz (1996)   (Correct)
An Analysis of Experience Replay in Temporal Difference Learning - Cichosz   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC