40 citations found. Retrieving documents...
Holland, J.H., 1985. "Properties of the Bucket Brigade Algorithm," Proceedings of the 1st international Conference on Genetic Algorithms and Their Applications, ed. Grefenstette, J.J., L.E. Associates, pp. 1--7.

 Home/Search   Document Not in Database   Summary   Related Articles   Check  

This paper is cited in the following contexts:

First 50 documents

Designing Efficient Exploration with MACS: Modules and.. - Gérard, Sigaud   (Correct)

....Research on Learning Classi er Systems (LCSs) has received increasing attention over the last few years. This surge of interest has concretized itself in two di erent directions. First, a trend called classical LCSs hereafter comes from the simpli cation of Holland s initial framework [Hol85] by Wilson. The design of ZCS [Wil94] and then XCS [Wil95] resulted in a dramatic increase of LCS performance and applicability. The latter system, using the accuracy of the reward prediction as a tness measure, has proven its e ectiveness on di erent classes of problems such as adaptive behavior ....

J.H. Holland. Properties of the bucket brigade algorithm. In J.J. Grefenstette, editor, Proceedings of the 1st international Conference on Genetic Algorithms and their applications (ICGA85), pages 17. L.E. Associates, july 1985.


L'apprentissage Par Renforcement Indirect Dans Les.. - Sigaud, Gérard   (Correct)

....de l apprentissage par renforcement tabulaire, chaque classeur [Condition] Action] q modlisant plusieurs triplets (tat, action, valeur) de la Q table. En rgle gnrale, la qualit associe chaque classeur est apprise grce di rents algorithmes de propagation de la rcompense, comme Bucket Brigade [Holland, 1985] ou Q learning, dans XCS [Wilson, 1995] par exemple. Mais les algorithmes de type Q learning propagent trs lentement la qualit des di rentes situations. La qualit d un couple (tat, action) est mise jour uniquement lorsqu un agent ralise e ectivement cette action partir de cet tat, et la mise ....

Holland, J. H. (1985). Properties of the bucket brigade algorithm. Edit dans Grefenstette, J. J., editeur, Proceedings of the 1st international Conference on Genetic Algorithms and their applications (ICGA85), pages 17. L.E. Associates.


Combining Latent Learning with Dynamic Programming in the.. - Gérard, Meyer, Sigaud   (Correct)

.... The rst proposals for a LCS devoted to RL problems are presented in [Hol76] The rst implementation of an actual LCS, called CS1, can be found in [HR78] Wilson [Wil95] introduced in LCSs a learning algorithm similar to Q learning [Wat89] to replace the traditional Bucket Brigade algorithm [Hol85]. This work led to a revival of LCS research since the new accuracy based approach in XCS overcomes the over generalization problems found in previous LCSs [Wil89] The usual formal representation of RL problems is a Markov Decision Process (MDP) which is de ned by: a nite state space S; a ....

....is possible to build a model of the expected payo with a smaller number of classi ers than the number of possible triples in a tabular representation. Usually, the payo predictions p of the classi ers are learned according to di erent propagation of delayed reward algorithms, like Bucket Brigade [Hol85] or Q learning, as in XCS [Wil95] The main issue with generalization is to to organize C parts (conditions) and A parts (actions) so that the don t care symbols are well placed. When a distinction should be speci ed by a speci c attribute but is not, then the classi er is too general. It matches ....

J. H. Holland. Properties of the bucket brigade algorithm. In J. J. Grefenstette, editor, Proceedings of the 1st international Conference on Genetic Algorithms and their applications (ICGA85), pages 17. L.E. Associates, july 1985. 24


Internal Models and Anticipations in Adaptive Learning.. - Butz, Sigaud, Gérard   (2 citations)  (Correct)

....the policy representation may be more compact especially in environments in which a lot of sensations are available but only a subset of the sensations is task relevant. Recently, Wilson implemented several improvements in the LCS model. He modi ed the traditional Bucket Brigade algorithm [26] to resemble the Qlearning mechanism propagating Q values over the population of classi ers [66, 67] Moreover, Wilson drastically simpli ed the LCS model [66] Then, he modi ed Holland s original strength based criterion for learning the more a rule receives reward (on average) the more t ....

Holland, J.H.: Properties of the bucket brigade algorithm. Proceedings of an International Conference on Genetic Algorithms and their Applications (1985) 1-7


Generalization and Latent Learning in Learning Classifier Systems - Gérard   (Correct)

.... Q learning to Learning Classi er Systems The rst proposals for a LCS devoted to RL problems are presented in [Hol76] The rst LCS, called CS1, can be found in [HR78] Wilson [Wil95] introduced an algorithm similar to Q learning [Wat89] in LCSs instead of the traditional Bucket Brigade algorithm [Hol85]. This work led to a revival of LCS research since the accuracy based approach in XCS overcomes the over generalization problem in previous LCSs [Wil89] The usual formal representation of RL problems is a Markov Decision Process (MDP) which is de ned by: a nite state space S; a nite set ....

....the don t care symbols, it is possible to build an expected payo map which is smaller than the number of possible triples of a tabular representation. Usually, the payo prediction p of the classi ers are set according to di erent back propagation of delayed reward algorithms like Bucket Brigade [Hol85] or Q learning, as in XCS [Wil95] The main problem is to set C and A parts so that the don t care symbols are set in the right places. To do so, LCSs usually use Genetic Algorithms (GAs) so as to evolve a population of classi ers. Each classi er is an individual which is evaluated through the ....

J.H. Holland. Properties of the bucket brigade algorithm. In J.J. Grefenstette, editor, Proceedings of the 1st international Conference on Genetic Algorithms and their applications (ICGA85), pages 17. L.E. Associates, july 1985.


Adaptivity and Learning in Intelligent Real-Time Systems - Lind, Jung, Gerber   (Correct)

....problem for confidlearn is the fact that it is not used in the typical way for predicting the value of actions, but for learning to estimate computations. Behavior processes, such as aimGoal, aimPlayer, and aimSelf do not have external, but mainly internal effects. The bucket brigade algorithm [2] was originally introduced in the context of rule based systems. Each rule is assigned a particular priority that is increased every time the rule was active. At the same time, the receiving rule must pay a particular amount of its priority to the rules that where active in the prior cycle ....

....amount of its priority to the rules that where active in the prior cycle because these rules created the context that allowed the paying rule to become active. Identifying rules with process chunks, a similar mechanism is applicable to confidences. We extend the bucket brigade algorithm [2] not only to distribute positive and negative feedback among the currently active processes and the processes that were active in the prior cycle, but also among the processes that were active at times t Gamma 2; t Gamma m. 4.2 Memory Based Reasoning: Matching Experiences For situated ....

J. H. Holland. Properties of the bucket brigade algorithm. In J. J. Grefenstette, editor, Proceedings of the First International Conference on Genetic Algorithms and Their Applications, 1985.


YACS: a new Learning Classifier System using Anticipation - Gérard, Stolzmann, Sigaud   (1 citation)  (Correct)

....within a common description so that the representation of the problem gets smaller. The rst LCS, called CS1, can be found in [Holland and Reitman, 1978] Wilson [Wilson, 1995] introduced an algorithm similar to Q learning [Watkins, 1989] in LCSs instead of the traditional Bucket Brigade algorithm [Holland, 1985]. This work led to a revival of LCS research since the accuracy based approach in XCS overcomes the problem in previous LCSs where especially deferred reward leads to over generalization [Wilson, 1989] Additional to the generalization capabilities of LCSs in policy learning tasks, an internal ....

Holland, J. (1985). Properties of the bucket brigade algorithm. In Grefenstette, J., (Ed.), Proceedings of the 1st international Conference on Genetic Algorithms and their applications (ICGA85), pages 17. L.E. Associates.


The Anticipatory Classifier System and Genetic Generalization - Butz, Goldberg, Stolzmann (2000)   (Correct)

....oe(t 1) modifying [P ] 16 t t 1 17 while(not end of one trial) 3.2 Reinforcement Learning As in XCS, our reinforcement learning approach adapts the Q learning idea in reinforcement learning (Watkins Dayan, 1992) to the ACS framework. This step away from the traditional bucket brigade (Holland, 1985) which is comparable with Sarsa in the reinforcement learning field (Sutton Barto, 1998) enables a general policy independence of the reward learning in ACS and LCSs in general. A first mathematical analysis of Q learning in generalizing systems such as LCSs can be found in Lanzi (2000b) In ....

Holland, J. H. (1985). Properties of the bucket brigade algorithm. In Proceedings of an international conference on genetic algorithms and their applications pp. 1--7. Carnegie-Mellon University, Pittsburgh, PA: John J. Grefenstette.


Adaptation by Reinforcement Learning in Cooperative Autonomous.. - Mataric (1995)   (Correct)

.... direct instruction or answers from the environment the learning is considered unsupervised (Barto 1990) Reinforcement learning has been applied to a variety of domains and problems, including maze learning (Minsky 1954) checkers (Samuel 1959) the Bucket Brigade algorithm in Classifier Systems (Holland 1985), and Temporal Differencing (Sutton 1988) It has been implemented with with a variety of algorithms ranging from table lookup to neural networks. Most recently, RL has been attempted on situated, embodied agents. The next section briefly outlines and summarizes the assumptions that were ....

Holland, J. H. (1985), Properties of the bucket brigade algorithm, in `Proceedings, International Conference on genetic Algorithms and Their Applications ', Pittsburgh, PA, pp. 1--7.


Introducing a Genetic Generalization Pressure to the.. - Butz, Goldberg.. (2000)   (7 citations)  (Correct)

....classifier already exists, its numeriosity is increased. Finally, classifiers are deleted in the action set, if the size of the resulting set is greater than the action set size threshold as . 2. 4 The Bucket Brigade Algorithm In order to realize reward learning, the bucket brigade algorithm (Holland, 1985) updates the r values of the classifiers in the resulting learning set. The update is again done with the Widrow3 Hoff delta rule with a learning rate b r : r = 1 Gamma b r ) r b r (fl max(strength(M set (t 1) r(t) fl is a discount factor. 3 The Enhancement of the ALP As explained ....

Holland, J. H. (1985). Properties of the bucket brigade algorithm. In Proceedings of an international conference on genetic algorithms and their applications (pp. 1--7). Carnegie-Mellon University, Pittsburgh, PA: John J. Grefenstette.


First Cognitive Capabilities in the Anticipatory.. - Stolzmann, Butz.. (2000)   (2 citations)  (Correct)

.... Other, more sophisticated approaches are imaginable such as representing different needs by different attributes for different reward predictions and specific drive measures to satisfy the needs (Holland Reitman, 1978) The reward measure r is updated using the bucket brigade algorithm (BBA) (Holland, 1985). It modifies the r values of the classifiers in the resulting learning set. The update is again done with the Widrow Hoff delta rule with a learning rate b r = 0:05: r = 1 Gamma b r ) r b r (fl max cl2Mset (t 1) q cl r cl ) r(t) fl = 0:95 is the discount factor. 3 The ....

Holland, J. H. (1985). Properties of the bucket brigade algorithm. In Proceedings of an international conference on genetic algorithms and their applications (pp. 1--7). Carnegie-Mellon University, Pittsburgh, PA: John J. Grefenstette.


New Challenges for an Anticipatory Classifier System.. - Butz, Goldberg.. (1999)   (Correct)

....got a mark that differs from the state S t , a new classifier is formed in order to differentiate the classifier from the states that the mark represents. The new classifier is specified in the third and fifth component and the mark is removed. Finally, the ACS uses the bucket brigade algorithm (Holland, 1985) to enable reward learning. Indeed, it does not use the more general bucket brigade algorithm that acts on a set of classifiers but it uses the implicit bucket brigade algorithm (Goldberg, 1989) where the currently active classifier makes a payment to the previously active classifier. This method ....

Holland, J. H. (1985). Properties of the bucket brigade algorithm. In Proceedings of an international conference on genetic algorithms and their applications (pp. 1--7). Carnegie-Mellon University, Pittsburgh, PA: John J. Grefenstette.


The Artificial Life Roots of Artificial Intelligence - Steels (1993)   (59 citations)  (Correct)

....of (long term) cumulative reward, i.e. return [118] A technique useful for learning temporal chains is to hand out reinforcement to the last action and from there back to previous associations which played a role. This technique is known as the bucket brigade algorithm and 51 originally due to [46]. Reinforcement learning methods have been shown to be capable of impressive learning behavior in simulations or engineering contexts [81] but there are again serious difficulties in the application to physical autonomous agents. The first major difficulty lies in the determination of the ....

Holland, J.H. (1985) Properties of the bucket brigade algorithm. In J. J. Grefenstette (ed.) Proceedings of the First International Conference on Genetic Algorithms and their appliations. Lawrence Erlbaum, Pittsburgh, Pa. p. 1-7.


Evolution of a Clustering Scheme for a Classifier System: Beyond.. - Tufts (1994)   (Correct)

....and F is the bridging classi er. Note that A F E on its own will not work. F happens to remain active at the same time as B C D, but does not actually set up the conditions for E to occur. Holland theorized that bridging classi ers would: 1) arise on their own to 2) support long chains [10]. Riolo tested the second part of this hypothesis and found that, given the existence of bridging classi ers, learning does occur more rapidly [19] I have not found any examples of bridging classi ers arising on their own in the literature. 5 Proposal hierarchical clustering I propose that a ....

John Holland. Properties of the bucket brigade algorithm. In Proceedings of the First International Conference on Genetic Algorithms and Their Applications, pages 1-7. Lawrence Erlbaum Associates, 1985.


Evolutionary Approaches To The Learning Of Fuzzy.. - Cordón, Jesus, Herrera   (Correct)

....with the environment. 2. A credit assignment system, which learns the reward associated with the individual rules with respect to the global behaviour of the RB obtained by the population. Traditionally, two different schemes have been used to assign credits: the Bucket Brigade algorithm [39], and the Profit Sharing plan [31] 3. A classifier discovering process, that generates new rules from a set of rules by means of GAs. It was Holland [37] who introduced the first ideas about CSs, and developed, jointly with Reitman in [38] the first CS, named CS 1. Since that moment, different ....

Holland, J.H. (1985), "Properties of the bucket brigade algorithm," Proceedings of the First International Conference on Genetic Algorithms (ICGA'85), pp. 1-7.


An Action-Oriented Perspective of Learning in Classifier Systems - Weiß   (Correct)

....repeated execution of this cycle makes up the overall activity of a CS. The Traditional Perspective of Learning. There are two different learning schemes for credit assignment in CSs that have been proposed in the literature: the bucket brigade (BB for short; e.g. Booker, 1982; Goldberg, 1983; Holland 1985; Riolo, 1988) and the profit sharing plan (PSP for short; Holland and Reitman, 1978; Grefenstette, 1988) The fundamental difference between these two schemes is the following: the BB is an incremental learning algorithm according to which the strengths are updated each cycle; against that, the ....

Holland, J.H. (1985). Properties of the bucket brigade algorithm. In Proceedings of the First International Conference on Genetic Algorithms and Their Applications (pp. 1--7). Pittsburgh, PA: Lawrence Erlbaum.


Picking the Best Expert from a Sequence - Bergman, Rivest   (Correct)

.... postcondition with the meaning that if the preconditions are true in the current state and the action is taken, then the postcondition will be true in the next state. These are predictive rules as in [Drescher 89] as opposed to the prescriptive rules in reinforcement learning [Watkins 89, Holland 85] or operators in Soar [Laird et al. 87] An algorithm to learn rules uses triples of previous state, S, action, A, and current state to learn. It may isolate a postcondition, P , in the current state, and generate preconditions that explain the postcondition from the previous state and action. ....

Holland, J. H. (1985), Properties of the bucket brigade algorithm, in `First International Conference on Genetic Algorithms and Their Appli- Picking the Best Expert from a Sequence 227 cations', Pittsburg, PA, pp. 1--7.


Interaction and Intelligent Behavior - Mataric (1994)   (34 citations)  (Correct)

.... (Minsky 1954) Soon thereafter, the problem of learning a scoring functions for playing checkers was successfully addressed with an RL algorithm (Samuel 1959) Subsequently, RL was applied to a variety of domains and problems, most notably in the Bucket Brigade algorithm used in Classifier Systems (Holland 1985), and in a class of learning methods based on Temporal Differencing (Sutton 1988) Reinforcement learning has been implemented with a variety of algorithms ranging from table lookup to neural networks, and on a broad spectrum of applications, including tuning parameters and playing backgammon. ....

Holland, J. H. (1985), Properties of the bucket brigade algorithm, in `Proceedings, International Conference on genetic Algorithms and Their Applications', Pittsburgh, PA, pp. 1--7.


Automatic Acquisition of Go Knowledge from Game Records.. - Kojima (1998)   (1 citation)  (Correct)

....the same length of strings, and the acquired rules are varied in the number of predicates in the IF part. Moreover, CS tends to acquire a huge number of too specific and rarely used rules as discussed in Section 4.4.2. On the other hand, CS considers rules of chains by the bucket brigade algorithm [20], while the proposed algorithm does not. In summary, CS is appropriate for gaining long rule chains with fixed knowledge representations, while the proposed algorithm for gaining a set of complex rules, which are di#cult to describe by fixed knowledge representation. Summary The di#erent features ....

John H. Holland. Properties of the bucket brigade algorithm. In John J. Grefenstette, editor, Proceedings of an International Conference on Genetic Algorithms and Their Applications, pages 1--7. Lawrence Erlbaum Associates, 1985.


Learning Organizational Roles in a Heterogeneous Multi-agent.. - Nagendra Prasad (1996)   (8 citations)  (Correct)

....In L TEAM, the concept of potential leads to different organizations and better quality results and is not a just a speedup device. A related work using classifier systems for learning suitable multi agent organizations is presented in Weiss(Weiss 1994) Multiple agents use a variant of Holland s(Holland 1985) bucket brigade algorithm to learn appropriate instantiations of hierarchical organizations. Though Weiss(Weiss 1994) studies this system in the blocks world domain, it could represent an interesting alternative to the learning mechanism we proposed in this paper for learning organizational ....

Holland, J. H. 1985. Properties of bucket brigade algorithm. In First International Conference on Genetic Algorithms and their Applications, 1--7.


Learning Model for Organizational Learning in Coexistent.. - Takadama, Nomura, al.   (Correct)

....only the local functions make diverse roles as adaptive behaviors in the groups, and effective emergent behaviors occur in the diverse roles. 3. 2 Organizational Learning Oriented Classifier System In order to employ organizational learning for swarm robots, we adopt the Classifier System (CS) Holland 85] which is a rule based machine learning system. This is because robots with CS can learn the order of behaviors for cooperation and can self organize their own roles as a sequence of actions through local interactions. As shown in Fig. 1, each robot has a CS. In CS, each if then rule called a ....

Holland J. H.: "Properties of the Bucket Brigade Algorithm", The 1st International Conference on Genetic Algorithms (ICGA), pp. 1-7, 1985.


Sistemas Clasificadores de Aprendizaje. Aproximaciones Difusas - Herrera-Viedma (1995)   (Correct)

....Michigan [24] donde cada miembro de la poblaci on representa una regla de producci on individual, y por tanto, una poblaci on es un conjunto de reglas. Cada regla tiene asociado un fitness calculado en base a un algoritmo de asignaci on de cr edito (por ejemplo el algoritmo del bucket brigade [25]) el cual se usa cuando se aplica el AG y cuando surgen conflictos entre las reglas. En la aproximaci on Pitt el AG eval ua impl icitamente reglas individuales, mientras en la aproximaci on Michigan lo hace expl icitamente de acuerdo a un algoritmo. La aproximaci on Pitt conlleva un mayor gasto ....

....tarea del sistema de AC incluye la actividad del aprendizaje por modificaci on y ajuste de los pesos de los clasificadores. Tradicionalmente se vienen usando dos diferentes esquemas para controlar la acci on del sistema de AC: 1. El Algoritmo del Bucket Brigade (ABB) Usado por ejemplo en [1, 15, 25], es un esquema de aprendizaje local que precisa pocos requerimientos computacionales, tanto de memoria como de tiempo de CPU. En un SC con este esquema, el ABB se enlaza con el sistema de RC, formando el mecanismo que conduce la competici on entre los clasificadores del SC, conocido con el nombre ....

Holland, J.H., Properties of the Bucket Brigade Algorithm. Proceedings of the First International Conference on Genetic Algoritms and Their Applications, Pittsburgh, PA: Erlbaum 1985, 1-7.


Biological Models Of Security For Virus Propagation In Computer .. - Goel, Bush (2004)   (Correct)

No context found.

Holland, J.H., 1985. "Properties of the Bucket Brigade Algorithm," Proceedings of the 1st international Conference on Genetic Algorithms and Their Applications, ed. Grefenstette, J.J., L.E. Associates, pp. 1--7.


Where Genetic Algorithms Excel - Baum, Boneh, Garrett (1995)   (4 citations)  (Correct)

No context found.

J. H. Holland: "Properties of the bucket brigade algorithm". In Proceedings of the first international conference on genetic algorithms and their applications, pp 1-7, Lawrence Erlbaum Associates (1985).


GA-MINER: Parallel Data Mining with Hierarchical Genetic.. - Flockhart (1995)   (8 citations)  (Correct)

No context found.

John H. Holland, 1985. Properties of the bucket brigade algorithm. In John J. Grefenstette, editor, Proceedings of an International Conference on Genetic Algorithms. Lawrence Erlbaum Associates (Hillsdale).

First 50 documents

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC