• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

An introduction to collective intelligence (1999)

by D H Wolpert, K Tumer
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 57
Next 10 →

Using Collective Intelligence To Route Internet Traffic

by David H. Wolpert, Kagan Tumer, Jeremy Frank - IN ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS , 1999
"... A COllective INtelligence (COIN) is a set of interacting reinforcement learning (RL) algorithms designed in an automated fashion so that their collective behavior optimizes a global utility function. We summarize the theory of COINs, then present experiments using that theory to design COINs to cont ..."
Abstract - Cited by 50 (20 self) - Add to MetaCart
A COllective INtelligence (COIN) is a set of interacting reinforcement learning (RL) algorithms designed in an automated fashion so that their collective behavior optimizes a global utility function. We summarize the theory of COINs, then present experiments using that theory to design COINs to control internet traffic routing. These experiments indicate that COINs outperform all previously investigated RL-based, shortest path routing algorithms.

Collective Intelligence and Braess’ Paradox

by Kagan Tumer, David Wolpert - IN: PROCEEDINGS OF THE SIXTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE , 2000
"... We consider the use of multi-agent systems to control network routing. Conventional approaches to this task are based on Ideal Shortest Path routing Algorithm (ISPA), under which at each moment each agent in the network sends all of its traffic down the path that will incur the lowest cost to that t ..."
Abstract - Cited by 44 (17 self) - Add to MetaCart
We consider the use of multi-agent systems to control network routing. Conventional approaches to this task are based on Ideal Shortest Path routing Algorithm (ISPA), under which at each moment each agent in the network sends all of its traffic down the path that will incur the lowest cost to that traffic. We demonstrate in computer experiments that due to the side-effects of one agent's actions on another agent's traffic, use of ISPA's can result in large global cost. In particular, in a simulation of Braess' paradox we see that adding new capacity to a network with ISPA agents can decrease overall throughput. The theory of COllective INtelligence (COIN) design concerns precisely the issue of avoiding such side-effects. We use that theory to derive an idealized routing algorithm and show that a practical machine-learning-based version of this algorithm, in which costs are only imprecisely estimated substantially outperforms the ISPA, despite having access to less information than does the ISPA. In particular, this practical COIN algorithm avoids Braess' paradox.

General principles of learning-based multi-agent systems

by David H. Wolpert - In Proceedings of the Third International Conference of Autonomous Agents , 1999
"... We consider the problem of how to design large decentralized multiagent systems (MAS’s) in an automated fashion, with little or no hand-tuning. Our approach has each agent run a reinforcement learning algorithm. This converts the problem into one of how to automatically set/update the reward functio ..."
Abstract - Cited by 32 (5 self) - Add to MetaCart
We consider the problem of how to design large decentralized multiagent systems (MAS’s) in an automated fashion, with little or no hand-tuning. Our approach has each agent run a reinforcement learning algorithm. This converts the problem into one of how to automatically set/update the reward functions for each of the agents so that the global goal is achieved. In particular we do not want the agents to “work at cross-purposes ” as far as the global goal is concerned. We use the term artificial COllective INtelligence (COIN) to refer to systems that embody solutions to this problem. In this paper we present a summary of a mathematical framework for COINs. We then investigate the real-world applicability of the core concepts of that framework via two computer experiments: we show that our COINs perform near optimally in a difficult variant of Arthur’s bar problem [1] (and in particular avoid the tragedy of the commons for that problem), and we also illustrate optimal performance for our COINs in the leader-follower problem. 1

Analysis of dynamic task allocation in multi-robot systems

by Kristina Lerman, Chris Jones, Aram Galstyan, Maja J - International Journal of Robotics Research , 2006
"... Dynamic task allocation is an essential requirement for multi-robot systems functioning in unknown dynamic environments. It allows robots to change their behavior in response to environmental changes or actions of other robots in order to improve overall system performance. Emergent coordination alg ..."
Abstract - Cited by 31 (2 self) - Add to MetaCart
Dynamic task allocation is an essential requirement for multi-robot systems functioning in unknown dynamic environments. It allows robots to change their behavior in response to environmental changes or actions of other robots in order to improve overall system performance. Emergent coordination algorithms for task allocation that use only local sensing and no direct communication between robots are attractive because they are robust and scalable. However, a lack of formal analysis tools makes emergent coordination algorithms difficult to design. In this paper we present a mathematical model of a general dynamic task allocation mechanism. Robots using this mechanism have to choose between two types of task, and the goal is to achieve a desired task division in the absence of explicit communication and global knowledge. Robots estimate the state of the environment from repeated local observations and decide which task to choose based on these observations. We model the robots and observations as stochastic processes and study the dynamics of individual robots and the collective behavior. We analyze the effect that the number of observations and the choice of decision functions have on the performance of the system. We validate the mathematical models on a multi-foraging scenario in a multi-robot system. We find that the model’s predictions agree very closely with experimental results from sensor-based simulations. 1

Learning Sequences of Actions in Collectives of Autonomous Agents

by Kagan Tumer, Adrian K. Agogino, David H. Wolpert - In Proceedings of the First International Joint Conference on Autonomous Agents and Multi-Agent Systems , 2002
"... In this paper we focus on the problem of designing a collective of autonomous agents that individually learn sequences of actions such that the resultant sequence of joint actions achieves a predetermined global objective. We are particularly interested in instances of this problem where centralized ..."
Abstract - Cited by 29 (17 self) - Add to MetaCart
In this paper we focus on the problem of designing a collective of autonomous agents that individually learn sequences of actions such that the resultant sequence of joint actions achieves a predetermined global objective. We are particularly interested in instances of this problem where centralized control is either impossible or impractical. For single agent systems in similar domains, machine learning methods (e.g., reinforcement learners [18]) have been successfully used [1, 2, 3, 31]. However, applying such solutions directly to multi-agent systems often proves problematic, as agents may work at cross-purposes, or have difficulty in evaluating their contribution to achievement of the global objective, or both. Accordingly, the crucial design step in multiagent systems centers on determining the private objectives of each agent so that as the agents strive for those objectives, the system reaches a good global solution. In this work we consider a version of this problem involving multiple autonomous agents in a grid world. We use concepts from collective intelligence [19, 27, 30] to design goals for the agents that are "aligned" with the global goal, and are "learnable" in that agents can readily see how their behavior affects their utility. We show that reinforcement learning agents using those goals outperform both "natural" extensions of single agent algorithms and global reinforcement learning solutions based on "team games".

Ant Colony Optimization and its Application to Adaptive Routing in Telecommunication Networks

by Gianni Di Caro , 2004
"... In ant societies, and, more in general, in insect societies, the activities of the individuals, as well asofthesocietyasawhole,arenotregulatedbyanyexplicit formofcentralizedcontrol. Onthe other hand, adaptive and robust behaviors transcending the behavioral repertoire of the single individualcanbeea ..."
Abstract - Cited by 23 (13 self) - Add to MetaCart
In ant societies, and, more in general, in insect societies, the activities of the individuals, as well asofthesocietyasawhole,arenotregulatedbyanyexplicit formofcentralizedcontrol. Onthe other hand, adaptive and robust behaviors transcending the behavioral repertoire of the single individualcanbeeasilyobserved at society level. Thesecomplexglobalbehaviorsaretheresult of self-organizing dynamics driven by local interactions and communications among a number of relatively simple individuals. The simultaneous presence of these and other fascinating and unique characteristics have made ant societies an attractive and inspiring model for building newalgorithmsandnewmulti-agentsystems. Inthelastdecade,antsocietieshavebeentakenasa referenceforanevergrowingbodyof scientific work, mostly in the fields of robotics, operations research, and telecommunications. Among the different works inspired by ant colonies, the Ant Colony Optimization metaheuristic (ACO) is probably the most successful and popular one. The ACO metaheuristic is a multi-agent framework for combinatorial optimization whose main components are: a set of ant-like agents, the use of memory and of stochastic decisions, and strategies of collective and distributed learning. It finds its roots

Multiagent resource allocation with k-additive utility functions

by Yann Chevaleyre, Ulle Endriss, Sylvia Estivie, Nicolas Maudet - In Proc. DIMACS-LAMSADE Workshop on Computer Science and Decision Theory, Annales du LAMSADE , 2004
"... We briefly review previous work on the welfare engineering framework where autonomous software agents negotiate on the allocation of a number of discrete resources, and point out connections to combinatorial optimisation problems, including combinatorial auctions, that shed light on the computationa ..."
Abstract - Cited by 17 (9 self) - Add to MetaCart
We briefly review previous work on the welfare engineering framework where autonomous software agents negotiate on the allocation of a number of discrete resources, and point out connections to combinatorial optimisation problems, including combinatorial auctions, that shed light on the computational complexity of the framework. We give particular consideration to scenarios where the preferences of agents are modelled in terms of k-additive utility functions, i.e. scenarios where synergies between different resources are restricted to bundles of at most k items. Key words: negotiation, representation of utility functions, social welfare, combinatorial optimisation, bidding languages for combinatorial auctions 1

Information theory - the bridge connecting bounded rational game theory and statistical physics

by David H. Wolpert - Statistical Physics , 2004
"... A long-running difficulty with conventional game theory has been how to modify it to accommodate the bounded rationality of all real-world players. A recurring issue in statistical physics is how best to approximate joint probability distributions with decoupled (and therefore far more tractable) di ..."
Abstract - Cited by 16 (10 self) - Add to MetaCart
A long-running difficulty with conventional game theory has been how to modify it to accommodate the bounded rationality of all real-world players. A recurring issue in statistical physics is how best to approximate joint probability distributions with decoupled (and therefore far more tractable) distributions. This paper shows that the same information theoretic mathematical structure, known as Product Distribution (PD) theory, addresses both issues. In this, PD theory not only provides a principled formulation of bounded rationality and a set of new types of mean field theory in statistical physics; it also shows that those topics are fundamentally one and the same. 1

All Learning is Local: Multi-agent learning in global reward games

by Yu-han Chang, Tracey Ho, Leslie Pack Kaelbling
"... In large multiagent games, partial observability, coordination, and credit assignment persistently plague attempts to design good learning algorithms. ..."
Abstract - Cited by 13 (3 self) - Add to MetaCart
In large multiagent games, partial observability, coordination, and credit assignment persistently plague attempts to design good learning algorithms.

Fleet assignment using collective intelligence

by Nicolas E. Antoine, Stefan R. Bieniawski, Ilan M. Kroo, David H. Wolpert - In Proceedings of 42nd Aerospace Sciences Meeting , 2004
"... Airline fleet assignment involves the allocation of aircraft to a set of flights legs in order to meet passenger demand, while satisfying a variety of constraints. Over the course of the day, the routing of each aircraft is determined in order to minimize the number of required flights for a given f ..."
Abstract - Cited by 13 (11 self) - Add to MetaCart
Airline fleet assignment involves the allocation of aircraft to a set of flights legs in order to meet passenger demand, while satisfying a variety of constraints. Over the course of the day, the routing of each aircraft is determined in order to minimize the number of required flights for a given fleet. The associated flow continuity and aircraft count constraints have led researchers to focus on obtaining quasi-optimal solutions, especially at larger scales. In this paper, the authors propose the application of an agent-based integer optimization algorithm to a “cold start ” fleet assignment problem. Results show that the optimizer can successfully solve such highlyconstrained problems (129 variables, 184 constraints).
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University