DMCA
General principles of learning-based multi-agent systems (1999)
Cached
Download Links
- [web.engr.oregonstate.edu]
- [web.engr.oregonstate.edu]
- DBLP
Other Repositories/Bibliography
Venue: | In Proceedings of the Third International Conference of Autonomous Agents |
Citations: | 38 - 7 self |
Citations
4231 | Game theory - Fudenberg, Tirole - 1991 |
2518 | The tragedy of the commons
- Hardin
- 1968
(Show Context)
Citation Context ... global utility it is necessary to avoid having the agents work at cross-purposes lest phenomena like the Tragedy of the Commons (TOC) occur, in which individual avarice works to lower global utility =-=[12]-=-. One way to avoid such phenomena is by modifying the agents' utility functions via punitive legislation. A real world example of an attempt to make such a modification was the cration of anti-trust r... |
1153 | The Theory of Learning in Games - Fudenberg, Levine - 1998 |
377 | The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems,
- Claus, Boutilier
- 1998
(Show Context)
Citation Context ...tistical mechanics; ffl computational ecologies; ffl game theory , in particular, evolutionary game theory. Previous MAS's most similar to a COIN include those where agents use reinforcement learning =-=[8, 13]-=-, and/or where agents actively attempt to model the behavior of other agents [14]. 1 In this paper we introduce some of the concepts from the COIN framework, and then present experiments testing those... |
331 | Multiagent Reinforcement Learning: Theoretical Framework and an Algorithm
- Hu, Wellman
- 1998
(Show Context)
Citation Context ...tistical mechanics; ffl computational ecologies; ffl game theory , in particular, evolutionary game theory. Previous MAS's most similar to a COIN include those where agents use reinforcement learning =-=[8, 13]-=-, and/or where agents actively attempt to model the behavior of other agents [14]. 1 In this paper we introduce some of the concepts from the COIN framework, and then present experiments testing those... |
323 | Improving elevator performance using reinforcement learning
- Crites, Barto
- 1996
(Show Context)
Citation Context ...convergence using the WL reward to be far quicker than that using the GR reward. The GR reward does eventually converge to the global optimum. This is in agreement with the results obtained by Crites =-=[9]-=- for the bank of elevators control problem. However, when ff = [0 0 0 7 0 0 0] the GR reward converged in 1250 weeks. This is more than 4 times the convergence time for the WL reward. When ff = [1 1 1... |
272 | Multiagent systems - Sycara - 1998 |
270 | Coalition structure generation with worst case guarantees - Sandholm, Larson, et al. - 1999 |
185 |
Emergence of cooperation and organization in an evolutionary game
- Challet, Zhang
- 1997
(Show Context)
Citation Context ...ize each agents' local reward function) to ensure that the agents do not work at cross purposes. The problem we chose for this purpose is a more challenging variant of Arthur's bar attendance problem =-=[1, 7, 17]-=-. In this problem, agents have to determine which night in the week to attend a bar. The problem is set up so that if either too few people attend (boring evening) or too many people attend (crowded e... |
154 | A Roadmap of Agent Research and Development. Autonomous Agents and Multi-Agent Systems - Jennings, Sycara, et al. - 1998 |
139 | An introduction to collective intelligence
- Wolpert, Tumer
- 1999
(Show Context)
Citation Context ... the entire system; global performance is "robust"; one can scale up to very large systems; and one can maximally exploit the power of machine learning. We use the term COllective INtelligen=-=ce (COIN) [21, 22, 23]-=- to refer to either MAS's designed in this way, or (in the case of naturally occurring MAS's) to MAS's investigated from this perspective. The COIN framework is related to many other fields. (See [21]... |
109 | The behavior of computational ecologies - Huberman, Hogg - 1988 |
81 | Online learning about other agents in a dynamic multiagent system
- Hu, Wellman
- 1998
(Show Context)
Citation Context ...olutionary game theory. Previous MAS's most similar to a COIN include those where agents use reinforcement learning [8, 13], and/or where agents actively attempt to model the behavior of other agents =-=[14]-=-. 1 In this paper we introduce some of the concepts from the COIN framework, and then present experiments testing those concepts. The restricted version of the framework presented here is not sufficie... |
65 | Using collective intelligence to route Internet traffic
- Wolpert, Tumer, et al.
- 1999
(Show Context)
Citation Context ... the entire system; global performance is "robust"; one can scale up to very large systems; and one can maximally exploit the power of machine learning. We use the term COllective INtelligen=-=ce (COIN) [21, 22, 23]-=- to refer to either MAS's designed in this way, or (in the case of naturally occurring MAS's) to MAS's investigated from this perspective. The COIN framework is related to many other fields. (See [21]... |
39 |
Complexity in economic theory: Inductive reasoning and bounded rationality. The American Economic Review 84(2):406–411
- Arthur
- 1994
(Show Context)
Citation Context ...estigate the real-world applicability of the core concepts of that framework via two computer experiments: we show that our COINs perform near optimally in a difficult variant of Arthur's bar problem =-=[1]-=- (and in particular avoid the tragedy of the commons for that problem), and we also illustrate optimal performance for our COINs in the leader-follower problem. 1 INTRODUCTION In this paper we are int... |
35 | Economic principles of multi-agent systems, - Boutilier, Shoham, et al. - 1997 |
30 | Volatility and agent adaptability in a self-organized market
- JOHNSON, JARVIS, et al.
- 1998
(Show Context)
Citation Context ...ize each agents' local reward function) to ensure that the agents do not work at cross purposes. The problem we chose for this purpose is a more challenging variant of Arthur's bar attendance problem =-=[1, 7, 17]-=-. In this problem, agents have to determine which night in the week to attend a bar. The problem is set up so that if either too few people attend (boring evening) or too many people attend (crowded e... |
26 | A prototype model of stock exchange - Caldarelli, Marsili, et al. - 1997 |
19 | Manifesto for an Evolutionary Economics of Intelligence" in "Neural Networks and Machine Learning" Editor - Baum - 1998 |
9 | Matrix games, mixed strategies, and statistical mechanics - Berg, Engel - 1998 |
6 | The behavior of computational ecologies - Hubermann, Hogg - 1988 |
3 |
Collective intelligence for distributed control
- Wolpert, Wheeler, et al.
- 1999
(Show Context)
Citation Context ... the entire system; global performance is "robust"; one can scale up to very large systems; and one can maximally exploit the power of machine learning. We use the term COllective INtelligen=-=ce (COIN) [21, 22, 23]-=- to refer to either MAS's designed in this way, or (in the case of naturally occurring MAS's) to MAS's investigated from this perspective. The COIN framework is related to many other fields. (See [21]... |