See this document in CiteSeerX!

Pseudo-convergent Q-Learning by Competitive Pricebots (2000)  (Make Corrections)  (1 citation)
Jeffrey O. Kephart, Gerald J. Tesauro
Proc. 17th International Conf. on Machine Learning



  Home/Search   Context   Related

 
View or download:
ibm.com/infoecon/paps/simulq.ps
ibm.com/infoecon/paps/simulq.ps.gz
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  ibm.com/infoecon...researchpapers (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: We study novel aspects of multi-agent Qlearning in a model market in which two identical, competing "pricebots" strategically price a commodity. Two fundamentally different solutions are observed: an exact, stationary solution with zero Bellman error consisting of symmetric policies, and a non-stationary, broken-symmetry pseudosolution, with small but non-zero Bellman error. This "pseudo-convergent" asymmetric solution has no analog in ordinary Qlearning. We calculate analytically... (Update)

Similar documents based on text:   More   All
0.8:   To appear n Proceedings of First International Conference on.. - Char Es On   (Correct)
0.3:   Dynamic Pricing with Limited Competitor Information in a.. - Dasgupta, Das (1901)   (Correct)
0.3:   Multi-agent Q-learning and regression trees for automated.. - Sridharan, Tesauro   (Correct)

BibTeX entry:   (Update)

J.O. Kephart and G.J. Tesauro, "Pseudo-Convergent Q-Learning by Competitive Pricebots," Proc. 17th Int'l Conf. Machine Learning, Morgan Kaufmann, 2000, pp. 463-470. http://citeseer.ist.psu.edu/kephart00pseudoconvergent.html   More

@inproceedings{ kephart00pseudoconvergent,
    author = "Jeffrey O. Kephart and Gerald J. Tesauro",
    title = "Pseudo-convergent {Q}-Learning by Competitive Pricebots",
    booktitle = "Proc. 17th International Conf. on Machine Learning",
    publisher = "Morgan Kaufmann, San Francisco, CA",
    pages = "463--470",
    year = "2000",
    url = "citeseer.ist.psu.edu/kephart00pseudoconvergent.html" }
Citations (may not include all citations):
51   Shopbots and pricebots - Greenwald, Kephart - 1999
1   Workshop on Adaptation and Learning in Multiagent Systems (context) - Sandholm, Crites et al. - 1995

Documents on the same site (http://www.research.ibm.com/infoecon/researchpapers.html):   More
Foresight-Based Pricing Algorithms in an Economy of Software.. - Tesauro, Kephart (1998)   (Correct)
Price Dynamics of Vertically Differentiated Information Markets - Sairamesh, Kephart (1998)   (Correct)
Shopbot Economics - Kephart, Greenwald (1999)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC