Potential-based Algorithms in On-line Prediction and Game Theory

Cached

Download Links

by Nicolo Cesa-Bianchi , Gabor Lugosi
Citations:24 - 3 self

Active Bibliography

50 Adaptive and Self-Confident On-Line Learning Algorithms – Peter Auer, Nicolò Cesa-Bianchi, Claudio Gentile - 2000
Lectures on Prediction of Individual Sequences – Gábor Lugosi - 2001
39 Competitive on-line statistics – Volodya Vovk - 1999
24 Regret minimization under partial monitoring – Nicolò Cesa-Bianchi, Gábor Lugosi, Gilles Stoltz - 2004
204 The nonstochastic multiarmed bandit problem – Peter Auer, Nicolò Cesa-bianchi, Yoav Freund, Robert E. Schapire - 2002
157 Tracking the best expert – Mark Herbster, Manfred, K. Warmuth, Gerhard Widmer, Miroslav Kubat - 1995
40 From external to internal regret – Avrim Blum, Yishay Mansour - 2005
98 Regret in the On-line Decision Problem – Dean P. Foster, Rakesh Vohra - 1999
106 Adaptive Game Playing Using Multiplicative Weights – Yoav Freund, Robert E. Schapire
11 On Bayesian bounds – Arindam Banerjee - 2006
Decision Making in Uncertain and Changing Environments – Karl H. Schlag, et al. - 2009
125 Online Convex Programming and Generalized Infinitesimal Gradient Ascent – Martin Zinkevich - 2003
73 General convergence results for linear discriminant updates – Adam J. Grove, Nick Littlestone, Dale Schuurmans - 1997
Regret int the On-line Decision Problem – Dean P. Foster, Rakesh Vohra - 1999
Reinforcement Learning Without Rewards – Umar Ali Syed - 2010
1 Learning, Regret minimization, and Equilibria – Blum And Mansour, A. Blum, Y. Mansour - 2007
28 Analysis of two gradient-based algorithms for on-line regression – Nicolo Cesa-bianchi - 1999
13 Regret bounds for sleeping experts and bandits – Robert D. Kleinberg, Alexandru Niculescu-mizil, Yogeshwer Sharma - 2008
99 Universal Prediction – Neri Merhav, Meir Feder - 1998