|
50
|
Adaptive and Self-Confident On-Line Learning Algorithms
– Peter Auer, Nicolò Cesa-Bianchi, Claudio Gentile
- 2000
|
|
|
Lectures on Prediction of Individual Sequences
– Gábor Lugosi
- 2001
|
|
39
|
Competitive on-line statistics
– Volodya Vovk
- 1999
|
|
24
|
Regret minimization under partial monitoring
– Nicolò Cesa-Bianchi, Gábor Lugosi, Gilles Stoltz
- 2004
|
|
204
|
The nonstochastic multiarmed bandit problem
– Peter Auer, Nicolò Cesa-bianchi, Yoav Freund, Robert E. Schapire
- 2002
|
|
157
|
Tracking the best expert
– Mark Herbster, Manfred, K. Warmuth, Gerhard Widmer, Miroslav Kubat
- 1995
|
|
40
|
From external to internal regret
– Avrim Blum, Yishay Mansour
- 2005
|
|
98
|
Regret in the On-line Decision Problem
– Dean P. Foster, Rakesh Vohra
- 1999
|
|
106
|
Adaptive Game Playing Using Multiplicative Weights
– Yoav Freund, Robert E. Schapire
|
|
11
|
On Bayesian bounds
– Arindam Banerjee
- 2006
|
|
|
Decision Making in Uncertain and Changing Environments
– Karl H. Schlag, et al.
- 2009
|
|
125
|
Online Convex Programming and Generalized Infinitesimal Gradient Ascent
– Martin Zinkevich
- 2003
|
|
73
|
General convergence results for linear discriminant updates
– Adam J. Grove, Nick Littlestone, Dale Schuurmans
- 1997
|
|
|
Regret int the On-line Decision Problem
– Dean P. Foster, Rakesh Vohra
- 1999
|
|
|
Reinforcement Learning Without Rewards
– Umar Ali Syed
- 2010
|
|
1
|
Learning, Regret minimization, and Equilibria
– Blum And Mansour, A. Blum, Y. Mansour
- 2007
|
|
28
|
Analysis of two gradient-based algorithms for on-line regression
– Nicolo Cesa-bianchi
- 1999
|
|
13
|
Regret bounds for sleeping experts and bandits
– Robert D. Kleinberg, Alexandru Niculescu-mizil, Yogeshwer Sharma
- 2008
|
|
99
|
Universal Prediction
– Neri Merhav, Meir Feder
- 1998
|