• Documents
  • Authors
  • Tables
  • Other Seers ▼
    RefSeer AckSeer CollabSeer SeerSeer
  • Log in
  • Sign up
  • MetaCart

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations | Disambiguate

Combinatorial Bandits

Cached

  • Download as a PDF

Download Links

  • [www.econ.upf.es]
  • [www.econ.upf.es]
  • [www.cs.mcgill.ca]
  • [www.econ.upf.es]

  • Save to List
  • Add to Collection
  • Correct Errors
  • Monitor Changes
by Nicolò Cesa-bianchi , Gábor Lugosi
Citations:11 - 4 self
  • Summary
  • Active Bibliography
  • Co-citation
  • Clustered Documents
  • Version History

BibTeX

@MISC{Cesa-bianchi_combinatorialbandits,
    author = {Nicolò Cesa-bianchi and Gábor Lugosi},
    title = {Combinatorial Bandits},
    year = {}
}

Bookmark

citeulike Connotea Bibsonomy Del.icio.us Digg Reddit

OpenURL

 

Abstract

We study sequential prediction problems in which, at each time instance, the forecaster chooses a binary vector from a certain fixed set S ⊆ {0, 1} d and suffers a loss that is the sum of the losses of those vector components that equal to one. The goal of the forecaster is to achieve that, in the long run, the accumulated loss is not much larger than that of the best possible vector in the class. We consider the “bandit ” setting in which the forecaster has only access to the losses of the chosen vectors. We introduce a new general forecaster achieving a regret bound that, for a variety of concrete choices of S, is of order √ nd ln |S | where n is the time horizon. This is not improvable in general and is better than previously known bounds. We also point out that computationally efficient implementations for various interesting choices of S exist. 1

Citations

2134 Convex Optimization - Boyd, Vandenberghe - 2004
258 J.L.: Random walks and electric networks - Doyle, Snell - 1984
253 A PolynomialTime Approximation Algorithm for the Permanent of a Matrix with Nonnegative Entries - Jerrum, Sinclair, et al.
204 The nonstochastic multiarmed bandit problem - Auer, Cesa-Bianchi, et al.
102 Efficient algorithms for online decision problems - Kalai, Vempala
80 How to get a perfectly random sample from a generic Markov chain and generate a random spanning tree of a directed graph - Propp, Wilson - 1998
67 Local characteristics, entropy and limit theorems for uniform spanning trees and domino tilings via transfer-impedances. Annals of Probability 21 - Burton, Pemantle - 1993
63 Adaptive routing with end-to-end feedback: distributed learning and geometric approaches - Awerbuch, Kleinberg - 2004
62 Probability on trees and networks - Lyons
52 Online geometric optimization in the bandit setting against an adaptive adversary - McMahan, Blum - 2004
51 Path kernels and multiplicative updates - Takimoto, Warmuth - 2003
39 Robbing the bandit: less regret in online geometric optimization against an adaptive adversary - Dani, Hayes - 2006
36 Competing in the dark: An efficient algorithm for bandit linear optimization - Abernethy, Hazan, et al. - 2008
26 The price of bandit information for online optimization - Dani, Hayes, et al. - 2008
21 Learning permutations with exponential weights - Helmbold, Warmuth - 2009
13 Minimax policies for adversarial and stochastic bandits - Audibert, Bubeck - 2009
12 Random walks and electric networks, volume 22 of Carus Mathematical Monographs - Doyle, Snell - 1984
11 High-probability regret bounds for bandit online linear optimization - Bartlett, Dani, et al.
10 Hedging structured concepts - Koolen, Warmuth, et al. - 2010
9 Sampling spin configurations of an ising system - Randall, Wilson - 1999
8 The on-line shortest path problem under partial monitoring - György, Linder, et al.
6 Beating the adaptive bandit with high probability - Abernethy, Rakhlin - 2009
2 The on-line shortest path problem under partial monitoring - Ottucsák
The National Science Foundation
  • About CiteSeerX
  • Submit Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2010 The Pennsylvania State University