MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  Direct and indirect algorithms for on-line learning of disjunctions (1999) [8 citations — 5 self]

Download:
Download as a PDF | Download as a PS
by D. P. Helmbold, S. Panizza, M. K. Warmuth
Theoretical Computer Science
http://www.cse.ucsc.edu/~manfred/pubs/disjbayes-revjournal.ps
Add To MetaCart

Abstract:

Abstract. It is easy to design on-line learning algorithms for learning k out of n variable monotone disjunctions by simply keeping one weight per disjunction. Such algorithms use roughly O(n k) weights which can be prohibitively expensive. Surprisingly, algorithms like Winnow require only n weights (one per variable or attribute) and the mistake bound of these algorithms is not too much worse than the mistake bound of the more costly algorithms. The purpose of this paper is to investigate how the exponentially many weights can be collapsed into only O(n) weights. In particular, we consider probabilistic assumptions that enable the Bayes optimal algorithm's posterior over the disjunctions to be encoded with only O(n) weights. This results in a new O(n) algorithm for learning disjunctions which is related to the Bylander's BEG algorithm originally introduced for linear regression. Beside providing a Bayesian interpretation for this new algorithm, we are also able to obtain mistake bounds for the noise free case resembling those that have been derived for the Winnow algorithm. The same techniques used to derive this new algorithm also provide a Bayesian interpretation for a normalized version of Winnow.

Citations

1 Tracking the best disjunction. Journal of Machine Learning, Special issue on concept drift – Auer, Warmuth - 1998
1 Hinge Loss and Average Margin. Advances in Neural Information Processing Systems – Wiley - 1973
1 204-210 [LM99] Littlestone, N., Mesterharm, C.: A Simulation Study of Winnow and Related Learning Algorithms. Unpublished manuscript – Littlestone, Warmuth - 1997