Abstract: We give an adversary strategy that forces the Perceptron algorithm to make Ω(kN) mistakes in learning monotone disjunctions over N variables with at most k literals. In contrast, Littlestone's algorithm Winnow makes at most O(k log N) mistakes for the same problem. Both algorithms use thresholded linear functions as their hypotheses. However, Winnow does multiplicative updates to its weight vector instead of the additive updates of the Perceptron algorithm. The Perceptron algorithm ...
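The contrast between additive and multiplicative updates can be made concrete with a minimal sketch. It is not taken from the report: the function names, the learned bias term for the Perceptron, the promotion factor alpha = 2, and the fixed Winnow threshold of N are all assumptions chosen for illustration, and repeated passes over all 2^N inputs stand in for an example sequence (this is not the adversary strategy the abstract refers to).

```python
from itertools import product


def target(x, relevant):
    """Monotone disjunction: 1 if any relevant variable is set."""
    return 1 if any(x[i] for i in relevant) else 0


def perceptron_predict(w, b, x):
    """Thresholded linear hypothesis with a learned bias b."""
    return 1 if sum(wi * xi for wi, xi in zip(w, x)) + b >= 0 else 0


def winnow_predict(w, theta, x):
    """Thresholded linear hypothesis with a fixed threshold theta."""
    return 1 if sum(wi * xi for wi, xi in zip(w, x)) >= theta else 0


def train_perceptron(examples, n, max_passes=1000):
    """Additive updates: add or subtract the example on each mistake."""
    w, b, mistakes = [0.0] * n, 0.0, 0
    for _ in range(max_passes):
        clean = True
        for x, y in examples:
            if perceptron_predict(w, b, x) != y:
                step = 1.0 if y == 1 else -1.0
                w = [wi + step * xi for wi, xi in zip(w, x)]
                b += step
                mistakes += 1
                clean = False
        if clean:  # a full pass without mistakes: hypothesis is consistent
            break
    return w, b, mistakes


def train_winnow(examples, n, alpha=2.0, max_passes=1000):
    """Multiplicative updates: scale the weights of active variables."""
    w, theta, mistakes = [1.0] * n, float(n), 0  # assumed: theta = N
    for _ in range(max_passes):
        clean = True
        for x, y in examples:
            if winnow_predict(w, theta, x) != y:
                factor = alpha if y == 1 else 1.0 / alpha
                w = [wi * factor if xi else wi for wi, xi in zip(w, x)]
                mistakes += 1
                clean = False
        if clean:
            break
    return w, theta, mistakes


if __name__ == "__main__":
    n, relevant = 8, (0, 3)  # hypothetical instance: N = 8, k = 2
    examples = [(x, target(x, relevant)) for x in product((0, 1), repeat=n)]
    print("Perceptron mistakes:", train_perceptron(examples, n)[2])
    print("Winnow mistakes:", train_winnow(examples, n)[2])
```

On a benign sequence like this one the mistake counts need not show the linear versus logarithmic gap; the Ω(kN) bound in the abstract concerns sequences chosen by an adversary.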
Similar documents (at the sentence level):
42.8%: The Perceptron algorithm vs. Winnow: linear vs. logarithmic.. - Kivinen, Warmuth (1995)
10.3%: The Curse of Dimensionality and the Perceptron Algorithm - Kivinen, Warmuth (1995)
Active bibliography (related documents):
0.1: Rigorous Learning Curve Bounds from Statistical Mechanics - Haussler, Kearns, Seung.. (1996)
0.1: On PAC Learning Using Winnow, Perceptron, and a Perceptron-Like.. - Servedio
0.1: PAC Analogues of Perceptron and Winnow via Boosting the Margin - Servedio (2000)
Similar documents based on text:
0.4: On Bayes Methods for On-line Boolean Prediction - Cesa-Bianchi, Helmbold, Panizza (1997)
0.4: Report for Publication of the Activity of the Working Group.. - Shawe-Taylor (1997)
0.3: Entropy Estimation - Bercher, Vignat (1996)
BibTeX entry:
@techreport{ kivinen95perceptron,
author = "Jyrki Kivinen and Manfred Warmuth",
title = "The {Perceptron} Algorithm vs {Winnow}: Linear vs Logarithmic Mistake Bounds When Few Input Variables Are Relevant",
number = "UCSC-CRL-95-44",
year = "1995",
url = "citeseer.ist.psu.edu/kivinen97perceptron.html" }
Citations (may not include all citations):
2133: Pattern Classification and Scene Analysis - Duda, Hart (1973)
537: A theory of the learnable - Valiant (1984)
465: Learnability and the Vapnik-Chervonenkis dimension - Blumer, Ehrenfeucht et al. (1989)
454: the uniform convergence of relative frequencies of events to.. - Vapnik, Chervonenkis (1971)
317: Learning quickly when irrelevant attributes abound: A new li.. - Littlestone (1988)
74: Mistake Bounds and Logarithmic Linear-threshold Learning Alg.. - Littlestone (1989)
67: A new algorithm for minimizing convex functions over convex .. - Vaidya (1989)
64: and linear threshold learning using Winnow - Littlestone, attributes et al. (1991)
40: The statistical mechanics of learning a rule - Watkin, Rau et al. (1993)
32: line learning of linear functions - Littlestone, Long et al. (1995)
29: Tracking the best disjunction - Auer, Warmuth (1995)
26: How fast can a threshold gate learn - Maass, Turán (1994)
15: Comparing several linear-threshold learning algorithms on ta.. - Littlestone (1995)
14: English translation in Soviet Mathematics Doklady - Khachiyan, algorithm et al. (1979)
10: Learning curves in large neural networks - Sompolinsky, Seung et al. (1991)
CiteSeer.IST - Copyright Penn State and NEC