See this document in CiteSeerX!

A Bound on the Error of Cross Validation Using the Approximation and Estimation Rates, with Consequences for the Training-Test Split (1996)  (Make Corrections)  (17 citations)
Michael Kearns
Advances in Neural Information Processing Systems



  Home/Search   Context   Related

 
View or download:
informatik.unibonn.de/III/l...cv.ps.gz
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  informatik.unibonn.de/III/leh... (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: : We give an analysis of the generalization error of cross validation in terms of two natural measures of the difficulty of the problem under consideration: the approximation rate (the accuracy to which the target function can be ideally approximated as a function of the number of hypothesis parameters), and the estimation rate (the deviation between the training and generalization errors as a function of the number of hypothesis parameters). The approximation rate captures the complexity of... (Update)

Context of citations to this paper:   More

...of model selection which has be widely studied and criticized. Foundational papers include [2, 3, 4] and recent contributions include [5, 6]. In this paper, our emphasis is on exhibiting a simple and general scaling law which can guide experimentalists in pattern recognition:...

...with k = n l decreasing. The optimal choice of l and k depend on the learner L, the hypothesis space H, and the learning task Pr( x; y) [Kearns, 1996]. Nevertheless, there are good heuristics for selecting reasonable values for l and k [Kearns, 1996] Let s nally also look at...

Cited by:   More
The Maximum-Margin Approach to Learning Text Classifiers -.. - Joachims (2000)   (Correct)
Model Selection for Probabilistic Clustering Using Cross-Validated .. - Smyth (1998)   (Correct)
On Optimal Data Split For Generalization Estimation And Model .. - Larsen, Goutte (1999)   (Correct)

Active bibliography (related documents):   More   All
0.2:   A Bound on the Error of Cross Validation Using the Approximation.. - Kearns (1996)   (Correct)
0.2:   An Experimental and Theoretical Comparison of Model.. - Kearns, Mansour, Ng, Ron   (Correct)
0.1:   Algorithmic Stability and Sanity-Check Bounds for Leave-One-Out .. - Kearns, Ron (1997)   (Correct)

Similar documents based on text:   More   All
0.0:   Preventing "Overfitting" of Cross-Validation Data - Ng (1997)   (Correct)
0.0:   Toward Efficient Agnostic Learning - Kearns (1994)   (Correct)
0.0:   Animal Improvement Programs Laboratory, Agricultural.. - Us Department Of   (Correct)

Related documents from co-citation:   More   All
9:   An Experimental and Theoretical Comparison of Model Selection Methods - Kearns, Mansour et al.
6:   Statistical Learning Theory (context) - Vapnik
6:   Estimation of Dependencies based on empirical data (context) - Vapnik - 1982

BibTeX entry:   (Update)

Kearns, M. J. (1996). A bound on the error of Cross Validation using the approximation and estimation rates, with consequences for the training-test split. In Advances in Neural Information Processing Systems 8, pages 183--189. Morgan Kaufmann. http://citeseer.ist.psu.edu/article/kearns96bound.html   More

@inproceedings{ kearns96bound,
    author = "Michael Kearns",
    title = "A Bound on the Error of Cross Validation Using the Approximation and Estimation Rates, with Consequences for the Training-Test Split",
    booktitle = "Advances in Neural Information Processing Systems",
    volume = "8",
    publisher = "The {MIT} Press",
    editor = "David S. Touretzky and Michael C. Mozer and Michael E. Hasselmo",
    pages = "183--189",
    year = "1996",
    url = "citeseer.ist.psu.edu/article/kearns96bound.html" }
Citations (may not include all citations):
454   the uniform convergence of relative frequencies of events to.. (context) - Vapnik, Chervonenkis - 1971
417   Stochastic Complexity in Statistical Inquiry (context) - Rissanen - 1989
348   Estimation of Dependences Based on Empirical Data (context) - Vapnik - 1982
258   Cross-validatory choice and assessment of statistical predic.. (context) - Stone - 1974
214   Universal approximation bounds for superpositions of a sigmo.. (context) - Barron - 1991
101   Minimum complexity density estimation (context) - Barron, Cover - 1991
66   An experimental and theoretical comparison of model selectio.. - Kearns, Mansour et al. - 1995
58   Statistical mechanics of learning from examples (context) - Seung, Sompolinsky et al. - 1992
23   Asymptotics for and against cross-validation (context) - Stone - 1977
8   Rigourous learning curve bounds from statistical mechanics (context) - Haussler, Kearns et al. - 1994



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://www.informatik.uni-bonn.de/III/lehre/seminare/Mustererkennung/WS99/):   More
Hierarchical mixtures of experts and the EM algorithm - Jordan, Jacobs (1994)   (Correct)
Selective sampling using the Query by Committee algorithm - Yoav Freund, H.. (1995)   (Correct)
Factorial Hidden Markov Models - Ghahramani, Jordan (1996)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC