See this document in CiteSeerX!

Racing Committees for Large Datasets (2002)  (Make Corrections)  (2 citations)
Eibe Frank, Geoffrey Holmes, Richard Kirkby, Mark Hall



  Home/Search   Context   Related

 
View or download:
cs.waikato.ac.nz/~...t_al_DS_2002.ps.gz
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  cs.waikato.ac.nz/~...Publications (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: This paper proposes a method for generating classifiers from large datasets by building a committee of simple base classifiers using a standard boosting algorithm. It permits the processing of large datasets even if the underlying base learning algorithm cannot efficiently do so. (Update)

Cited by:   More
Mining Data Streams Using Option Trees - Holmes, Kirkby, Pfahringer (2004)   (Correct)
Fast and Light Boosting for Adaptive Mining of Data Streams - Chu, Zaniolo (2004)   (Correct)

Active bibliography (related documents):   More   All
0.7:   Constructing Simpler Decision Trees from Ensemble Models.. - Fourier Analysis Byung-Hoon   (Correct)
0.4:   MobiMine: Monitoring the Stock Market from a PDA - Kargupta, Park, Pittie, Liu, .. (2001)   (Correct)
0.3:   Experimental Comparisons of Online and Batch Versions of.. - Oza, Russell   (Correct)

Similar documents based on text:   More   All
0.6:   Multiclass Alternating Decision Trees - Holmes, Pfahringer, Kirkby.. (2002)   (Correct)
0.6:   Visualizing Class Probability Estimators - Frank, Hall   (Correct)
0.4:   Interactive Machine Learning - Letting Users Build.. - Ware, Frank, Holmes.. (2000)   (Correct)

Related documents from co-citation:   More   All
2:   Mining High-Speed Data Streams - Domingos, Hulten - 2000

BibTeX entry:   (Update)

E. Frank, G. Holmes, R. Kirkby, and M. Hall. Racing committees for large datasets. In Discovery Science, 2002. http://citeseer.ist.psu.edu/frank02racing.html   More

@misc{ frank02racing,
  author = "E. Frank and G. Holmes and R. Kirkby and M. Hall",
  title = "Racing committees for large datasets",
  text = "E. Frank, G. Holmes, R. Kirkby, and M. Hall. Racing committees for large
    datasets. In Discovery Science, 2002.",
  year = "2002",
  url = "citeseer.ist.psu.edu/frank02racing.html" }
Citations (may not include all citations):
696   UCI repository of machine learning databases (context) - Blake, Keogh et al. - 1998
500   Experiments with a new boosting algorithm - Freund, Schapire - 1996
236   Additive logistic regression: a statistical view of boosting - Friedman, Hastie et al. - 2000
62   Pruning adaptive boosting - Margineantu, Dietterich - 1997
49   The UCI KDD archive (context) - Hettich, Bay - 1999
32   Static versus dynamic sampling for data mining - John, Langley - 1996
21   Arcing classi ers (context) - Breiman - 1998
21   A study of two sampling methods for analysing large datasets.. - Srinivasan - 1999
9   Cost complexity-based pruning of ensemble classi ers - Prodromidis, Stolfo - 2001
9   A streaming ensemble algorithm (context) - Street, Kim - 2001
7   The application of AdaBoost for distributed (context) - Fan, Stolfo et al. - 1999
5   Pasting small votes for classi cation in large databases and.. (context) - Breiman - 1999
3   Experimental comparisons of online and batch versions of bag.. - Oza, Russell - 2001
2   Pruning classi ers in a distributed meta-learning system (context) - Prodromidis, Stolfo et al. - 1998

Documents on the same site (http://www.cs.waikato.ac.nz/~eibe/Publications.html):   More
Domain-Specific Keyphrase Extraction - Frank, Paynter, Witten (1999)   (Correct)
Making Better Use of Global Discretization - Frank, Witten (1999)   (Correct)
Pruning Decision Trees and Lists - Frank (2000)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC