Subsetting as an approach to distributed learning
Simon Thompson and Max Bramer. Artificial Intelligence Research Group,...

From: sis.port.ac.uk/tecrep

Abstract: This paper discusses one approach to the scaling problem of machine learning (Chan and Stolfo 1995), namely the decomposition of large datasets into smaller subsets so that the learning task on each subset is greatly reduced in comparison to the learning task for the whole dataset. A previous study (Catlett 1992) examined this approach and concluded that classifiers learned on a subset of the available data were significantly less accurate than classifiers learned on the entire training set. We...
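
The decomposition described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' method: the nearest-centroid learner, the synthetic two-class data, the function names, and the majority vote used to combine the per-subset classifiers are all assumptions made for illustration only.

```python
import random
from collections import Counter


def partition(dataset, k, seed=0):
    """Shuffle the rows and split them into k disjoint subsets of near-equal size."""
    rows = list(dataset)
    random.Random(seed).shuffle(rows)
    return [rows[i::k] for i in range(k)]


def train_centroids(subset):
    """Toy learner (an assumption): store one mean feature vector per class label."""
    sums, counts = {}, Counter()
    for features, label in subset:
        acc = sums.setdefault(label, [0.0] * len(features))
        for j, value in enumerate(features):
            acc[j] += value
        counts[label] += 1
    return {label: [s / counts[label] for s in acc] for label, acc in sums.items()}


def predict(model, features):
    """Return the label whose centroid is closest to the given feature vector."""
    def sq_dist(centroid):
        return sum((a - b) ** 2 for a, b in zip(centroid, features))
    return min(model, key=lambda label: sq_dist(model[label]))


def vote(models, features):
    """Combine the per-subset classifiers by simple majority vote (an assumption)."""
    return Counter(predict(m, features) for m in models).most_common(1)[0][0]


# Synthetic data (hypothetical): class "a" near (0, 0), class "b" near (5, 5).
rng = random.Random(1)
dataset = [((rng.gauss(0, 1), rng.gauss(0, 1)), "a") for _ in range(100)]
dataset += [((rng.gauss(5, 1), rng.gauss(5, 1)), "b") for _ in range(100)]

subsets = partition(dataset, k=4)                 # four learning tasks of 50 rows each
models = [train_centroids(s) for s in subsets]    # one classifier per subset
print([len(s) for s in subsets], vote(models, (0.2, -0.1)), vote(models, (4.8, 5.3)))
```

Each learner sees only a quarter of the rows, which is the reduction in per-task size the abstract refers to. Whether the combined classifiers match one trained on the whole dataset is the question the paper examines, and this sketch does not settle it.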

BibTeX entry:

@misc{ thompson-subsetting,
  author = "Simon Thompson and Max Bramer",
  title = "Subsetting as an Approach to Distributed Learning.",
  url = "citeseer.ist.psu.edu/91966.html" }

Citations (may not include all citations):
Classification and regression trees - Breiman, Friedman et al. - 1984
Instance-Based Learning Algorithms - Aha, Kibler et al. - 1991
A comparative evaluation of voting and meta-learning on part.. - Chan, Stolfo - 1995
Megainduction: A test flight - Catlett - 1991
Boosting and C - Quinlan - 1996
Structured induction in expert systems - Shapiro - 1987
An empirical comparison of genetic and decision-tree classif.. - Quinlan - 1988
Using SQL primitives and parallel DB servers to speedup know.. - Freitas - 1995
Agent-Based Knowledge Discovery - Davies, Edwards - 1995
Programmes for Machine Learning - Quinlan - 1993
