(Enter summary)
Abstract: There has been considerable interest recently in various approaches
to scaling up machine learning systems to large and distributed data
sets. We have been studying approaches based upon the parallel application
of multiple learning programs at distributed sites, followed
by a meta-learning stage to combine the multiple models in a principled
fashion. In this paper, we empirically determine the "best" data
partitioning scheme for a selected data set to compose "appropriatelysized
" subsets and... (Update)
Context of citations to this paper: More
.... domain are not immediately discernible, but rather requires extensive experimentation to find the best models, and the best meta classifiers [20]. 2.3 Cost based Models for Fraud Detection Most of the machine learning literature concentrates on model accuracy (either training...
...even with a complete analysis [45, 46] in the ROC space. 6 Extensive experiments evaluating di erent data distributions are presented in [42]. 26 local and 50 internal classi ers (those imported from their peer data sites) each site also imported 60 external classi ers...
Cited by: More
Distributed Data Mining Systems - Prodromidis (1999)
(Correct)
A Perspective View And Survey Of Meta-Learning - Vilalta, Drissi (2002)
(Correct)
Cost Complexity-based Pruning of Ensemble Classifiers - Prodromidis, Stolfo (1999)
(Correct)
Similar documents (at the sentence level): More
22.1%: Cost Complexity Pruning of Ensemble Classifiers - Prodromidis, Stolfo
(Correct)
14.1%: Agent-Based Distributed Learning Applied to Fraud Detection - Prodromidis, Stolfo (1999)
(Correct)
9.7%: Effective and Efficient Pruning of Meta-Classifiers in a.. - Prodromidis, Stolfo (1999)
(Correct)
Active bibliography (related documents): More All
0.1: A Survey of Methods for Scaling Up Inductive Algorithms - Provost, Kolluri (1999)
(Correct)
0.1: Cost-based Modeling for Fraud and Intrusion Detection.. - Stolfo, Fan, Lee
(Correct)
0.1: On the Management of Distributed Learning Agents - Prodromidis (1997)
(Correct)
Similar documents based on text: More All
0.1: Recursive-Stacking To Improve The Accuracy of Combined.. - Fan, Stolfo, al.
(Correct)
0.1: Integrating Multiple Learned Models for Improving and Scaling.. - Most Modern
(Correct)
0.1: Distributed Data Mining: The JAM System Architecture - Prodromidis, Stolfo..
(Correct)
Related documents from co-citation: More All
6: Stacked Generalization
- Wolpert - 1992
5: Jam: Java agents for meta-learning over distributed databases
- Stolfo, Prodromidis et al. - 1997
4: Sharing learned models among remote database partitions by local meta-learning
- Chan, Stolfo - 1996
BibTeX entry: (Update)
A.L. Prodromidis and S.J. Stolfo. A comparative evaluation of meta-learning strategies over large and distributed data sets. In Workshop on Meta-learning, Sixteenth Intl. Conf. Machine Learning, 1999. Submitted for publication. http://citeseer.ist.psu.edu/prodromidis99comparative.html More
@misc{ prodromidis99comparative,
author = "A. Prodromidis and S. Stolfo",
title = "A comparative evaluation of meta-learning strategies over large and distributed
data sets",
text = "A.L. Prodromidis and S.J. Stolfo. A comparative evaluation of meta-learning
strategies over large and distributed data sets. In Workshop on Meta-learning,
Sixteenth Intl. Conf. Machine Learning, 1999. Submitted for publication.",
year = "1999",
url = "citeseer.ist.psu.edu/prodromidis99comparative.html" }
Citations (may not include all citations):
2177
programs for machine learning (context) - Quinlan - 1993
1359
Induction of decision trees (context) - Quinlan - 1986 ACM DBLP
1262
Classification and Regression Trees (context) - Breiman, Friedman et al. - 1984
537
A theory of the learnable (context) - Valiant - 1984 ACM DBLP
454
the uniform convergence of relative frequencies of events to.. (context) - Vapnik, Chervonenkis - 1971
367
Stacked generalization
- Wolpert - 1992 ACM
261
The weighted majority algorithm
- Littlestone, Warmuth - 1989 ACM DBLP
248
An Introduction to computational learning theory (context) - Kearns, Vazirani - 1994 ACM
87
ective rule induction (context) - Cohen - 1995
85
Analysis and visualization of classifier performance: Compar..
- Provost, Fawcett - 1997
85
Bounds on the sample complexity of bayesian learning using i..
- Kearns, Haussler et al. - 1994 ACM DBLP
71
A comparative evaluation of voting and meta-learning on part..
- Chan, Stolfo - 1995 DBLP
62
Megainduction: machine learning on very large databases (context) - Catlett - 1992
58
Statistical mechanics of learning from examples (context) - Seung, Sompolinsky et al. - 1992
54
Cost-sensitive classification: Empirical evaluation of a hyd..
- Turney - 1995
54
Meta-learning for multistrategy and parallel learning (context) - Chan, Stolfo - 1993
47
Megainduction: A test flight (context) - Catlett - 1991 DBLP
47
Theory and Application of Correspondence Analysis (context) - Greenacre - 1984
42
Computational learning theory: An Introduction (context) - Anthony, Biggs - 1992
36
Toward scalable learning with non-uniform class and cost dis..
- Chan, Stolfo - 1998 DBLP
33
Robust classification systems for imprecise environments
- Provost, Fawcett - 1998
28
Using correspondence analysis to combine classifiers
- Merz - 1998
17
Working Notes for the AAAI-96 Workshop on Integrating Multip.. (context) - Chan, Stolfo et al. - 1996
17
Pruning meta-classifiers in a distributed data mining system
- Prodromidis, Stolfo - 1998
10
Scaling up inductive algorithms: An overview
- Provost, Kolluri - 1997
9
Agent-based distributed learning applied to fraud detection
- Prodromidis, Stolfo
2
Large datasets lead to overly complex models: As explanation.. (context) - Jensen, Oates - 1998
Documents on the same site (http://www.cs.columbia.edu/~andreas/publications/publications.html): More
Extensible Resource Management For Cluster Computing - Islam, Prodromidis.. (1996)
(Correct)
JAM: Java Agents for Meta-Learning over Distributed.. - Stolfo, Prodromidis.. (1997)
(Correct)
JAM: Java Agents for Meta-Learning over Distributed.. - Stolfo, Prodromidis.. (1997)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC