(Enter summary)
Abstract: . Many factors influence a learning process and the performance of a learned classifier.
In this paper we investigate the performance effects of class distribution in the training set. We
also study different methods of measuring performance based on cost models and the performance
effects of training class distribution with respect to the different cost models. Observations from
these effects help us devise a distributed multi-classifier meta-learning approach to learn in domains
with skewed... (Update)
Context of citations to this paper: More
.... detection domain is extremely dependent on the dollar amount of each credit card transaction, Chan et al. in their studies [12] and [13] represented the cost model in terms of overheads, which are equivalent to operational costs that is needed for each investigation and...
Cited by: More
Benefit Maximizing Classification Using Feature Intervals - Ikizler (2002)
(Correct)
Similar documents (at the sentence level): More
42.5%: Learning with Non-uniform Class and Cost Distributions.. - Chan, Stolfo (1998)
(Correct)
32.7%: Toward Scalable Learning with Non-uniform Distributions.. - Philip Chan (1999)
(Correct)
22.2%: Toward Scalable Learning with Non-uniform Class and Cost.. - Chan (1998)
(Correct)
Active bibliography (related documents): More All
0.3: Distributed Data Mining Bibliography - Hillol
(Correct)
0.2: Machine Learning for the Detection of Oil Spills in.. - Kubat, Holte, Matwin (1998)
(Correct)
0.2: Cost Complexity-based Pruning of Ensemble Classifiers - Prodromidis, Stolfo (1999)
(Correct)
Similar documents based on text: More All
0.3: Submitted 12/21/02 Learning when Training Data are Costly: .. - Distribution On Tree
(Correct)
0.1: Sharing Learned Models among Remote Database Partitions by.. - Chan, Stolfo (1996)
(Correct)
0.1: Margin Error and Generalization Capabilities of.. - Elisseeff.. (1999)
(Correct)
BibTeX entry: (Update)
P. Chan and S. Stolfo. Learning with Non-uniform Class and Cost Distributions: Effects and a Distributed Multi-Classifier Approach. In Workshop Notes KDD-98 Workshop on Distributed Data Mining, pages 1-9, 1998. http://citeseer.ist.psu.edu/article/chan98learning.html More
@misc{ chan98learning,
author = "P. Chan and S. Stolfo",
title = "Learning with Non-uniform Class and Cost Distributions: Effects and a Distributed
Multi-Classifier Approach",
text = "P. Chan and S. Stolfo. Learning with Non-uniform Class and Cost Distributions:
Effects and a Distributed Multi-Classifier Approach. In Workshop Notes KDD-98
Workshop on Distributed Data Mining, pages 1-9, 1998.",
year = "1998",
url = "citeseer.ist.psu.edu/article/chan98learning.html" }
Citations (may not include all citations):
2177
programs for machine learning (context) - Quinlan - 1993
1262
Classification and Regression Trees (context) - Breiman, Friedman et al. - 1984
367
Stacked generalization
- Wolpert - 1992
248
Fast effective rule induction
- Cohen - 1995
180
The CN2 induction algorithm (context) - Clark, Niblett - 1989
145
SPRINT: A scalable parallel classifier for data mining
- Shafer, Agrawal et al. - 1996
115
Scalable parallel data mining for association rules
- Han, Karypis et al. - 1997
62
Pruning adaptive boosting
- Margineantu, Dietterich - 1997
54
Cost-sensitive classification: Empirical evaluation of a hyb..
- Turney - 1995
47
Megainduction: A test flight (context) - Catlett - 1991
41
Reducing misclassification costs (context) - Pazzani, Merz et al. - 1994
32
Introduction to IND and Recursive Partitioning (context) - Buntine, Caruana - 1991
26
Sharing learned models among remote database partitions by l..
- Chan, Stolfo - 1996
19
Cost-sensitive learning bibliography (context) - Turney - 1998
17
Cost-sensitive learning of classification knowledge and its .. (context) - Tan - 1993
16
Learning to represent codons: A challenge problem for constr..
- Craven, Shavlik - 1993
10
Mining databases with different schemas: Integrating imcompa..
- Prodromidis, Stolfo - 1998
7
Pruning classifiers in a distributed meta-learning system
- Prodromidis, Stolfo et al. - 1998
7
The application of adaboost for distributed (context) - Fan, Stolfo et al. - 1999
6
Addressing the curse of imbalanaced training sets: One sided.. (context) - Kubat, Matwin - 1997
6
Learning with skewed class distributions--summary of respons.. (context) - Fawcett - 1996
4
Analysis and visualization of classifier performance: Compar.. (context) - Learning, Provost et al. - 1997
2
Adaptive fraud detection (context) - List, No et al. - 1997
2
Scaling up inductive learning with massive parallelism (context) - NON-UNIFORM, Provost et al. - 1996
1
Meta-learning for multistrategy and parallel learning (context) - Learning, -- et al. - 1993
1
Improving minority class prediction using casespecific featu.. (context) - Research, Cardie et al. - 1997
Documents on the same site (http://www.cs.columbia.edu/~sal/JAM/PROJECT/recent-project-papers.html): More
A Comparative Evaluation of Voting and Meta-learning on.. - Chan, Stolfo (1995)
(Correct)
Learning Patterns from Unix Process Execution Traces for.. - Lee, Stolfo (1997)
(Correct)
Meta-Learning in Distributed Data Mining Systems: Issues.. - Prodromidis, Chan, al. (2000)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC