See this document in CiteSeerX!

Learning with Non-uniform Class and Cost Distributions: Effects and a Multi-classifier Approach (1998)  (Make Corrections)  (1 citation)
Philip K. Chan, et al.



  Home/Search   Context   Related

 
View or download:
columbia.edu/~sal/hpap...mljcost.ps.gz
fit.edu/~pkc/papers/mlj.ps.gz
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  columbia.edu/~s...projectpapers (more)
From:  fit.edu/~pkc/papers/
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: . Many factors influence a learning process and the performance of a learned classifier. In this paper we investigate the performance effects of class distribution in the training set. We also study different methods of measuring performance based on cost models and the performance effects of training class distribution with respect to the different cost models. Observations from these effects help us devise a distributed multi-classifier meta-learning approach to learn in domains with skewed... (Update)

Context of citations to this paper:   More

.... detection domain is extremely dependent on the dollar amount of each credit card transaction, Chan et al. in their studies [12] and [13] represented the cost model in terms of overheads, which are equivalent to operational costs that is needed for each investigation and...

Cited by:   More
Benefit Maximizing Classification Using Feature Intervals - Ikizler (2002)   (Correct)

Similar documents (at the sentence level):   More
42.5%:   Learning with Non-uniform Class and Cost Distributions.. - Chan, Stolfo (1998)   (Correct)
32.7%:   Toward Scalable Learning with Non-uniform Distributions.. - Philip Chan (1999)   (Correct)
22.2%:   Toward Scalable Learning with Non-uniform Class and Cost.. - Chan (1998)   (Correct)

Active bibliography (related documents):   More   All
0.3:   Distributed Data Mining Bibliography - Hillol   (Correct)
0.2:   Machine Learning for the Detection of Oil Spills in.. - Kubat, Holte, Matwin (1998)   (Correct)
0.2:   Cost Complexity-based Pruning of Ensemble Classifiers - Prodromidis, Stolfo (1999)   (Correct)

Similar documents based on text:   More   All
0.3:   Submitted 12/21/02 Learning when Training Data are Costly: .. - Distribution On Tree   (Correct)
0.1:   Sharing Learned Models among Remote Database Partitions by.. - Chan, Stolfo (1996)   (Correct)
0.1:   Margin Error and Generalization Capabilities of.. - Elisseeff.. (1999)   (Correct)

BibTeX entry:   (Update)

P. Chan and S. Stolfo. Learning with Non-uniform Class and Cost Distributions: Effects and a Distributed Multi-Classifier Approach. In Workshop Notes KDD-98 Workshop on Distributed Data Mining, pages 1-9, 1998. http://citeseer.ist.psu.edu/article/chan98learning.html   More

@misc{ chan98learning,
  author = "P. Chan and S. Stolfo",
  title = "Learning with Non-uniform Class and Cost Distributions: Effects and a Distributed
    Multi-Classifier Approach",
  text = "P. Chan and S. Stolfo. Learning with Non-uniform Class and Cost Distributions:
    Effects and a Distributed Multi-Classifier Approach. In Workshop Notes KDD-98
    Workshop on Distributed Data Mining, pages 1-9, 1998.",
  year = "1998",
  url = "citeseer.ist.psu.edu/article/chan98learning.html" }
Citations (may not include all citations):
2177   programs for machine learning (context) - Quinlan - 1993
1262   Classification and Regression Trees (context) - Breiman, Friedman et al. - 1984
367   Stacked generalization - Wolpert - 1992
248   Fast effective rule induction - Cohen - 1995
180   The CN2 induction algorithm (context) - Clark, Niblett - 1989
145   SPRINT: A scalable parallel classifier for data mining - Shafer, Agrawal et al. - 1996
115   Scalable parallel data mining for association rules - Han, Karypis et al. - 1997
62   Pruning adaptive boosting - Margineantu, Dietterich - 1997
54   Cost-sensitive classification: Empirical evaluation of a hyb.. - Turney - 1995
47   Megainduction: A test flight (context) - Catlett - 1991
41   Reducing misclassification costs (context) - Pazzani, Merz et al. - 1994
32   Introduction to IND and Recursive Partitioning (context) - Buntine, Caruana - 1991
26   Sharing learned models among remote database partitions by l.. - Chan, Stolfo - 1996
19   Cost-sensitive learning bibliography (context) - Turney - 1998
17   Cost-sensitive learning of classification knowledge and its .. (context) - Tan - 1993
16   Learning to represent codons: A challenge problem for constr.. - Craven, Shavlik - 1993
10   Mining databases with different schemas: Integrating imcompa.. - Prodromidis, Stolfo - 1998
7   Pruning classifiers in a distributed meta-learning system - Prodromidis, Stolfo et al. - 1998
7   The application of adaboost for distributed (context) - Fan, Stolfo et al. - 1999
6   Addressing the curse of imbalanaced training sets: One sided.. (context) - Kubat, Matwin - 1997
6   Learning with skewed class distributions--summary of respons.. (context) - Fawcett - 1996
4   Analysis and visualization of classifier performance: Compar.. (context) - Learning, Provost et al. - 1997
2   Adaptive fraud detection (context) - List, No et al. - 1997
2   Scaling up inductive learning with massive parallelism (context) - NON-UNIFORM, Provost et al. - 1996
1   Meta-learning for multistrategy and parallel learning (context) - Learning, -- et al. - 1993
1   Improving minority class prediction using casespecific featu.. (context) - Research, Cardie et al. - 1997

Documents on the same site (http://www.cs.columbia.edu/~sal/JAM/PROJECT/recent-project-papers.html):   More
A Comparative Evaluation of Voting and Meta-learning on.. - Chan, Stolfo (1995)   (Correct)
Learning Patterns from Unix Process Execution Traces for.. - Lee, Stolfo (1997)   (Correct)
Meta-Learning in Distributed Data Mining Systems: Issues.. - Prodromidis, Chan, al. (2000)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC