Generating classifier committees by stochastically selecting both attributes and training examples (1998)

by Zijian Zheng
Proceedings of the 5th Pacific Rim International Conference on Artificial Intelligence (PRICAI'98)
http://www3.cm.deakin.edu.au/~zijian/Papers/pricai98-sasc-bag.ps.gz

Abstract:

Boosting and Bagging, two representative approaches to learning classifier committees, have demonstrated great success, especially for decision tree learning. They repeatedly build different classifiers using a base learning algorithm by changing the distribution of the training set. Sasc, a different type of committee learning method, can also significantly reduce the error rate of decision trees. It generates classifier committees by stochastically modifying the set of attributes while keeping the distribution of the training set unchanged. It has been shown that Bagging and Sasc are, on average, less accurate than Boosting, but that they are more stable: they less frequently obtain significantly higher error rates than the base learning algorithm. In this paper, we propose a novel committee learning algorithm, called SascBag, that combines Sasc and Bagging. It creates different classifiers by stochastically varying both the attribute set and the distribution of the training set. Experimental results on a representative collection of natural domains show that, for decision tree learning, the new algorithm is, on average, more accurate than Boosting, Bagging, and Sasc. It is also more stable than Boosting. In addition, like Bagging and Sasc, SascBag is amenable to parallel and distributed processing, while Boosting is not. This gives SascBag another advantage over Boosting for parallel machine learning and data mining.
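The idea of varying both the attribute set and the training distribution can be illustrated with a minimal Python sketch. It is not the paper's implementation and rests on several assumptions: the base learner is a one-level decision stump rather than a full decision tree, one random attribute subset is drawn per committee member (the paper's Sasc procedure may select attributes differently), and members vote by unweighted majority. The names `train_stump`, `sascbag_fit`, `sascbag_predict`, and the parameter `attr_prob` are hypothetical.

```python
import random
from collections import Counter


def train_stump(rows, labels, attrs):
    """Return a one-split classifier using only the attributes in `attrs`.

    Assumption: a stump stands in for the decision tree learner in the abstract.
    """
    majority = Counter(labels).most_common(1)[0][0]
    best = (len(labels) + 1, None, None, majority, majority)
    for a in attrs:
        for t in sorted({r[a] for r in rows}):
            left = [y for r, y in zip(rows, labels) if r[a] <= t]
            right = [y for r, y in zip(rows, labels) if r[a] > t]
            if not left or not right:
                continue  # not a real split
            l_lab = Counter(left).most_common(1)[0][0]
            r_lab = Counter(right).most_common(1)[0][0]
            errors = sum(y != l_lab for y in left) + sum(y != r_lab for y in right)
            if errors < best[0]:
                best = (errors, a, t, l_lab, r_lab)
    _, a, t, l_lab, r_lab = best
    if a is None:
        # No usable split (e.g. a single-class sample): predict the majority class.
        return lambda row: majority
    return lambda row: l_lab if row[a] <= t else r_lab


def sascbag_fit(rows, labels, n_members=25, attr_prob=0.5, seed=0):
    """Build a committee; each member sees a bootstrap sample and a random attribute subset."""
    rng = random.Random(seed)
    n, n_attrs = len(rows), len(rows[0])
    committee = []
    for _ in range(n_members):
        # Bagging step: resample the training set with replacement.
        idx = [rng.randrange(n) for _ in range(n)]
        # Attribute step (assumed form): keep each attribute with probability attr_prob.
        attrs = [a for a in range(n_attrs) if rng.random() < attr_prob]
        if not attrs:
            attrs = [rng.randrange(n_attrs)]  # always keep at least one attribute
        committee.append(
            train_stump([rows[i] for i in idx], [labels[i] for i in idx], attrs)
        )
    return committee


def sascbag_predict(committee, row):
    """Classify `row` by unweighted majority vote over the committee (an assumption)."""
    votes = Counter(member(row) for member in committee)
    return votes.most_common(1)[0][0]
```

A committee would be built with `sascbag_fit(rows, labels)` and queried with `sascbag_predict(committee, row)`. Because each member is trained independently of the others, the loop in `sascbag_fit` could in principle be distributed across workers, which is the property the abstract contrasts with Boosting.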
