by Zijian Zheng
Proceedings of the 5th Pacific Rim International Conference on Artificial Intelligence (PRICAI’98)
http://www3.cm.deakin.edu.au/~zijian/Papers/pricai98-sasc-bag.ps.gz
Abstract:
Boosting and Bagging, as two representative approaches to learning classifier committees, have demonstrated great success, especially for decision tree learning. They repeatedly build different classifiers using a base learning algorithm by changing the distribution of the training set. Sasc, as a different type of committee learning method, can also significantly reduce the error rate of decision trees. It generates classifier committees by stochastically modifying the set of attributes while keeping the distribution of the training set unchanged. It has been shown that Bagging and Sasc are, on average, less accurate than Boosting, but they are more stable: they less frequently produce significantly higher error rates than the base learning algorithm. In this paper, we propose a novel committee learning algorithm, called SascBag, that combines Sasc and Bagging. It creates different classifiers by stochastically varying both the attribute set and the distribution of the training set. Experimental results on a representative collection of natural domains show that, for decision tree learning, the new algorithm is, on average, more accurate than Boosting, Bagging, and Sasc. It is also more stable than Boosting. In addition, like Bagging and Sasc, SascBag is amenable to parallel and distributed processing, while Boosting is not. This gives SascBag another advantage over Boosting for parallel machine learning and data mining.
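The combination described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the base learner here is a small Gini-based threshold tree rather than C4.5, attributes are made available at each node with an assumed probability `p_attr`, committee members vote by simple majority, and all function names (`build_tree`, `sascbag_fit`, `sascbag_predict`) and the toy data are hypothetical.

```python
import random
from collections import Counter

def majority(labels):
    # Most frequent label; ties go to the label seen first.
    return Counter(labels).most_common(1)[0][0]

def gini(labels):
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def build_tree(X, y, rng, p_attr, max_depth=4):
    """Grow a small threshold tree. At every node each attribute is made
    available with probability p_attr (assumed form of stochastic
    attribute selection, not the paper's exact procedure)."""
    if max_depth == 0 or len(set(y)) == 1:
        return majority(y)
    n_attrs = len(X[0])
    avail = [a for a in range(n_attrs) if rng.random() < p_attr]
    if not avail:
        avail = [rng.randrange(n_attrs)]
    best = None
    for a in avail:
        # Dropping the largest value keeps both sides of the split non-empty.
        for t in sorted({row[a] for row in X})[:-1]:
            left = [lab for row, lab in zip(X, y) if row[a] <= t]
            right = [lab for row, lab in zip(X, y) if row[a] > t]
            score = len(left) * gini(left) + len(right) * gini(right)
            if best is None or score < best[0]:
                best = (score, a, t)
    if best is None:  # no available attribute can split this node
        return majority(y)
    _, a, t = best
    li = [i for i, row in enumerate(X) if row[a] <= t]
    ri = [i for i, row in enumerate(X) if row[a] > t]
    return (a, t,
            build_tree([X[i] for i in li], [y[i] for i in li], rng, p_attr, max_depth - 1),
            build_tree([X[i] for i in ri], [y[i] for i in ri], rng, p_attr, max_depth - 1))

def predict_tree(tree, row):
    # Internal nodes are (attribute, threshold, left, right); leaves are labels.
    while isinstance(tree, tuple):
        a, t, left, right = tree
        tree = left if row[a] <= t else right
    return tree

def sascbag_fit(X, y, n_members=15, p_attr=0.5, seed=0):
    """Each member sees a bootstrap sample (Bagging) and a stochastically
    restricted attribute set (Sasc-style)."""
    n = len(X)
    committee = []
    for m in range(n_members):
        rng = random.Random(seed + m)  # one generator per member
        idx = [rng.randrange(n) for _ in range(n)]  # sample with replacement
        committee.append(build_tree([X[i] for i in idx], [y[i] for i in idx], rng, p_attr))
    return committee

def sascbag_predict(committee, row):
    return majority([predict_tree(tree, row) for tree in committee])

# Toy data (hypothetical): three numeric attributes, two classes.
X = [[i, 2 * i, i + 0.5] for i in range(10)]
y = ["neg" if i < 5 else "pos" for i in range(10)]
committee = sascbag_fit(X, y)
print(sascbag_predict(committee, [0, 0, 0.5]), sascbag_predict(committee, [9, 18, 9.5]))
```

Because each member depends only on its own random generator and the training data, the loop in `sascbag_fit` could in principle be distributed across workers, which is the property the abstract contrasts with Boosting's sequential reweighting.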
Citations
Cited by | Title – Authors, Year
3215 | C4.5: Programs for Machine Learning – Quinlan, 1993
2138 | UCI Repository of Machine Learning Databases – Merz, Murphy, 1996
1133 | A decision-theoretic generalization of on-line learning and an application to boosting – Freund, Schapire, 1997
1004 | Experiments with a new boosting algorithm – Schapire, 1996
483 | Boosting the Margin: A New Explanation for the Effectiveness of Voting Methods – Schapire, Freund, et al., 1997
453 | The strength of weak learnability – Schapire, 1990
356 | An empirical comparison of voting classification algorithms: Bagging, boosting and variants – Bauer, Kohavi, 1999
338 | A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection – Kohavi, 1995
337 | Solving multiclass learning problems via error-correcting output codes – Dietterich, Bakiri, 1995
298 | Boosting a weak learning algorithm by majority – Freund, 1995
108 | Error reduction through learning multiple descriptions – Ali, Pazzani, 1996
81 | A Theory of Learning Classification Rules – Buntine, 1990
63 | Multiple decision trees – Kwok, Carter, 1990
62 | Hybrid system for protein secondary structure prediction – Zhang, Mesirov, et al.
40 | Machine learning bias, statistical bias, and statistical variance of decision tree algorithms – Dietterich, Kong, 1995
37 | Stacked generalization. Neural Networks 5 – Wolpert, 1992
33 | Option decision trees with majority votes – Kohavi, Kunz, 1997
24 | Bagging predictors. Machine Learning 24 – Breiman, 1996
23 | Learning probabilistic relational concept descriptions – Ali, 1996
18 | Ensembles as a sequence of classifiers – Asker, Maclin, 1997
13 | Stochastic attribute selection committees – Zheng, Webb, 1998
13 | Naive Bayesian Classifier Committees – Zheng, 1998
7 | Machine learning research – Dietterich, T.G., 1997
4 | Working Notes of AAAI Workshop on Integrating Multiple Learned Models for Improving and Scaling Machine Learning Algorithms (available at http://www.cs.fit.edu/~imlm/papers.html) – Chan, Stolfo, et al. (eds), 1996
3 | Arcing classifiers. Technical Report (available at: http://www.stat – Breiman, 1996
2 | expert-level performance on a science image analysis task by a system using combined artificial neural networks – Cherkauer, 1996
1 | Error correction and error reduction in ensemble classifiers. Connection Science 8 – Tumer, Ghosh, 1996