
Bagging-Like Effects for Decision Trees and Neural Nets in Protein Secondary Structure Prediction (2001)
Nitesh Chawla, Thomas E. Moore, Jr., Kevin W. Bowyer, Lawrence O. Hall, Clayton Springer, Philip Kegelmeyer
ACM SIGKDD Workshop on Data Mining in Bio-Informatics



View or download:
rpi.edu/~zaki/BIOKDD01/....chawla.ps.gz

From:  rpi.edu/~zaki/BIOKDD01/

Abstract: In the Third Critical Assessment of Techniques for Protein Structure Prediction ("CASP-3") contest, the best performance was obtained with a classifier that uses neural networks, a window size of fifteen around a given amino acid, and a training set of about 299,186 amino acids. We set out to investigate the possibility of obtaining better performance by using a bagging-like committee of binary decision trees, created using an order-of-magnitude more training data. There are two main reasons to ...
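
The two ideas named in the abstract, a fixed-width window around each amino acid and a committee whose members vote, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the feature encoding, the padding scheme, or the voting rule, so the padding symbol, the plain majority vote, the function names `windows` and `committee_vote`, and the toy committee members below are all assumptions made for illustration.

```python
from collections import Counter

WINDOW = 15   # window size quoted in the abstract
PAD = "-"     # hypothetical padding symbol for positions beyond the chain ends


def windows(sequence, size=WINDOW):
    """Return one fixed-width window per residue, centred on that residue."""
    half = size // 2
    padded = PAD * half + sequence + PAD * half
    return [padded[i:i + size] for i in range(len(sequence))]


def committee_vote(classifiers, window):
    """Return the label predicted by the most committee members (assumed rule)."""
    votes = Counter(clf(window) for clf in classifiers)
    return votes.most_common(1)[0][0]


# Toy committee: each member is any callable mapping a window to a label.
# Real members would be trained decision trees or neural networks.
toy_committee = [lambda w: "H", lambda w: "H", lambda w: "C"]

first_window = windows("ACDEFGHIK")[0]
print(committee_vote(toy_committee, first_window))  # prints H
```

Each committee member only needs to be callable on a window, so trained models of any kind could be substituted for the toy lambdas.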

Cited by:
Why are Neural Networks Sometimes Much More Accurate.. - Hall, Liu, Bowyer..

Similar documents (at the sentence level):
22.3%:   Bagging Is A Small-Data-Set Phenomenon - Nitesh Chawla Thomas (2001)
15.6%:   Distributed Learning with Bagging-Like Performance - Nitesh Chawla Thomas (2003)

Active bibliography (related documents):
1.9:   Bagging-Like Effects for Decision Trees and Neural Nets in.. - Thomas (2001)
0.3:   Informatica 28 Page Xxx--Yyy 1 - Improve Prediction With
0.3:   Improve Prediction with Remote Learners in Internet Environment - Zhou, Li, Dai (2005)

Similar documents based on text:
0.7:   Distributed Pasting of Small Votes - Bowyer (2002)
0.7:   Creating Ensembles of Classifiers - Nitesh Chawla Steven (2000)
0.6:   SMOTE: Synthetic Minority Over-sampling Technique - Chawla, Bowyer, Hall.. (2002)

BibTeX entry:

N. Chawla, T.E. Moore, K.W. Bowyer, L.O. Hall, C. Springer, and W.P. Kegelmeyer. Bagging-like effects for decision trees and neural nets in protein secondary structure prediction. In BIOKDD01: Workshop on Data Mining in Bioinformatics at KDD01, pages 50--59, 2001. http://citeseer.ist.psu.edu/chawla01bagginglike.html

@inproceedings{ chawlacvpr,
  author = "Nitesh V. Chawla and Thomas E. Moore and Lawrence O. Hall and Kevin W. Bowyer and W. Philip Kegelmeyer and Clayton Springer",
  title = "Bagging-Like Effects for Decision Trees and Neural Nets in Protein Secondary Structure Prediction",
  booktitle = "ACM SIGKDD Workshop on Data Mining in Bio-Informatics",
  pages = "50--59",
  year = 2001,
  url = "citeseer.ist.psu.edu/chawla01bagginglike.html" }

Citations (may not include all citations):
2177   C4.5: Programs for Machine Learning - Quinlan - 1992
696   UCI repository of machine learning databases - Blake, Merz - 1998
657   Bagging predictors - Breiman - 1996
372   The cascade-correlation learning architecture - Fahlman, Lebiere - 1990
180   Boosting a weak learning algorithm by majority - Freund - 1995
155   An empirical comparison of voting classification algorithms:.. - Bauer, Kohavi - 1999
145   Sprint: A scalable parallel classifier for data mining - Shafer, Agrawal et al. - 1996
59   Towards parallel and distributed learning by meta-learning - Chan, Stolfo - 1993
44   Protein secondary structure prediction based on decision-spe.. - Jones - 1999
43   Cached sufficient statistics for efficient machine learning .. - Moore, Lee - 1998
29   Learning arbiter and combiner trees from partitioned data fo.. - Chan, Stolfo - 1995
22   Large datasets lead to overly complex models: an explanation.. - Oates, Jensen
18   Pasting bites together for prediction in large data sets - Breiman - 1999
18   Efficient progressive sampling - Provost, Jensen et al. - 1999
16   Why does bagging work? A Bayesian account and its implications - Domingos - 1997
8   Parallel induction algorithms for data mining - Darlington, Guo et al. - 1997
6   Decision tree learning on very large data sets - Hall, Chawla et al. - 1998
6   A parallel decision tree builder for mining very large visua.. - Bowyer, Chawla et al. - 2000
6   Learning rules from distributed data - Hall, Chawla et al. - 1999
5   Distributed learning on very large data sets - Hall, Bowyer et al. - 2000
3   Scaling up: Distributed machine learning with cooperation - Provost, Hennessy - 1996
2   Scaling learning by metalearning over disjoint and partially .. - Chan, Stolfo - 1996
http://predictioncenter.llnl.gov/
http://www.sandia.gov/ASCI/Red/

Documents on the same site (http://www.cs.rpi.edu/~zaki/BIOKDD01/):
Classification of Genes Using Probabilistic Models of.. - Pavlidis, Tang (2001)
Analysis Of An Associative Memory Neural Network For Pattern.. - Bicciato (2001)
Learning to Recognize Brain Specific Proteins Based on .. - Huss, Boström, Asker, ..
