MetaCartSign in to MyCiteSeer

Include Citations | Advanced Search | Help

Include Citations | Advanced Search | Help

  Prediction of contact maps using support vector machines (2003) [5 citations — 0 self]

Download:
Download as a PDF
by Ying Zhao, George Karypis
In Proc. of the IEEE Symposium on BioInformatics and BioEngineering
http://www-users.cs.umn.edu/~karypis/publications/Papers/PDF/bibe068_zhao.pdf
Add To MetaCart

Abstract:

Contact map prediction is of great interest for its application in fold recognition and protein 3D structure determination. In this paper we present a contact-map prediction algorithm that employs Support Vector Machines as the machine learning tool and incorporates various features such as sequence profiles and their conservation, correlated mutation analysis based on various amino acid physicochemical properties, and secondary structure. In addition, we evaluated the effectiveness of the different features on contact map prediction for different fold classes. On average, our predictor achieved a prediction accuracy of 0.2238 with an improvement over a random predictor of a factor 11.7, which is better than reported studies. Our study showed that predicted secondary structure features play an important roles for the proteins containing beta structures. Models based on secondary structure features and CMA features produce different sets of predictions. Our study also suggests that models learned separately for different protein fold families may achieve better performance than a unified model. 1

Citations

4514 Statistical Learning Theory – Vapnik - 1998
697 Making large-Scale SVM Learning Practical – Joachims - 1999
526 CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res – Thompson, Higgins, et al. - 1994
457 SCOP: a structural classification of proteins database for the investigation of sequences and structures – Murzin, Brenner, et al. - 1995
121 CATH—a hierarchic classification of protein domain structures – ORENGO, MICHIE, et al. - 1997
60 DT: GenTHREADER: an efficient and reliable protein fold recognition method for genomic sequences – Jones - 1999
33 AAindex: amino acid index database – Kawashima, Kanehisa - 2000
22 Impact of local and non-local interactions on thermodynamics and kinetics of protein folding – Abkevich, Gutin, et al. - 1995
20 Effective Use of Sequence Correlation and Conservation in Fold Recognition – Olmea, Rost, et al. - 1999
15 Prediction of contact maps with neural networks and correlated mutations – Fariselli, Olmea, et al. - 2001
13 The Protein Data Bank and the challenge of structural genomics – Berman, Bhat, et al. - 2000
13 Tests for comparing related amino-acid sequences – McLachlan - 1971
12 Coevolving protein residues: maximum likelihood identification and relationship to structure – Pollock, Taylor, et al. - 1999
12 Mining residue contacts in proteins using local structure predictions – Zaki, Jin, et al. - 2000
10 Predicting protein stability changes upon mutation using database-derived potentials: solvent accessibility determines the importance of local versus non-local interactions along the sequence – Gilis, Rooman - 1997
8 Prediction of contact maps by recurrent neural network architectures and hidden context propagation from all four cardinal corners – Pollastri, Baldi - 2002
7 Progress in predicting inter-residue contacts of proteins with neural networks and correlated mutations – Fariselli, Olmea, et al. - 2001
7 The folding of an enzyme. ii. substructure of barnase and the contribution of different interactions to protein stability – Serrano, Kellis, et al. - 1992
7 The prediction of protein contacts from multiple sequence alignments – DJ, Casari, et al.
6 The folding of an enzyme. i. theory of protein engineering analysis of stability and pathways of protein folding – Fersht, Matouschek, et al. - 1992
5 Effectiveness of correlation analysis in identifying protein residues undergoing correlated evolution – Pollock, Taylor - 1997
2 Exploring local and non-local interactions for protein stability by structural motif engineering – Niggemann, Steipe
2 Evaluation of a novel method for the identification of coevolving protein residues – Pritchard, Bladon, et al.
2 Recognition of protein structure: Determining the relative energetic contributions of beta-strands, alpha-helices and loops – Reva, Topiol - 2000