Results 1 - 10
of
77
Hidden markov models that use predicted local structure for fold recognition: alphabets of backbone geometry
- Proteins
, 2003
"... An important problem in computational biology is predicting the structure of the large number of pu-tative proteins discovered by genome sequencing projects. Fold-recognition methods attempt to solve the problem by relating the target proteins to known structures, searching for template proteins hom ..."
Abstract
-
Cited by 70 (11 self)
- Add to MetaCart
(Show Context)
An important problem in computational biology is predicting the structure of the large number of pu-tative proteins discovered by genome sequencing projects. Fold-recognition methods attempt to solve the problem by relating the target proteins to known structures, searching for template proteins homologous to the target. Remote homologs which may have significant structural similarity are often not detectable by sequence similarities alone. To address this, we incorporated predicted local structure, a generalization of secondary structure, into two-track profile HMMs. We did not rely on a simple helix-strand-coil definition of secondary structure,
Small libraries of protein fragments model native protein structures accurately
- J. Mol. Biol
, 2002
"... The three-dimensional structure of proteins has been a subject of intense study for several decades. A common way to simplify these complex structures is to consider restrictions on the local mainchain ..."
Abstract
-
Cited by 57 (8 self)
- Add to MetaCart
The three-dimensional structure of proteins has been a subject of intense study for several decades. A common way to simplify these complex structures is to consider restrictions on the local mainchain
A hidden Markov model derived structural alphabet for proteins
- J Mol Biol
, 2004
"... Understanding and predicting protein structures depend on the complexity and the accuracy of the models used to represent them. We have recently set up a Hidden Markov Model to optimally compress protein three-dimensional conformations into a one-dimensional series of letters of a structural alphabe ..."
Abstract
-
Cited by 51 (10 self)
- Add to MetaCart
Understanding and predicting protein structures depend on the complexity and the accuracy of the models used to represent them. We have recently set up a Hidden Markov Model to optimally compress protein three-dimensional conformations into a one-dimensional series of letters of a structural alphabet. Such a model learns simultaneously the shape of representative structural letters describing the local conformation and the logic of their connections, i.e. the transition matrix between the letters. Here, we move one step further and report some evidence that such a model of protein local architecture also captures some accurate amino acid features. All the letters have specific and distinct amino acid distributions. Moreover, we show that words of amino acids can have significant propensities for some letters. Perspectives point towards the prediction of the series of letters describing the structure of a protein from its amino acid sequence. D 2005 Elsevier B.V. All rights reserved.
SAM-T08, HMM-based Protein Structure Prediction
, 2009
"... The SAM-T08 web server is a protein-structure prediction server that provides several useful intermediate results in addition to the final predicted 3D structure: three multiple sequence alignments of putative homologs using different iterated search procedures, prediction of local structure feature ..."
Abstract
-
Cited by 32 (4 self)
- Add to MetaCart
The SAM-T08 web server is a protein-structure prediction server that provides several useful intermediate results in addition to the final predicted 3D structure: three multiple sequence alignments of putative homologs using different iterated search procedures, prediction of local structure features including various backbone and burial properties, calibrated E-values for the significance of template searches of PDB, and residue-residue contact predictions. The server has been validated as part of the CASP8 assessment of structure prediction as having good performance across all classes of predictions.
PREDICT-2ND: a tool for generalized protein local structure prediction
, 2008
"... MOTIVATION: Predictions of protein local structure, derived from sequence alignment information alone, provide visualization tools for biologists to evaluate the importance of amino acid residue positions of interest in the absence of X-ray crystal/NMR structures or homology models. They are also us ..."
Abstract
-
Cited by 15 (2 self)
- Add to MetaCart
MOTIVATION: Predictions of protein local structure, derived from sequence alignment information alone, provide visualization tools for biologists to evaluate the importance of amino acid residue positions of interest in the absence of X-ray crystal/NMR structures or homology models. They are also useful as inputs to sequence analysis and modeling tools such as hidden Markov models (HMMs), which can be used to search for homology in databases of known protein structure. In addition, local structure predictions can be used as a component of cost functions in genetic algorithms that predict protein tertiary structure. We have developed a program (PREDICT-2ND) that trains multilayer neural networks and have applied it to numerous local structure alphabets, tuning network parameters such as the number of layers, the number of units in each layer, and the window sizes of each layer. We have had the most success with four-layer networks, with gradually increasing window sizes at each layer. RESULTS: Because the four-layer neural nets occasionally get trapped in poor local optima, our training protocol now uses many different random starts, with short training runs, followed by more training on the best performing networks from the short runs. One recent addition to the program is the option to add a guide sequence to the profile inputs, increasing the number of inputs per position by 20. We find that use of a guide sequence provides a small but consistent improvement in the predictions for several different local-structure alphabets. AVAILABILITY: Local structure prediction with the methods described here is available for use online at
Protein block expert (pbe): a web-based protein structure analysis server using a structural alphabet
- Nucl. Acids. Res
, 2006
"... Encoding protein 3D structures into 1D string using short structural prototypes or structural alphabets opens a new front for structure comparison and analysis. Using the well-documented 16 motifs of Protein Blocks (PBs) as structural alphabet, we have developed a methodology to compare protein stru ..."
Abstract
-
Cited by 14 (5 self)
- Add to MetaCart
(Show Context)
Encoding protein 3D structures into 1D string using short structural prototypes or structural alphabets opens a new front for structure comparison and analysis. Using the well-documented 16 motifs of Protein Blocks (PBs) as structural alphabet, we have developed a methodology to compare protein structures that are encoded as sequences of PBs by aligning them using dynamic programming which uses a substitution matrix for PBs. This methodology is implemented in the applications available in Protein Block Expert (PBE) server. PBE addresses common issues in the field of protein structure analysis such as comparison of proteins structures and identification of protein structures in structural databanks that resemble a given structure. PBE-T provides facility to transform any PDB file into sequences of PBs. PBE-ALIGNc performs comparison of two protein structures based on the alignment of their corresponding PB sequences. PBE-ALIGNm is a facility for mining SCOP database for similar structures based on the alignment of PBs. Besides, PBE provides an interface to a database (PBE-SAdb) of preprocessed PB sequences from SCOP culled at 95 % and of all-against-all pairwise PB alignments at family and superfamily levels. PBE server is freely available at
On questions of structure
- Journal of Computational K-Theory
, 2007
"... motifs across protein families ..."
(Show Context)
Calibrating E-values for hidden Markov models with reverse-sequence null models
- Bioinformatics
, 2005
"... Motivation: Hidden Markov models (hmms) calculate the probability that a sequence was generated by a given model. Log-odds scoring provides a context for evaluating this probability, by considering it in relation to a null hypothesis. We have found that using a reverse-sequence null model effectivel ..."
Abstract
-
Cited by 9 (3 self)
- Add to MetaCart
Motivation: Hidden Markov models (hmms) calculate the probability that a sequence was generated by a given model. Log-odds scoring provides a context for evaluating this probability, by considering it in relation to a null hypothesis. We have found that using a reverse-sequence null model effectively removes biases due to sequence length and composition and reduces the number of false positives in a database search. Any scoring system is an arbitrary measure of the quality of database matches. Significance estimates of scores are essential, because they eliminate model- and methoddependent
Protein structural motif prediction in multidimensional phi-psi space leads to improved secondary structure prediction
- J. Comput. Biol
, 2006
"... Provided by the author(s) and University College Dublin Library in accordance with publisher policies. Please ..."
Abstract
-
Cited by 7 (2 self)
- Add to MetaCart
Provided by the author(s) and University College Dublin Library in accordance with publisher policies. Please