• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

Matt: local flexibility aids protein multiple structure alignment (2008)

by M Menke
Venue:PLoS Comput. Biol
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 45
Next 10 →

A conditional neural fields model for protein threading

by Jianzhu Ma, et al. , 2012
"... Motivation: Alignment errors are still the main bottleneck for current template-based protein modeling (TM) methods, including protein threading and homology modeling, especially when the sequence identity between two proteins under consideration is low (<30%). Results: We present a novel protein ..."
Abstract - Cited by 11 (2 self) - Add to MetaCart
Motivation: Alignment errors are still the main bottleneck for current template-based protein modeling (TM) methods, including protein threading and homology modeling, especially when the sequence identity between two proteins under consideration is low (&lt;30%). Results: We present a novel protein threading method, CNFpred, which achieves much more accurate sequence–template alignment by employing a probabilistic graphical model called a Conditional Neural Field (CNF), which aligns one protein sequence to its remote template using a non-linear scoring function. This scoring function accounts for correlation among a variety of protein sequence and structure features, makes use of information in the neighborhood of two residues to be aligned, and is thus much more sensitive than the widely used linear or profile-based scoring function. To train this CNF threading model, we employ a novel quality-sensitive method, instead of the standard maximum-likelihood method, to maximize directly the expected quality of the training set. Experimental results show that CNFpred generates significantly better alignments than the best profile-based and threading methods on several public (but small) benchmarks as well as our own large dataset. CNFpred outperforms others regardless of the lengths or classes of proteins, and works particularly well for proteins with sparse sequence profiles due to the effective utilization of structure information. Our methodology can also be adapted to protein sequence alignment.

Iterative refinement of structure-based sequence alignments by Seed Extension

by C Kim, C H Tai, B Lee - BMC Bioinformatics , 2009
"... ..."
Abstract - Cited by 3 (2 self) - Add to MetaCart
Abstract not found
(Show Context)

Citation Context

... use a Monte-Carlo procedure after initial structural alignment [13,14], FATCAT and MATT adopt AFP (aligned fragment pair)-based dynamic programming without constructing initial structural alignments =-=[15,16]-=-, and other programs mostly rely on residue-level dynamic programming algorithm according to various scoring schemes with or without initial rigidbody superposition [17-20]. We previously developed th...

Optimal simultaneous superpositioning of multiple structures with missing data

by Douglas L. Theobald, Phillip A. Steindel - Bioinformatics , 2012
"... Motivation: Superpositioning is an essential technique in structural biology that facilitates the comparison and analysis of conformational differences among topologically similar structures. Performing a superposition requires a one-to-one correspondence, or alignment, of the point-sets in the diff ..."
Abstract - Cited by 2 (0 self) - Add to MetaCart
Motivation: Superpositioning is an essential technique in structural biology that facilitates the comparison and analysis of conformational differences among topologically similar structures. Performing a superposition requires a one-to-one correspondence, or alignment, of the point-sets in the different structures. However, in practice some points are usually “missing ” from several structures, for example when the alignment contains gaps. Current superposition methods deal with missing data simply by superpositioning a subset of points that are shared among all the structures. This practice is inefficient, as it ignores important data, and it fails to satisfy the common least-squares criterion. In the extreme, disregarding missing positions prohibits the calculation of a superposition altogether. Results: Here we present a general solution for determining an optimal superposition when some of the data are missing. We use the Expectation-Maximization algorithm, a classic statistical technique for dealing with incomplete data, to find both maximum likelihood solutions and the optimal least-squares solution as a special case. Availability and Implementation: The methods presented here are implemented in THESEUS 2.0, a program for superpositioning macromolecular structures. ANSI C source code and selected compiled binaries for various computing platforms are freely available under the GNU open source license from
(Show Context)

Citation Context

...all four proteins. usually by excluding many atoms from the calculation (e.g., Birzele et al., 2007; Dror, 2003; Guda et al., 2001; Hill and Reilly, 2006; Konagurthu et al., 2006; Maiti et al., 2004; =-=Menke et al., 2008-=-; Ortiz, 2002; Shatsky et al., 2002; Ye and Godzik, 2005). For the proteins in Figures 1 and 2, standard practice would calculate the superposition based on only the small subset of fully shared resid...

mulPBA: an efficient multiple protein structure alignment method based on a

by Sylvain Léonard, Agnel Praveen Joseph, Jean-christophe Gelly, Re G. De Brevern, F- Paris
"... structural alphabet. ..."
Abstract - Cited by 2 (0 self) - Add to MetaCart
structural alphabet.
(Show Context)

Citation Context

...sRossmann fold (SCOP Ids: 1gd1o1, 1gpba_, 4mdha1, 5ldha1, 6ldha1 and 8adha2)).sThey have been superimposed with different available servers like MASS (Dror,sBenyamini, Nussinov & Wolfson 2003), MATT (=-=Menke et al. 2008-=-), SALIGNs(Madhusudhan, Webb, Marti-Renom, Eswar & Sali 2009) and POSA (Ye & Godziks2005). The values of Nrms, Ngdt and N3.5 underline the difficulty of this superimposition.sBoth MASS and MATT give r...

Redundancy-aware learning of

by Drew Bryant, Yousif Shamoo , 2012
"... protein structure-function relationships by ..."
Abstract - Cited by 2 (0 self) - Add to MetaCart
protein structure-function relationships by
(Show Context)

Citation Context

...44], CATHEDRAL [45], CE [46], VAST [47], HOMSTRAD [48] and MATT [49] have proven capable of recognzing fold similarity among highly divergent protein sequences. Sequencebased approaches include HMMER =-=[50]-=-, PROSITE [51], CLUSTALW [52] and MUSCLE [53], to name a few. Local structure alignment approaches, such as those used for local search and comparison discussed in the previous paragraph provide an ad...

Simultaneous Alignment and Folding of Protein Sequences

by Jérôme Waldispühl, Charles W. O’donnell, Sebastian Will, Srinivas Devadas, Rolf Backofen, Bonnie Berger
"... Abstract. Accurate comparative analysis tools for low-homology proteins remains a difficult challenge in computational biology, especially sequence alignment and consensus folding problems. We present partiFold-Align, the first algorithm for simultaneous alignment and consensus folding of unaligned ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
Abstract. Accurate comparative analysis tools for low-homology proteins remains a difficult challenge in computational biology, especially sequence alignment and consensus folding problems. We present partiFold-Align, the first algorithm for simultaneous alignment and consensus folding of unaligned protein sequences; the algorithm’s complexity is polynomial in time and space. Algorithmically, partiFold-Align exploits sparsity in the set of super-secondary structure pairings and alignment candidates to achieve an effectively cubic running time for simultaneous pairwise alignment and folding. We demonstrate the efficacy of these techniques on transmembrane β-barrel proteins, an important yet difficult class of proteins with few known three-dimensional structures. Testing against structurally derived sequence alignments, partiFold-Align significantly outperforms state-of-the-art pairwise sequence alignment tools in the most difficult low sequence homology case and improves secondary structure prediction where current approaches fail. Importantly, partiFold-Align requires no prior training. These general techniques are widely applicable to many more protein families. partiFold-Align is available at
(Show Context)

Citation Context

...families (28 alignments, see supplementary material for an illustration of the breakdown), and for testing purposes consider this the “correct” pairwise alignment. For structural alignments, the Matt =-=[22]-=- algorithm is used, which has demonstrated state-of-the-art structural alignment accuracy. During analysis, the resulting alignments are then sorted by relative sequence identity 5 (assuming the Matt ...

Smolign: A Spatial Motifs Based Protein Multiple Structural Alignment Method

by Hong Sun, Ahmet Sacan, Hakan Ferhatosmanoglu, Yusu Wang
"... Abstract—Availability of an effective tool for protein multiple structural alignment (MSTA) is essential for discovery and analysis of biologically significant structural motifs that can help solve functional annotation and drug design problems. Existing MSTA methods collect residue correspondences ..."
Abstract - Cited by 1 (0 self) - Add to MetaCart
Abstract—Availability of an effective tool for protein multiple structural alignment (MSTA) is essential for discovery and analysis of biologically significant structural motifs that can help solve functional annotation and drug design problems. Existing MSTA methods collect residue correspondences mostly through pairwise comparison of consecutive fragments, which can lead to suboptimal alignments, especially when the similarity among the proteins is low. We introduce a novel strategy based on: building a contact-window based motif library from the protein structural data, discovery and extension of common alignment seeds from this library, and optimal superimposition of multiple structures according to these alignment seeds by an enhanced partial order curve comparison method. The ability of our strategy to detect multiple correspondences simultaneously, to catch alignments globally, and to support flexible alignments, endorse a sensitive and robust automated algorithm that can expose similarities among protein structures even under low similarity conditions. Our method yields better alignment results compared to other popular MSTA methods, on several protein structure datasets that span various structural folds and represent different protein similarity levels. A web-based alignment tool, a downloadable executable, and detailed alignment results for the datasets used here are available at
(Show Context)

Citation Context

...(flexible) 2.00 177 TABLE 3: Multiple alignment results for the Homstrad benchmark. mRMSD and core size are averages of all Homstrad datasets. The results (except for those of Smolign) are taken from =-=[45]-=-. components separated by a long and flexible alpha helix. Due to bending of this alpha helical segment, it is not possible to simultaneously align the two substructures by a rigid alignment (Figure 2...

Formatt: Correcting Protein Multiple Structural Alignments by Sequence Peeking

by Shilpa Nadimpalli, Noah Daniels, Lenore Cowen
"... We present Formatt, a multiple structure alignment pro-gram based on the Matt purely geometric multiple struc-tural alignment program, that also takes into account se-quence similarity when constructing alignments. We show that Formatt is superior to Matt in alignment quality based on objective meas ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
We present Formatt, a multiple structure alignment pro-gram based on the Matt purely geometric multiple struc-tural alignment program, that also takes into account se-quence similarity when constructing alignments. We show that Formatt is superior to Matt in alignment quality based on objective measures (most notably Staccato sequence and structure scores) while preserving the same advantages in core length and RMSD that Matt has as a flexible structure aligner, as compared to other multiple structure alignment programs on popular benchmark datasets. Applications in-clude producing better training data for threading methods. 1.
(Show Context)

Citation Context

... errors produced by structure alignment programs that do not take sequence into account can be illustrated by an example pair of proteins, aligned by our group’s own structure alignment program, Matt =-=[10]-=-. Figure 1 illustrates how the structural alignments produced are quite similar, but the Formatt sequence alignment has fewer gaps, and thus fewer non-core residues (three) than Matt (five). Note that...

POSA: a user-driven, interactive multiple protein structure alignment server

by Zhanwen Li, Padmaja Natarajan, Yuzhen Ye, Thomas Hrabe, Adam Godzik - Nucleic Acids Res , 2014
"... POSA (Partial Order Structure Alignment), available at ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
POSA (Partial Order Structure Alignment), available at
(Show Context)

Citation Context

...A is used to identify the conserved regions that form the common structural core of a protein family. Such alignments are identified by MPStrA algorithms like MUSTA (3), CEMC (4), MultiProt (5), Matt =-=(6)-=-, MISTRAL (7), Smolign (8), 3DCOMB (9), SALIGN (10), msTALI (11), mulPBA (12),DeepAlign (13),MUSTANG (14), PDBeFold (15) and others (16,17). Despite its age, POSA performs reasonably well when benchma...

Towards Reliable Automatic Protein Structure Alignment

by Xuefeng Cui, Shuai Cheng Li, Dongbo Bu, Ming Li
"... Abstract. A variety of methods have been proposed for structure simi-larity calculation, which are called structure alignment or superposition. One major shortcoming in current structure alignment algorithms is in their inherent design, which is based on local structure similarity. In this work, we ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
Abstract. A variety of methods have been proposed for structure simi-larity calculation, which are called structure alignment or superposition. One major shortcoming in current structure alignment algorithms is in their inherent design, which is based on local structure similarity. In this work, we propose a method to incorporate global information in obtain-ing optimal alignments and superpositions. Our method, when applied to optimizing the TM-score and the GDT score, produces significantly better results than current state-of-the-art protein structure alignment tools. Specifically, if the highest TM-score found by TMalign is lower than 0.6 and the highest TM-score found by one of the tested methods is higher than 0.5, there is a probability of 42 % that TMalign failed to find TM-scores higher than 0.5, while the same probability is reduced to 2 % if our method is used. This could significantly improve the accuracy of fold detection if the cutoff TM-score of 0.5 is used. In addition, existing structure alignment algorithms focus on structure similarity alone and simply ignore other important similarities, such as sequence similarity. Our approach has the capacity to incorporate multi-ple similarities into the scoring function. Results show that sequence sim-ilarity aids in finding high quality protein structure alignments that are more consistent with eye-examined alignments in HOMSTRAD. Even when structure similarity itself fails to find alignments with any con-sistency with eye-examined alignments, our method remains capable of finding alignments highly similar to, or even identical to, eye-examined alignments. 1
(Show Context)

Citation Context

...It should be interesting to allow flexible ROTRANs within the same cluster to find flexible structure alignments as seen in FATCAT [38] and to find flexible multi-structure alignments as seen in Matt =-=[39]-=-. Moreover, the alignment quality can be further studied by evaluating CASP protein structure prediction [31], by checking self-consistency [37], and by simulating the SCOP fold detection [26]. All th...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University