• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

MUSCLE: multiple sequence alignment with high accuracy and high throughput (2004)

by R C Edgar
Venue:in Nucleic Acids Research
Add To MetaCart

Tools

Sorted by:
Results 1 - 10 of 2,510
Next 10 →

A New Voronoi-Based Surface Reconstruction Algorithm

by Nina Amenta, Marshall Bern, Manolis Kamvysselis , 2002
"... We describe our experience with a new algorithm for the reconstruction of surfaces from unorganized sample points in R³. The algorithm is the first for this problem with provable guarantees. Given a “good sample” from a smooth surface, the output is guaranteed to be topologically correct and converg ..."
Abstract - Cited by 414 (9 self) - Add to MetaCart
We describe our experience with a new algorithm for the reconstruction of surfaces from unorganized sample points in R³. The algorithm is the first for this problem with provable guarantees. Given a “good sample” from a smooth surface, the output is guaranteed to be topologically correct and convergent to the original surface as the sampling density increases. The definition of a good sample is itself interesting: the required sampling density varies locally, rigorously capturing the intuitive notion that featureless areas can be reconstructed from fewer samples. The output mesh interpolates, rather than approximates, the input points. Our algorithm is based on the three-dimensional Voronoi diagram. Given a good program for this fundamental subroutine, the algorithm is quite easy to implement.

Pfam: clans, web tools and services

by Robert D. Finn, Jaina Mistry, Benjamin Schuster-böckler, Sam Griffiths-jones, Volker Hollich, Timo Lassmann, Simon Moxon, Mhairi Marshall, Ajay Khanna, Richard Durbin, Sean R. Eddy, Erik L. L. Sonnhammer, Alex Bateman - Nucleic Acids Res , 2006
"... Pfam is a database of protein families that currently contains 7973 entries (release 18.0). A recent development in Pfam has enabled the grouping of related families into clans. Pfam clans are described in detail, together with the new associated web pages. Improvements to the range of Pfam web tool ..."
Abstract - Cited by 296 (13 self) - Add to MetaCart
Pfam is a database of protein families that currently contains 7973 entries (release 18.0). A recent development in Pfam has enabled the grouping of related families into clans. Pfam clans are described in detail, together with the new associated web pages. Improvements to the range of Pfam web tools and the first set of Pfam web services that allow programmatic access to the database and associated tools are also presented. Pfam is available on the web in the UK
(Show Context)

Citation Context

...e links to a visualization of the profile–profile alignment (10) (Figure 1C). The clan alignment is an alignment of all the clan seed alignments (Figure 1D). These are produced by an option in MUSCLE =-=(11)-=- that aligns two input multiple sequencesD250 Nucleic Acids Research, 2006, Vol. 34, Database issue Table 2. Summary of the new website features and web services, including server location Feature Mir...

PROBCONS: Probabilistic consistency-based multiple sequence alignment

by Chuong B. Do, Mahathi S. P. Mahabhashyam, Michael Brudno, Serafim Batzoglou - Genome Res , 2005
"... To study gene evolution across a wide range of organisms, biologists need accurate tools for multiple sequence alignment of protein families. Obtaining accurate alignments, however, is a difficult computational problem because of not only the high computational cost but also the lack of proper objec ..."
Abstract - Cited by 256 (10 self) - Add to MetaCart
To study gene evolution across a wide range of organisms, biologists need accurate tools for multiple sequence alignment of protein families. Obtaining accurate alignments, however, is a difficult computational problem because of not only the high computational cost but also the lack of proper objective functions for measuring alignment quality. In this paper, we introduce prob-abilistic consistency, a novel scoring function for multiple sequence comparisons. We present PROBCONS, a practical tool for progressive protein multiple sequence alignment based on prob-abilistic consistency, and evaluate its performance on several standard alignment benchmark datasets. On the BAliBASE, SABmark, and PREFAB benchmark alignment databases, PROB-CONS achieves statistically significant improvement over other leading methods while maintain-ing practical speed. PROBCONS is publicly available as a web resource. Source code and execu-tables are available under the GNU Public License at
(Show Context)

Citation Context

...listic consistency methodology. We compared our systems against several current leading alignment tools including Align-m (Van Walle et al. 2004), CLUSTALW, DIALIGN, MAFFT (Katoh et al 2002), MUSCLE (=-=Edgar 2004-=-), and T-Coffee on the BAliBASE, SABmark (Van Walle et al. 2004), and PREFAB (Edgar 2004) benchmark alignment databases. In this comparison, PROBCONS shows a clear statistically significant improvemen...

Pfam: clans, web tools and services. Nucleic Acids Res

by Robert D. Finn, Jaina Mistry, Benjamin Schuster-böckler, Sam Griffiths-jones, Volker Hollich, Timo Lassmann, Simon Moxon, Mhairi Marshall, Ajay Khanna, Richard Durbin, Sean R. Eddy, Erik L. L. Sonnhammer, Alex Bateman , 2006
"... doi:10.1093/nar/gkj149 ..."
Abstract - Cited by 127 (0 self) - Add to MetaCart
doi:10.1093/nar/gkj149
(Show Context)

Citation Context

...e links to a visualization of the profile–profile alignment (10) (Figure 1C). The clan alignment is an alignment of all the clan seed alignments (Figure 1D). These are produced by an option in MUSCLE =-=(11)-=- that aligns two input multiple sequenceD250 Nucleic Acids Research, 2006, Vol. 34, Database issue Table 2. Summary of the new website features and web services, including server location Feature Mir...

Predicting the functional impact of protein mutations: application to cancer genomics

by Boris Reva, Yevgeniy Antipin, Chris S - Nucleic Acids Res. 39, e118. ACS Chemical Biology Letters dx.doi.org/10.1021/cb500347p | ACS Chem. Biol , 2011
"... As large-scale re-sequencing of genomes reveals many protein mutations, especially in human cancer tissues, prediction of their likely functional impact becomes important practical goal. Here, we introduce a new functional impact score (FIS) for amino acid residue changes using evolutionary conserva ..."
Abstract - Cited by 102 (3 self) - Add to MetaCart
As large-scale re-sequencing of genomes reveals many protein mutations, especially in human cancer tissues, prediction of their likely functional impact becomes important practical goal. Here, we introduce a new functional impact score (FIS) for amino acid residue changes using evolutionary conservation patterns. The information in these patterns is derived from aligned families and sub-families of sequence homologs within and between species using combinatorial entropy formalism. The score performs well on a large set of human protein mutations in separating disease-associated variants (19200), assumed to be strongly functional, from common polymorphisms (35600), assumed to be weakly functional (area under the receiver operating characteristic curve of 0.86). In cancer, using recurrence, multiplicity and annotation for 10000 mutations in the COSMIC database, the method does well in assign-ing higher scores to more likely functional mutations (‘drivers’). To guide experimental prioritization, we report a list of about 1000 top human cancer genes frequently mutated in one or more cancer types ranked by likely functional impact; and, an additional 1000 candidate cancer genes with rare but likely functional mutations. In addition, we estimate that at least 5 % of cancer-relevant mutations involve switch of function, rather than simply loss or gain of function.
(Show Context)

Citation Context

...the BLAST program (60) retrieving up to 700 homologous sequences at an E-value threshold of 0.05 from the Uniprot sequence database (61); the resulting hits were then aligned using the MUSCLE program =-=(62)-=-. In place of ið! Þ we also use the notation FIS (functional impact score based on evolutionary information) or simply the word ‘score’ in tables and computer output. (Details of derivation of the ...

OrthoMCL-DB: querying a comprehensive multi-species collection of ortholog groups

by Feng Chen, Aaron J. Mackey, Christian J. Stoeckert, David S. Roos - Nucleic Acids Res , 2006
"... of ortholog groups ..."
Abstract - Cited by 96 (12 self) - Add to MetaCart
of ortholog groups
(Show Context)

Citation Context

... E-value (based on log 5[E-value]), average percent coverage (fraction of aligned regions, based on the shorter sequence) and average percent identity. In addition, MUSCLE multiple sequence alignment =-=(12)-=- and BioLayout graphical visualization of sequence similarities (13) are provided for groups with <100 proteins. The 10OrthoMCL-DB web interface is run by Perl CGI scripts that implement a simple MVC ...

Ancestral polyploidy in seed plants and angiosperms.

by Yuannian Jiao , Norman J Wickett , Saravanaraj Ayyampalayam , André S Chanderbali , Lena Landherr , Paula E Ralph , Lynn P Tomsho , Yi Hu , Haiying Liang , Pamela S Soltis , Douglas E Soltis , Sandra W Clifton , Scott E Schlarbaum , Stephan C Schuster , Hong Ma , Jim Leebens-Mack , Claude W Depamphilis - Nature , 2011
"... ..."
Abstract - Cited by 95 (1 self) - Add to MetaCart
Abstract not found

Phytozome: a comparative platform for green plant

by David M. Goodstein, Shengqiang Shu, Russell Howson, Rochak Neupane, Richard D. Hayes, Joni Fazo, Therese Mitros, William Dirks, Uffe Hellsten, Nicholas Putnam, Daniel S. Rokhsar , 2011
"... genomics ..."
Abstract - Cited by 83 (1 self) - Add to MetaCart
Abstract not found
(Show Context)

Citation Context

...es will be added to a parent family as paralogs if they have a hit to the parent that is stronger than the parent’s best outgroup hit. This process is repeated down to the root node. MSAs from MUSCLE =-=(45)-=- and Hidden Markov Model (HMM) profiles from HMMER3 (46) are created for each core family. These profiles are used to ‘pledge’ peptides from non-core genomes into existing core families using HMMScan ...

M-Coffee: combining multiple sequence alignment methods with T-Coffee

by Iain M. Wallace, Desmond G. Higgins, Cedric Notredame - Nucleic Acids Res , 2006
"... We introduce M-Coffee, a meta-method for assembling multiple sequence alignments (MSA) by combining the output of several individual methods into one single MSA. M-Coffee is an extension of T-Coffee and uses consistency to estimate a consensus alignment. We show that the procedure is robust to varia ..."
Abstract - Cited by 60 (13 self) - Add to MetaCart
We introduce M-Coffee, a meta-method for assembling multiple sequence alignments (MSA) by combining the output of several individual methods into one single MSA. M-Coffee is an extension of T-Coffee and uses consistency to estimate a consensus alignment. We show that the procedure is robust to variations in the choice of constituent methods and reasonably tolerant to duplicate MSAs. We also show that performances can be improved by carefully selecting the constituent methods. M-Coffee outperforms all the individual methods on three major reference datasets: HOMSTRAD, Prefab and Balibase. We also show that on a case-by-case basis, M-Coffee is twice as likely to deliver the best alignment than any individual method. Given a collection of pre-computed MSAs, M-Coffee has similar CPU requirements to the original T-Coffee. M-Coffee is a freeware open-source package available from
(Show Context)

Citation Context

...accuracy improvement over existing methods. Since then, consistency based objective functions have been used within several new multiple alignment packages, including POA (13), MAFFT 5 (14), Muscle 6 =-=(5)-=-, ProbCons (15) and PCMA (16). More than 50 MSA methods have been described over the last 10 years (Medline, January 08, 2006), with no less than 20 new publications in 2005 alone. The complexity and ...

Probalign: Multiple sequence alignment using partition function posterior probabilities

by Usman Roshan, Dennis R. Livesay - Bioinformatics , 2006
"... Motivation: The maximum expected accuracy optimization criterion for multiple sequence alignment uses pairwise posterior probabilities of residues to align sequences. The partition function methodology is one way of estimating these probabilities. Here, we combine these two ideas for the first time ..."
Abstract - Cited by 59 (7 self) - Add to MetaCart
Motivation: The maximum expected accuracy optimization criterion for multiple sequence alignment uses pairwise posterior probabilities of residues to align sequences. The partition function methodology is one way of estimating these probabilities. Here, we combine these two ideas for the first time to construct maximal expected accuracy sequence alignments. Results: We bridge the two techniques within the program Probalign. Our results indicate that Probalign alignments are generally more accurate than other leading multiple sequence alignment methods (i.e., Probcons, MAFFT, and MUSCLE) on the BAliBASE 3.0 protein alignment benchmark. Similarly, Probalign also outperforms these methods on the HOMSTRAD and OXBENCH benchmarks. Probalign ranks statistically significantly highest (P-value &lt; 0.005) on all three benchmarks. Deeper scrutiny of the technique indicates that the improvements are largest on datasets containing N/C terminal extensions and on datasets containing long and heterogeneous length proteins. These points are demonstrated on both real and simulated data. Finally, our method also produces accurate alignments on long and heterogeneous length datasets containing protein repeats. There, alignment accuracy scores are at least 10% and 15 % higher than the other three methods when standard deviation of length is at least 300 and 400 respectively. Availability: Open source code implementing Probalign as well as for producing the simulated data, and all real and simulated data are freely available from
(Show Context)

Citation Context

...ment methods are calculated using the Friedman rank test (Kanji 1999), which is a standard measure used for discriminating alignments in benchmarking studies (Thompson et. al., 1999; Do et al., 2005; =-=Edgar 2004-=-; Katoh et al., 2005). Roughly speaking, the lower the reported P-value the less likely it is that the difference in ranking between the methods is due to chance. We consider P-values below 0.05 (a st...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University