See this document in CiteSeerX!

A Linear Time Algorithm for Finding All Maximal Scoring Subsequences (1999)  (Make Corrections)  (6 citations)
Walter L. Ruzzo, Martin Tompa
Seventh International Conference on Intelligent Systems for Molecular Biology



  Home/Search   Context   Related

 
View or download:
washington.edu/homes/tompa/...maxseq.ps
Cached:  PDF   PS.gz  PS  Image  Update  Help

From:  washington.edu/homes/tompa/pap... (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: Given a sequence of real numbers ("scores"), we present a practical linear time algorithm to find those nonoverlapping, contiguous subsequences having greatest total scores. This improves on the best previously known algorithm, which requires quadratic time in the worst case. The problem arises in biological sequence analysis, where the high-scoring subsequences correspond to regions of unusual composition in a nucleic acid or protein sequence. For instance, Altschul, Karlin, and others ... (Update)

Context of citations to this paper:   More

...negative and there are lower and upper bounds on the size of a tile. The papers which most closely relate to our work are the references [20] and [28] The authors in [20] provide an O(n) time algorithm to find all maximal scoring subsequences of a sequence of length n. In...

.... a linear algorithm for finding the maximal substrings and superstrings of two given strings was suggested in the context of bioinformatics [38]. Embedding a variation of this algorithm in our system will possibly reduce the complexity of string matching. Our home grown tool,...

Cited by:   More
Estimating Seed Sensitivity on Homogeneous Alignments - Kucherov, Noé, Ponty   (Correct)
Algorithms for Finding Maximal-Scoring Segment Sets (Extended.. - Csürös   (Correct)
Fast Optimal Genome Tiling with Applications to.. - Berman, Bertone..   (Correct)

Similar documents (at the sentence level):
12.5%:   Lecture Notes on Biological Sequence Analysis - Tompa (2000)   (Correct)

Active bibliography (related documents):   More   All
1.2:   A Linear Time Algorithm for Finding All Maximal Scoring.. - Ruzzo, Tompa (1999)   (Correct)
0.2:   Bootstrapping and Normalization for Enhanced Evaluations of.. - Green, Brenner (2002)   (Correct)
0.2:   Biological Sequence Comparison: An Overview of Techniques - Murphy (1994)   (Correct)

Similar documents based on text:   More   All
0.5:   MaxSubSeq: an algorithm for finding all the.. - Fariselli.. (2002)   (Correct)
0.4:   On certain formulas of Karlin and Szegö - Leclerc   (Correct)
0.3:   Non-symmetric Score Matrices and the Detection of.. - Mueller, Rahmann.. (2001)   (Correct)

Related documents from co-citation:   More   All
5:   Identification of common molecular subsequences - Smith, Waterman - 1981
3:   Introduction to algorithms (context) - Cormen, Leiserson et al. - 1991
3:   A basic local alignment search tool (context) - Altschul, Gish et al. - 1990

BibTeX entry:   (Update)

Walter L. Ruzzo and Martin Tompa. A linear time algorithm for finding all maximal scoring subsequences. In Proceedings of the Seventh International Conference on Intelligent Systems for Molecular Biology, pages 234--241, Heidelberg, Germany, August 1999. AAAI Press. http://citeseer.ist.psu.edu/article/ruzzo99linear.html   More

@inproceedings{ ruzzo99linear,
    author = "W. L. Ruzzo and M. Tompa",
    title = "A Linear Time Algorithm for Finding All Maximal Scoring Subsequences",
    booktitle = "Seventh International Conference on Intelligent Systems for Molecular Biology",
    address = "Heidelberg, Germany",
    pages = "234--241",
    year = "1999",
    url = "citeseer.ist.psu.edu/article/ruzzo99linear.html" }
Citations (may not include all citations):
240   Identification of common molecular subsequences - Smith, Waterman - 1981
78   Programming Pearls (context) - Bentley - 1986
62   Introduction to Algorithms: A Creative Approach (context) - Manber - 1989
60   A simple method for displaying the hydropathic character of .. (context) - Kyte, Doolittle - 1982
25   Applications and statistics for multiple high-scoring segmen.. (context) - Karlin, Altschul - 1993
16   A new algorithm for best subsequence alignments with applica.. - Waterman, Eggert - 1987
15   Proofs as programs (context) - Bates, Constable - 1985
13   Locally optimal subalignments using nonlinear similarity fun.. (context) - Altschul, Erickson
13   Pattern recognition in genetic sequences by mismatch density (context) - Sellers - 1984
10   Strong limit theorems of empirical functionals for large exc.. (context) - Dembo, Karlin - 1991
8   Statistical composition of high-scoring segments from molecu.. (context) - Karlin, Dembo et al. - 1990
5   Compositional biases of bacterial genomes and evolutionary i.. (context) - Karlin, Mrazek et al. - 1997
4   A nonlinear measure of subalignment similarity and its signi.. (context) - Altschul, Erickson
3   Chance and significance in protein and DNA sequence analysis (context) - Karlin, Brendel - 1992
1   Evaluating the statistical significance of multiple distinct.. (context) - Altschul - 1997
1   Statistical-methods and insights for protein and 7th Intl (context) - Karlin, Bucher et al. - 1991

Documents on the same site (http://www.cs.washington.edu/homes/tompa/papers/):   More
An Exact Method for Finding Short Motifs in Sequences, with.. - Tompa (1999)   (Correct)
Lower Bounds on Universal Traversal Sequences Based on Chains of.. - Buss (1995)   (Correct)
Communication-Space Tradeoffs For Unrestricted Protocols - Beame, Tompa, Yan (1994)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC