
Pattern Discovery In Sequence Databases: Algorithms And Applications To DNA/Protein Classification (1997)
Gung-Wei Chirn
Department of Computer and Information Science, New Jersey Institute of Technology



View or download:
njit.edu/~chin/thesis.ps.gz

Abstract: Sequence databases comprise sequence data, which are linear structural descriptions of many natural entities. Approximate pattern discovery in a sequence database can lead to important conclusions or prediction of new phenomena. Traditional database technology is not suitable for accomplishing the task, and new techniques need to be developed. In this dissertation, we propose several new techniques for discovering patterns in sequence databases. Our techniques incorporate pattern matching...

Cited by:
New Techniques for DNA Sequence Classification - Wang, Rozen, Shapiro, Shasha, .. (1999)

BibTeX entry:

Chirn, G.W. 1996. Pattern discovery in sequence databases: Algorithms and applications to DNA/protein classification. Ph.D. Dissertation, Department of Computer and Information Science, New Jersey Institute of Technology. http://citeseer.ist.psu.edu/chirn97pattern.html

@phdthesis{chirn96pattern,
  author = "Gung-Wei Chirn",
  title = "Pattern discovery in sequence databases: Algorithms and applications to {DNA}/protein classification",
  school = "Department of Computer and Information Science, New Jersey Institute of Technology",
  year = "1996",
  url = "http://citeseer.ist.psu.edu/chirn97pattern.html"
}
