See this document in CiteSeerX!

Text Searching: Theory and Practice  (Make Corrections)  
Ricardo A. Baeza-Yates, Gonzalo Navarro



  Home/Search   Context   Related

 
View or download:
dcc.uchile.cl/~gnavarro/p...fla03.ps.gz
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  dcc.uchile.cl/~gnavarro/publ (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: We present the state of the art of the main component of text retrieval systems: the search engine. We outline the main lines of research and issues involved. We survey the relevant techniques in use today for text searching and explore the gap between theoretical and practical algorithms. The main observation is that simpler ideas are better in practice. (Update)

Active bibliography (related documents):   More   All
0.9:   Fast and Simple Character Classes and Bounded Gaps Pattern.. - Navarro, Raffinot (2003)   (Correct)
0.8:   Extending LEDA to Secondary Memory - Andreas Crauser And (1999)   (Correct)
0.8:   Regular Expression Searching on Compressed Text - Navarro   (Correct)

Similar documents based on text:   More   All
0.4:   Fast Searching on Compressed Text Allowing Errors - de Moura, Navarro, al. (1998)   (Correct)
0.3:   A Model and a Visual Query Language for Structured Text - Baeza-Yates, Navarro.. (1998)   (Correct)
0.3:   Unbalancing: the Key to Index High Dimensional Metric Spaces - Chávez, Navarro   (Correct)

BibTeX entry:   (Update)

@misc{ baeza-yates-text,
  author = "Ricardo A. Baeza-Yates and Gonzalo Navarro",
  title = "Text Searching: Theory and Practice",
  url = "citeseer.ist.psu.edu/605426.html" }
Citations (may not include all citations):
866   Techniques and Tools (context) - Aho, Sethi et al. - 1986
372   Modern Information Retrieval - Baeza-Yates, Ribeiro-Neto - 1999
347   Fast pattern matching in strings (context) - Knuth, Morris et al. - 1977
243   Information Retrieval: Data Structures and Algorithms (context) - Frakes, Baeza-Yates - 1992
214   A fast string searching algorithm (context) - Boyer, Moore - 1977
204   Oxford University Press (context) - Crochemore, Rytter - 1994
196   Fast text searching allowing errors (context) - Wu, Manber - 1992
158   Linear pattern matching algorithm (context) - Weiner - 1973
148   The theory and computation of evolutionary distances: Patter.. (context) - Sellers - 1980
114   Finding approximate patterns in strings (context) - Ukkonen - 1985
86   A guided tour to approximate string matching - Navarro - 2001
73   A space-economical sux tree construction algorithm (context) - McCreight - 1976
71   An improved algorithm for approximate string matching (context) - Galil, Park - 1990
59   Speeding up two string matching algorithms (context) - Crochemore, Czumaj et al. - 1994
57   Regular expression search algorithm (context) - Thompson - 1968
56   The myriad virtues of subword trees (context) - Apostolico - 1985
56   A very fast substring search algorithm (context) - Sunday - 1990
55   A fast bit-vector algorithm for approximate string matching .. - Myers - 1999
53   Generalized string matching (context) - Abrahamson - 1987
52   Cambridge University Press (context) - eld, Strings et al. - 1997
47   New indices for text: Pat trees and pat arrays (context) - Gonnet, Baeza-Yates et al. - 1992
45   Approximate matching of regular expressions (context) - Myers, Miller - 1989
38   Information Retrieval: Computational and Theoretical Aspects (context) - Heaps - 1978
36   Van Nostrand Reinhold (context) - Witten, Mo et al. - 1999
35   String matching and other products (context) - Fischer, Paterson - 1974
35   Faster approximate string matching (context) - Baeza-Yates, Navarro - 1999
34   Software Practice and Experience (context) - Horspool, searching - 1980
32   A new approach to text searching (context) - Baeza-Yates, Gonnet - 1989
29   Ecient string matching: an aid to bibliographic search (context) - Aho, Corasick - 1975
29   The complexity of pattern matching for a random string (context) - Yao - 1979
29   An algorithm for string matching with a sequence of don't ca.. (context) - Manber, Baeza-Yates - 1991
26   Sux arrays: a new method for on-line string searches (context) - Manber, Myers - 1993
26   samples in approximate string matching (context) - Sutinen, Tarhio et al. - 1996
25   From ukkonen to mccreight and weiner: A unifying view of lin.. - Giegerich, Kurtz - 1997
24   Adding compression to block addressing inverted indexes - Navarro, Moura et al. - 2000
23   Tight bounds on the complexity of the Boyer-Moore string mat.. (context) - Cole - 1991
22   A four russians algorithm for regular expression pattern mat.. (context) - Myers - 1992
21   Flexible Pattern Matching in Strings { Practical on-line sea.. (context) - Navarro, Ranot - 2002
17   A hybrid indexing method for approximate string matching - Navarro, Baeza-Yates - 2000
14   A string matching algorithm fast on the average (context) - Commentz-Walter - 1979
13   Indexing text with approximate q-grams - Navarro, Sutinen et al. - 2000
13   A fast algorithm for multi-pattern searching - Wu, Manber - 1994
13   Fast text searching for regular expressions or automaton sea.. (context) - Baeza-Yates, Gonnet - 1996
13   The exact complexity of string matching (context) - Colussi, Galil et al. - 1990
11   A subquadratic algorithm for approximate regular expression .. (context) - Wu, Manber et al. - 1995
10   Direct construction of compact directed acyclic word graphs - Crochemore, erin - 1997
10   Approximate string matching over sux trees - Ukkonen - 1993
9   Fast and exible string matching by combining bit-parallelism.. (context) - Navarro, Ranot - 2000
9   Handbook of Algorithms and Data Structures { In Pascal and C (context) - Gonnet, Baeza-Yates - 1991
9   gram technique for automatic correction of substitution (context) - Ullman, n- - 1977
8   Simple linear work sux array construction - Karkkainen, Sanders - 2003
8   Linear-time construction of sux arrays (context) - Kim, Sim et al. - 2003
8   Nr-grep: a fast and exible pattern matching tool (context) - Navarro - 2001
7   Average-optimal multiple approximate string matching - Fredriksson, Navarro - 2003
7   Fast regular expression search - Navarro, Ranot - 1999
7   Glimpse: A tool to search through entire le systems (context) - Manber, Wu - 1994
6   Block-addressing indices for approximate text retrieval (context) - Baeza-Yates, Navarro - 2000
6   a fast approximate pattern-matching tool (context) - Wu, Manber - 1992
6   Constructing sux trees on-line in linear time (context) - Ukkonen - 1992
6   Approximate string matching with local similarity (context) - Chang, Marr - 1994
5   Complete inverted les for ecient text retrieval and analysis (context) - Blumer, Blumer et al. - 1987
5   Fast and exible word searching on compressed text (context) - Moura, Navarro et al. - 2000
4   Fast string matching using an n-gram algorithm - Kim, Shawe-Taylor - 1991
4   Approximate regular expression searching with arbitrary inte.. - Navarro - 2002
3   Hierarchies of indices for text searching (context) - Baeza-Yates, Barbosa et al. - 1996
2   Software Practice and Experience (context) - Tarhio, Peltola et al. - 1997
2   Factor oracle of a set of words (context) - Allauzen, Ranot - 1999
2   On constructing sux arrays in external memory (context) - Crauser, Ferragina - 2002
2   Ecient implementation of lazy sux trees (context) - Giegerich, Kurtz et al. - 1999
2   Space ecient linear time construction of sux arrays (context) - Ko, Aluru - 2003
2   Faster bit-parallel approximate string matching - Hyyr, Navarro - 2002
1   An ecient text searching system (context) - Gonnet - 1987
1   Software-Practice and Experience (context) - Baeza-Yates, searching - 1989
1   Ecient string matching with don't-care patterns (context) - Pinter - 1985
1   Ecient experimental string matching by weak factor recogniti.. (context) - Allauzen, Crochemore et al. - 2089
1   time string matching using only a xed number of local storag.. (context) - Galil, Seiferas - 1981

Documents on the same site (http://www.dcc.uchile.cl/~gnavarro/publ.html):   More
A More Precise Solution to Two Problems on Tries - Navarro, Poblete   (Correct)
Fast Approximate String Matching in a Dictionary - Baeza-Yates, Navarro (1998)   (Correct)
An Optimal Index for PAT Arrays - Navarro (1996)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC