Adding compression in text retrieval systems
Abstract: In this article we discuss recent methods for compressing the text and the index of text retrieval systems. By compressing both the complete text and the index, the total amount of space is less than half the size of the original text alone. Most surprisingly, the time required to build the index and also to answer a query is much less than if the index and text had not been compressed. This is one of the few cases where there is no space-time trade-off. Moreover, the text can be kept... (Update)
Cited by: More
Compressing Distributed Text in Parallel with (s.. - Bonacic, Farina..
(Correct)
Applying the Contexts Model in Semistructured Text Databases - Adiego, Navarro, Fuente
(Correct)
Merging Prediction by Partial Matching with Structural.. - Adiego, Fuente, Navarro (2004)
(Correct)
Similar documents (at the sentence level):
10.6%: Adding Compression to Block Addressing Inverted Indices - Navarro, de Moura.. (2000)
(Correct)
6.4%: Fast and Flexible Word Searching on Compressed Text - de Moura, Navarro.. (2000)
(Correct)
5.3%: Adding Compression to Block Addressing Inverted Indexes - Navarro, de Moura.. (2000)
(Correct)
Active bibliography (related documents): More All
0.2: NR-grep: A Fast and Flexible Pattern Matching Tool - Navarro (2000)
(Correct)
0.1: Improving Web Search Efficiency via a Locality.. - de Moura, Santos, .. (2005)
(Correct)
0.1: Boyer-Moore String Matching over Ziv-Lempel Compressed Text - Navarro, Tarhio (2000)
(Correct)
Similar documents based on text: More All
1.5: Direct Pattern Matching on Compressed Text - de Moura, Navarro, Ziviani (1998)
(Correct)
0.4: An Efficient Compression Code for Text Databases - Brisaboa, Iglesias, Navarro, ..
(Correct)
0.3: Local Versus Global Link Information - In The Web (2003)
(Correct)
Related documents from co-citation: More All
8: Adding compression to block addressing inverted indexes
- Navarro, Moura et al. - 2000
7: A Universal Algorithm for Sequential Data Compression
- Ziv, Lempel
6: Software Practice and Experience (context) - Mo, Eddy et al. - 1996
BibTeX entry: (Update)
Nivio Ziviani, Edleno Silva de Moura, Gonzalo Navarro, and Ricardo Baeza-Yates. Compression: A key for next-generation text retrieval systems. IEEE Computer, 33(11):37--44, November 2000. 50 http://citeseer.ist.psu.edu/ziviani00compression.html More
@article{ ziviani00compression,
author = "Nivio Ziviani and Edleno Silva de Moura and Gonzalo Navarro and Ricardo Baeza-Yates",
title = "Compression: {A} Key for Next-Generation Text Retrieval Systems",
journal = "IEEE Computer",
volume = "33",
number = "11",
pages = "37--44",
year = "2000",
url = "citeseer.ist.psu.edu/ziviani00compression.html" }
Citations (may not include all citations):
1575
Computer Architecture: A Quantitative Approach (context) - Patterson, Hennessy - 1995
338
A method for the construction of minimum-redundancy codes (context) - Huffman - 1952
196
Fast text searching allowing errors (context) - Wu, Manber - 1992
150
Accessibility of information on the web (context) - Lawrence, Giles - 1999 - http://www.wwwmetrics.com/
134
Modern Information Retrieval (context) - Baeza-Yates, Ribeiro-Neto - 1999
85
Overview of the third text retrieval conference (context) - Harman - 1995
81
A new approach to text searching (context) - Baeza-Yates, Gonnet - 1992
64
Managing Gigabytes (context) - Witten, Moffat et al. - 1999
24
Adding compression to block addressing inverted indexes
- Navarro, Moura et al. - 2000
24
Faster approximate string matching
- Baeza-Yates, Navarro - 1999
23
Document filtering for fast ranking (context) - Persin - 1994
9
Fast and flexible word searching on compressed text (context) - Moura, Navarro et al. - 2000
Documents on the same site (http://www.dcc.uchile.cl/~gnavarro/publ.html): More
A More Precise Solution to Two Problems on Tries - Navarro, Poblete
(Correct)
Fast Approximate String Matching in a Dictionary - Baeza-Yates, Navarro (1998)
(Correct)
An Optimal Index for PAT Arrays - Navarro (1996)
(Correct)
Online articles have much greater impact More about CiteSeer.IST Add search form to your site Submit documents Feedback
CiteSeer.IST - Copyright Penn State and NEC