See this document in CiteSeerX!

Adding Compression to Block Addressing Inverted Indexes (2000)  (Make Corrections)  (24 citations)
Gonzalo Navarro, Edleno Silva de Moura, Marden Neubert, Nivio Ziviani, Ricardo Baeza-Yates
Information Retrieval



  Home/Search   Context   Related

 
View or download:
dcc.uchile.cl/~gnavarr...kluwer00.ps.gz
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  dcc.uchile.cl/~gnavarro/publ (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: . Inverted index compression, block addressing and sequential search on compressed text are three techniques that have been separately developed for efficient, low-overhead text retrieval. Modern text compression techniques can reduce the text to less than 30% of its size and allow searching it directly and faster than the uncompressed text. Inverted index compression obtains significant reduction of their original size at the same processing speed. Block addressing makes the inverted lists... (Update)

Cited by:   More
Compressing Inverted Files - Trotman (2003)   (Correct)
Index Structures for Distributed Text Databases - Marin   (Correct)
Compressing Distributed Text in Parallel with (s.. - Bonacic, Farina..   (Correct)

Similar documents (at the sentence level):
63.2%:   Adding Compression to Block Addressing Inverted Indices - Navarro, de Moura.. (2000)   (Correct)
61.4%:   Adding Compression to Block Addressing Inverted Indexes - Navarro, de Moura.. (2000)   (Correct)

Active bibliography (related documents):   More   All
0.8:   Fast and Flexible Word Searching on Compressed Text - de Moura, Navarro.. (2000)   (Correct)
0.3:   A Public Digital Library based on Full-Text.. - Witten.. (1998)   (Correct)
0.3:   A Public Digital Library based on Full-text.. - Witten.. (1998)   (Correct)

Similar documents based on text:   More   All
1.4:   Direct Pattern Matching on Compressed Text - de Moura, Navarro, Ziviani (1998)   (Correct)
0.5:   Indexing Text using the Ziv-Lempel Trie - Navarro (2002)   (Correct)
0.4:   A Fast Distributed Suffix Array Generation Algorithm - Kitajima, Navarro   (Correct)

Related documents from co-citation:   More   All
11:   Managing Gigabytes (context) - Witten, Moffat - 1994
9:   Overview of the Third Text REtrieval Conference (context) - Harman - 1995
9:   Compression of individual sequences via variable-rate coding - Ziv, Lempel

BibTeX entry:   (Update)

G. Navarro, E. Moura, M. Neubert, N. Ziviani, and R. Baeza-Yates. Adding compression to block addressing inverted indexes. Kluwer Information Retrieval Journal, 3(1):49--77, 2000. http://citeseer.ist.psu.edu/navarro00adding.html   More

@article{ navarro00adding,
    author = "Gonzalo Navarro and Edleno Silva de Moura and Marden Neubert and Nivio Ziviani and Ricardo Baeza-Yates",
    title = "Adding Compression to Block Addressing Inverted Indexes",
    journal = "Information Retrieval",
    volume = "3",
    number = "1",
    publisher = "Kluwer Academic Publishers",
    pages = "49--77",
    year = "2000",
    url = "citeseer.ist.psu.edu/navarro00adding.html" }
Citations (may not include all citations):
338   A method for the construction of minimum-redundancy codes (context) - Huffman - 1952
196   Fast text searching allowing errors (context) - Wu, Manber - 1992
170   The Harvest Information Discovery and Access System - Bowman, Danzig et al. - 1994
137   Universal codeword sets and representations of the integers (context) - Elias - 1975
118   Glimpse: A tool to search through entire file systems - Manber, Wu - 1994
85   Overview of the Third Text REtrieval Conference (context) - Harman - 1995
81   A new approach to text searching (context) - Baeza-Yates, Gonnet - 1992
72   A locally adaptive data compression scheme (context) - Bentley, Sleator et al. - 1986
64   Managing Gigabytes (context) - Witten, Moffat et al. - 1999
60   Run-length encodings (context) - Golomb - 1966
38   Information Retrieval - Computational and Theoretical Aspect.. (context) - Heaps - 1978
38   Fast Incremental Indexing for Full-Text Information Retrieva.. - Brown, Callan et al. - 1994
34   Integrating contents and structure in text retrieval (context) - Baeza-Yates, Navarro - 1996
26   Inverted files (context) - Harman, Fox et al. - 1992
25   Large text searching allowing errors (context) - Ara'ujo, Navarro et al. - 1997
14   Word-based text compression (context) - Moffat - 1989
13   Text Compression for Dynamic Document Databases - Moffat, Zobel et al. - 1997
13   Fast Algorithms for Two Dimensional and Multiple Pattern Mat.. (context) - Baeza-Yates, R'egnier - 1990
10   Compression of Indexes with Full Positional Information in V.. (context) - Linoff, Stanfill - 1993
9   Block-Addressing Indices for Approximate Text Retrieval - Baeza-Yates, Navarro - 2000
8   Economical Inversion of Large Text Files (context) - Moffat - 1992
6   Scalable Text Retrieval for Large Digital Libraries - Hawking - 1997
5   A Model and a Visual Query Language for Structured Text (context) - Baeza-Yates, Navarro et al. - 1998
3   Another Distributed Searching Architecture for the Web (context) - Baeza-Yates - 2000
3   Linear time sorting of skewed distributions (context) - Moura, Navarro et al. - 1999
3   situ generation of compressed inverted files (context) - Moffat, Bell - 1995
2   Fast file search using text compression (context) - Turpin, Moffat - 1997
1   Efficient Structures for Phrase Querying (context) - Williams, Zobel et al. - 1999



The graph only includes citing articles where the year of publication is known.


Documents on the same site (http://www.dcc.uchile.cl/~gnavarro/publ.html):   More
A More Precise Solution to Two Problems on Tries - Navarro, Poblete   (Correct)
Fast Approximate String Matching in a Dictionary - Baeza-Yates, Navarro (1998)   (Correct)
An Optimal Index for PAT Arrays - Navarro (1996)   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC