See this document in CiteSeerX!

Word-based Compression Methods with Empty Words and Nonwords for Text Retrieval Systems  (Make Corrections)  
Jiri Dvorsky, Jaroslav Pokorny, Vaclav Snasel



  Home/Search   Context   Related

 
View or download:
alpha.inf.upol.cz/~dvorsky...DATASEM.PS
Cached:  PS.gz  PS  PDF   Image  Update  Help

From:  alpha.inf.upol.cz/~dvorsky/ (more)
(Enter author homepages)

Rate this article: (best)
  Comment on this article  
(Enter summary)

Abstract: : In this article we present a new compression method, called WLZW, which is a word-based modification of classic LZW. The modification is similar to the approach used in the HuffWord compression algorithm. Due to special using WLZW in text databases, some its features seem to be preferable in comparing to similar previous approaches. The algorithm is two-phase, it uses only one table for words and non-words, and a single data structure for lexicon is usable as text index. The compression ratio ... (Update)

Active bibliography (related documents):   More   All
2.6:   Word-based Compression Methods and Indexing for Text.. - Dvorsk, Pokorný, Snášel (1999)   (Correct)
0.7:   A Fast Block-sorting Algorithm for lossless Data Compression - Schindler (1996)   (Correct)
0.2:   A scalable architecture for XML retrieval - Gabriella Kazai Thomas (2003)   (Correct)

Similar documents based on text:   More   All
0.5:   Text Compression with Random Access - Dvorsky   (Correct)
0.3:   Improving LZW - Horspool (1991)   (Correct)
0.2:   Multi-dimensional Sparse Matrix Storage - Dvorsky, Kratky   (Correct)

BibTeX entry:   (Update)

@misc{ dvorsky-wordbased,
  author = "Jiri Dvorsky and Jaroslav Pokorny and Vaclav Snasel",
  title = "Word-based Compression Methods with Empty Words and Nonwords for Text Retrieval
    Systems",
  url = "citeseer.ist.psu.edu/399171.html" }
Citations (may not include all citations):
228   A Technique for High-Performance Data Compression (context) - Welch - 1984
121   Handbook of Algorithms and Data Structures (context) - Gonnet, Beaza-Yates - 1991
34   Data Compression in Full-Text Retrieval Systems (context) - Bell - 1993
8   Springer Verlag (context) - Salomon, Compression - 1998
6   Lempel: An universal algorithm for sequential data compressi.. (context) - Ziv - 1977
5   Bell: Managing Gigabytes: Compressing and Indexing Documents.. (context) - Witten, Moffat - 1994
4   Data Structures & Algorithms (context) - Frakes, Ed et al. - 1992
3   Lempel: Compression of individual sequences via variable-rat.. (context) - Ziv - 1978
2   Cormack: Construction Word-based Text Compression Algorithms (context) - Horspool - 1992
2   el: Compress methods for Text Databases (context) - sek, Krej et al.
2   Czech Technical University (context) - Melichar, Pokorn et al. - 1994
1   Palacky University (context) - Pirkl, compression et al. - 1998

Documents on the same site (http://alpha.inf.upol.cz/~dvorsky/):
Text Compression with Random Access - Dvorsky   (Correct)

Online articles have much greater impact   More about CiteSeer.IST   Add search form to your site   Submit documents   Feedback  

CiteSeer.IST - Copyright Penn State and NEC