next up previous
Next: Rate-Distortion Optimizations for Motion Up: DATA COMPRESSION Previous: Explicit Bit Minimization for

Dictionary Selection using Partial Matching

D. T. Hoang, P. M. Long and J. S. Vitter. ``Dictionary Selection using Partial Matching,'' Information Sciences, 119(1-2), 57-72, 1999. An earlier version appeared in ``Multiple-Dictionary Compression using Partial Matching,'' Proceedings of the 1995 IEEE Data Compression Conference (DCC '95), Snowbird, UT, March 1995, 272-281.

Full text (Adobe pdf format)

Motivated by the desire to find text compressors that compress better than existing dictionary methods, but run faster than PPM implementations, we describe methods for text compression using multiple dictionaries, one for each context of preceding characters, where the contexts have varying lengths. The context to be used is determined using an escape mechanism similar to that of PPM methods. We describe modifications of three popular dictionary coders along these lines and experiments evaluating their efficacy using the text files in the Calgary corpus. Our results suggest that modifying LZ77 along these lines yields an improvement in compression of about 4%, that modifying LZFG yields a compression improvement of about 8%, and that modifying LZW in this manner yields an average improvement on the order of 12%.

Keywords: Text compression, prediction by partial matching, Ziv-Lempel coding, Lempel-Ziv coding, LZ coding.


next up previous
Next: Rate-Distortion Optimizations for Motion Up: DATA COMPRESSION Previous: Explicit Bit Minimization for
Jeff Vitter
2009-10-31