Markus Mauer, Timo Beller and Enno Ohlebusch
A Lempel-Ziv-style Compression Method for Repetitive Texts
Abstract:
In this paper, we present a compression algorithm that is based on finding repetitions in the file to be compressed. Our approach is a variant of longest-first-substitution compression that uses the suffix array and the LCP-array to find and encode long recurring substrings. We will show that our algorithm achieves very good compression ratios for repetitive texts.
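The abstract mentions using the suffix array and the LCP-array to find long recurring substrings. The following is a minimal sketch of that general idea only, not the authors' algorithm: it builds a suffix array naively, computes the LCP array with the linear-time method of Kasai et al., and reports the longest substring that occurs at least twice. The function names `suffix_array`, `lcp_array`, and `longest_repeat` are hypothetical and chosen for illustration; how the paper selects, substitutes, and encodes repeats is not shown here.

```python
def suffix_array(text):
    """Return the start positions of all suffixes in lexicographic order.

    Naive construction by sorting suffix slices; fine for a short
    illustration, too slow for large inputs.
    """
    return sorted(range(len(text)), key=lambda i: text[i:])


def lcp_array(text, sa):
    """Return lcp, where lcp[k] is the length of the longest common prefix
    of the suffixes starting at sa[k-1] and sa[k], and lcp[0] is 0.

    Uses the approach of Kasai et al.: walk the suffixes in text order and
    reuse the previous match length minus one.
    """
    n = len(text)
    rank = [0] * n
    for k, start in enumerate(sa):
        rank[start] = k
    lcp = [0] * n
    h = 0
    for i in range(n):
        if rank[i] > 0:
            j = sa[rank[i] - 1]  # suffix preceding i in sorted order
            while i + h < n and j + h < n and text[i + h] == text[j + h]:
                h += 1
            lcp[rank[i]] = h
            if h > 0:
                h -= 1
        else:
            h = 0
    return lcp


def longest_repeat(text):
    """Return (substring, positions) for a longest substring occurring at
    least twice, or ("", []) if no character repeats.

    Assumption for this sketch: occurrences may overlap; no overlap check
    is performed.
    """
    if not text:
        return "", []
    sa = suffix_array(text)
    lcp = lcp_array(text, sa)
    k = max(range(len(text)), key=lambda idx: lcp[idx])
    length = lcp[k]
    if length == 0:
        return "", []
    start = sa[k]
    return text[start:start + length], sorted((sa[k - 1], sa[k]))


if __name__ == "__main__":
    print(longest_repeat("banana"))  # ('ana', [1, 3])
```

The maximum entry of the LCP array marks two adjacent suffixes in sorted order that share the longest prefix, which is why a single scan of the array is enough to locate a longest repeat. In the `banana` example the two occurrences overlap, which this sketch does not try to resolve.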