DocumentCode
147066
Title
Improving Compression via Substring Enumeration by Explicit Phase Awareness
Author
Beliveau, Mathieu ; Dube, Danny
Author_Institution
Univ. Laval, Quebec City, QC, Canada
fYear
2014
fDate
26-28 March 2014
Firstpage
399
Lastpage
399
Abstract
Compression by Substring Enumeration (CSE) is a recent and promising lossless compression scheme. The first experiments on CSE showed that it yields compression ratios that favorably compare to other lossless compression techniques. However, the experiments also showed that it tends to incur a performance loss on non-textual, byte-oriented sources and it was conjectured that CSE´s phase unawareness was responsible for this loss of performance. Subsequent work confirmed the conjecture by obtaining improved compression ratios when synchronization codes get inserted in the data source, indirectly solving the phase-unawareness problem. This indirect solution does not give an absolute measure of the loss incurred by the phase unawareness problem. This paper presents a modified CSE algorithm that is made explicitly phase aware. It compares the synchronization-code approach to the explicitly phase-aware approach and shows that, in the end, the approach based on synchronization codes is almost as good as the phase-aware approach.
Keywords
data compression; synchronisation; CSE phase unawareness; compression by substring enumeration; data source; explicit phase awareness; improved compression ratios; lossless compression scheme; modified CSE algorithm; nontextual byte-oriented sources; performance loss; phase-unawareness problem; synchronization-code approach; Data compression; Loss measurement; Phase measurement; Synchronization; CSE; lossless compression; substring enumeration; synchronization codes;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Compression Conference (DCC), 2014
Conference_Location
Snowbird, UT
ISSN
1068-0314
Type
conf
DOI
10.1109/DCC.2014.68
Filename
6824451
Link To Document