Title :
Observations on Compressing Text Files of Varying Length
Author :
Btoush, Mohammad Hjouj ; Siddiqi, Jawed ; Akhgar, Babak ; Dawahdeh, Ziad
Author_Institution :
Sheffield Hallam Univ., Sheffield
Abstract :
This paper compares four data compression algorithms for text files: LZW, Huffman, fixed-length code (FLC), and Huffman applied after fixed-length code (HFLC). We compare these algorithms on text files of different sizes using four compression measures: size, ratio, time (speed), and entropy. Our evaluation reveals that for smaller files the simplest algorithm, LZW, performs worse than the more complex Huffman algorithm on the first two measures (size and ratio), but as the size of the text increases the position is reversed. For the time and entropy measures, LZW performs better than Huffman on smaller files, but for larger files the position is once again reversed.
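Two of the measures named in the abstract, entropy and ratio, can be illustrated with a minimal Python sketch. The abstract does not give the paper's exact definitions, so everything here is an assumption: entropy is taken as zeroth-order Shannon entropy in bits per symbol, the fixed-length code is assumed to use ceil(log2(k)) bits for k distinct symbols, ratio is assumed to be compressed size divided by original size with 8 bits per original character, and all function names are hypothetical.

```python
import math
from collections import Counter


def shannon_entropy(text: str) -> float:
    """Zeroth-order Shannon entropy of the text, in bits per symbol."""
    if not text:
        return 0.0
    n = len(text)
    counts = Counter(text)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def flc_bits_per_symbol(text: str) -> int:
    """Bits per symbol for a fixed-length code over the distinct symbols.

    Assumption: at least 1 bit is used even for a one-symbol alphabet.
    """
    k = len(set(text))
    if k == 0:
        return 0
    return max(1, math.ceil(math.log2(k)))


def flc_compression_ratio(text: str, original_bits_per_symbol: int = 8) -> float:
    """Compressed size / original size for the fixed-length code.

    Assumption: the original file stores each character in 8 bits and
    the cost of storing the code table is ignored.
    """
    if not text:
        return 1.0
    return flc_bits_per_symbol(text) / original_bits_per_symbol


if __name__ == "__main__":
    sample = "abracadabra"
    print("entropy (bits/symbol):", shannon_entropy(sample))
    print("FLC bits/symbol:", flc_bits_per_symbol(sample))
    print("FLC ratio:", flc_compression_ratio(sample))
```

Under these assumptions, the entropy is a lower bound on the average bits per symbol that any symbol-by-symbol code such as Huffman or FLC can reach, which is one way to read the size and ratio results against the entropy measure.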
Keywords :
data compression; text analysis; data compression algorithms; fixed-length code; text files; Binary trees; Compression algorithms; Compressors; Data compression; Encoding; Entropy; Image coding; Image storage; Probability; Video compression; Data Compression; Huffman Coding; LZW; Text size;
Conference_Title :
Information Technology: New Generations, 2008. ITNG 2008. Fifth International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
0-7695-3099-0
DOI :
10.1109/ITNG.2008.61