Title :
ETAO: Symbol mapping transformation method for text compression
Author :
Baloul, Fadlelmoula Mohamed ; Abdullah, Mohsin Hassan ; Babikir, Elsadig Ahmed
Author_Institution :
Dept. of Inf. Technol., Colleges of Appl. Sci., Sohar, Oman
Abstract :
This paper is proposing a novel idea for text transformation based on mapping single letters form the standard alphabetical order into the same set of single letters reordered by their relative frequencies. This method can be used as a complementary algorithm to enhance the statistical compression techniques. We have designed and implemented an algorithm called ETAO transformation method. It has been found that the Average Code Length (ACL) could be reduced with amount of about 5%, when using Huffman or Arithmetic encoding techniques as backend.
Keywords :
Huffman codes; arithmetic codes; data compression; statistical analysis; symbol manipulation; text analysis; word processing; ETAO transformation method; Huffman encoding techniques; arithmetic encoding techniques; average code length; single letters mapping; statistical compression techniques; symbol mapping transformation method; text compression; Algorithm design and analysis; Channel coding; Data compression; Dictionaries; Entropy; Transforms; Average Code Length; ETAO; Text Compression; Text Transformation; Word Length and Position-Based Relative Frequencies;
Conference_Titel :
Computer Research and Development (ICCRD), 2011 3rd International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-61284-839-6
DOI :
10.1109/ICCRD.2011.5764263