Title :
Dictionary-based English text compression using word endings
Author :
Yang, Jeehong ; Savari, Serap A.
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Michigan Univ., Ann Arbor, MI
Abstract :
In this paper, we propose an dictionary-based English text compression algorithm; we name it star word ending (StarWE) because of certain similarities to the StarNT algorithm. Furthermore, StarWE borrows techniques that are used in WRT such as EOL coding, punctuation mark modeling, and n-gram matching. The main difference between StarWE and StarNT is in the division of the external dictionary; StarWE divides it by word endings so that the compressor would be able to obtain some of the tag information
Keywords :
data compression; dictionaries; text analysis; word processing; StarNT algorithm; StarWE; dictionary-based English text compression; star word ending; tag information; Compression algorithms; Data compression; Dictionaries; Frequency;
Conference_Titel :
Data Compression Conference, 2007. DCC '07
Conference_Location :
Snowbird, UT
Print_ISBN :
0-7695-2791-4
DOI :
10.1109/DCC.2007.31