DocumentCode :
3472739
Title :
New Algorithms For Random Access Text Compression
Author :
Robert, L. ; Nadarajan, R.
Author_Institution :
Dept. of Comput. Sci., Gov. Arts Coll., Coimbatore
fYear :
2006
fDate :
10-12 April 2006
Firstpage :
104
Lastpage :
111
Abstract :
Random access text compression is a type of compression technique in which there is a direct access to the compressed data. It facilitates to start decompression from any place in the compressed file, not necessarily from first. If any byte changed during transmission, the remaining data can be retrieved safely. In this paper, a try has been made to develop few algorithms for random access text compression based on the byte pair encoding scheme (Gage, 1997). The BPE algorithm relies on the fact that ASCII character set uses only codes from 0 through 127. That frees up codes from 128 through 255 for use as pair codes. Pair code is a byte, used to replace the most frequently appearing pair of bytes in the text file. Five algorithms are developed based on this byte pair encoding scheme. These algorithms finds the unused bytes at each level and tries to use those bytes for replacing the most frequently used bytes
Keywords :
data compression; encoding; ASCII character set; byte pair encoding scheme; data transmission; pair codes; random access text compression; Art; Computer science; Decoding; Dictionaries; Educational institutions; Encoding; Frequency; Government; Information retrieval; Mathematics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Technology: New Generations, 2006. ITNG 2006. Third International Conference on
Conference_Location :
Las Vegas, NV
Print_ISBN :
0-7695-2497-4
Type :
conf
DOI :
10.1109/ITNG.2006.98
Filename :
1611578
Link To Document :
بازگشت