DocumentCode :
2752884
Title :
A new general purpose compression method for searching in large collection
Author :
Bhadade, U.S. ; Sharma, V.K. ; Trivedi, A.I.
Author_Institution :
Maharashtra Acad. of Eng., Pune
fYear :
2007
fDate :
Oct. 30 2007-Nov. 2 2007
Firstpage :
1
Lastpage :
4
Abstract :
Reducing both the compression ratio and the time needed to retrieve data from large collections is important in any computation requiring a human interface. This paper proposes a new algorithm for a general-purpose compression scheme that can be applied to all types of data storage in large collections. It presents a fast compression and decompression technique for natural language texts. The technique allows a phrase to be searched for directly in the compressed form, without decompressing the compressed file. The algorithm uses a static dictionary in matrix form, which reduces compression and decompression time. Its memory requirement is almost negligible.
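The abstract does not describe the dictionary layout or the code format, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes a tiny hypothetical word list (`WORDS`), a hypothetical matrix width (`COLS`), and a two-byte code per word (row, then column). Under those assumptions, a phrase can be located in the compressed bytes by compressing the phrase itself and matching it on code boundaries.

```python
# Illustrative sketch only. The word list, matrix width, and two-byte code
# layout are assumptions made for this example, not details from the paper.
WORDS = ["the", "quick", "brown", "fox", "jumps", "over", "lazy", "dog"]
COLS = 4  # hypothetical matrix width

# Static dictionary laid out as a matrix: MATRIX[row][col] -> word.
MATRIX = [WORDS[i:i + COLS] for i in range(0, len(WORDS), COLS)]

# Reverse lookup: word -> (row, col).
POSITION = {
    word: (r, c)
    for r, row in enumerate(MATRIX)
    for c, word in enumerate(row)
}


def compress(text: str) -> bytes:
    """Encode each word as two bytes: its row and column in the matrix."""
    out = bytearray()
    for word in text.lower().split():
        row, col = POSITION[word]  # KeyError for words outside the dictionary
        out += bytes((row, col))
    return bytes(out)


def decompress(data: bytes) -> str:
    """Map each (row, col) byte pair back to its word."""
    return " ".join(
        MATRIX[data[i]][data[i + 1]] for i in range(0, len(data), 2)
    )


def find_phrase(data: bytes, phrase: str) -> int:
    """Return the word index of the first match of phrase, or -1.

    The search runs on the compressed bytes; nothing is decompressed.
    """
    needle = compress(phrase)
    start = data.find(needle)
    while start != -1:
        if start % 2 == 0:  # accept only matches aligned on a code boundary
            return start // 2
        start = data.find(needle, start + 1)
    return -1
```

With this toy dictionary, `find_phrase(compress("the quick brown fox"), "brown fox")` returns the word index of the match. The alignment check matters because a byte-level match that starts in the middle of a two-byte code would pair the column of one word with the row of the next.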
Keywords :
data compression; information retrieval; natural language processing; compression ratio; data retrieval; data storage; fast compression; fast decompression; general purpose compression method; human interface; large collection; matrix form; natural language texts; static dictionary; text compression; Costs; Databases; Decoding; Encoding; Information retrieval; Internet; Microcomputers; Pattern matching; Reservoirs; Space technology;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
TENCON 2007 - 2007 IEEE Region 10 Conference
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-1272-3
Electronic_ISBN :
978-1-4244-1272-3
Type :
conf
DOI :
10.1109/TENCON.2007.4428935
Filename :
4428935