DocumentCode
2752884
Title
A new general purpose compression method for searching in large collection
Author
Bhadade, U.S. ; Sharma, V.K. ; Trivedi, A.I.
Author_Institution
Maharashtra Acad. of Eng., Pune
fYear
2007
fDate
Oct. 30 2007-Nov. 2 2007
Firstpage
1
Lastpage
4
Abstract
Reduction of both compression ratio and retrieval of data from large collection is important in any computation requiring human interface. In this paper, a new algorithm is proposed for general purpose compression scheme that can be applied to all types of data storage in large collections. The paper presents a fast compression and decompression technique for natural language texts. The technique used in compressing text allows searching phrase in the compressed form without decompressing the compressed file. The algorithm suggested here uses static dictionary in matrix form, which reduces the compression and decompression time. The memory requirement of the algorithm is almost negligible.
Keywords
data compression; information retrieval; natural language processing; compression ratio; data retrieval; data storage; fast compression; fast decompression; general purpose compression method; human interface; large collection; matrix form; natural language texts; static dictionary; text compression; Costs; Databases; Decoding; Encoding; Information retrieval; Internet; Microcomputers; Pattern matching; Reservoirs; Space technology;
fLanguage
English
Publisher
ieee
Conference_Titel
TENCON 2007 - 2007 IEEE Region 10 Conference
Conference_Location
Taipei
Print_ISBN
978-1-4244-1272-3
Electronic_ISBN
978-1-4244-1272-3
Type
conf
DOI
10.1109/TENCON.2007.4428935
Filename
4428935
Link To Document