• DocumentCode
    2752884
  • Title

    A new general purpose compression method for searching in large collection

  • Author

    Bhadade, U.S. ; Sharma, V.K. ; Trivedi, A.I.

  • Author_Institution
    Maharashtra Acad. of Eng., Pune
  • fYear
    2007
  • fDate
    Oct. 30 2007-Nov. 2 2007
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Reduction of both compression ratio and retrieval of data from large collection is important in any computation requiring human interface. In this paper, a new algorithm is proposed for general purpose compression scheme that can be applied to all types of data storage in large collections. The paper presents a fast compression and decompression technique for natural language texts. The technique used in compressing text allows searching phrase in the compressed form without decompressing the compressed file. The algorithm suggested here uses static dictionary in matrix form, which reduces the compression and decompression time. The memory requirement of the algorithm is almost negligible.
  • Keywords
    data compression; information retrieval; natural language processing; compression ratio; data retrieval; data storage; fast compression; fast decompression; general purpose compression method; human interface; large collection; matrix form; natural language texts; static dictionary; text compression; Costs; Databases; Decoding; Encoding; Information retrieval; Internet; Microcomputers; Pattern matching; Reservoirs; Space technology;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    TENCON 2007 - 2007 IEEE Region 10 Conference
  • Conference_Location
    Taipei
  • Print_ISBN
    978-1-4244-1272-3
  • Electronic_ISBN
    978-1-4244-1272-3
  • Type

    conf

  • DOI
    10.1109/TENCON.2007.4428935
  • Filename
    4428935