• DocumentCode
    472466
  • Title

    Design and Compressed Storage of Inverted Index Based on Mixed Word Segmentation

  • Author

    Luo Jin ; Zu Xiao-Fang ; Liu Shu-Liang ; Zhang Fa-Yong ; Wu Xin-cai

  • Author_Institution
    China Univ. of Geosci., Beijing
  • fYear
    2008
  • fDate
    23-24 Jan. 2008
  • Firstpage
    334
  • Lastpage
    338
  • Abstract
    Speed and accuracy are the two key factors for the public-oriented integrated information service platform. This paper discusses a mixed segmentation based on the inverted index and compression algorithm. By combining database and file storage, we achieved index storage, which can support multi-table & multi-field simultaneous inquiry effectively. The design think of search engine interface based on such index storage mode has been introduced in detail at last in the paper. According to the above, search engine we designed inquiries faster than commonly used SQL statements several times faster. I am sure that will make more practical value for those who are interested in it.
  • Keywords
    data compression; file organisation; indexing; information services; search engines; word processing; file storage; inverted index compression storage; inverted index design; mixed word segmentation; multifield simultaneous inquiry; multitable simultaneous inquiry; public oriented integrated information service; search engine interface; Cities and towns; Data mining; Databases; Geographic Information Systems; Indexes; Indexing; Information retrieval; Search engines; Search methods; Statistics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Knowledge Discovery and Data Mining, 2008. WKDD 2008. First International Workshop on
  • Conference_Location
    Adelaide, SA
  • Print_ISBN
    978-0-7695-3090-1
  • Type

    conf

  • DOI
    10.1109/WKDD.2008.137
  • Filename
    4470407