Title :
Design and Compressed Storage of Inverted Index Based on Mixed Word Segmentation
Author :
Luo Jin ; Zu Xiao-Fang ; Liu Shu-Liang ; Zhang Fa-Yong ; Wu Xin-cai
Author_Institution :
China Univ. of Geosci., Beijing
Abstract :
Speed and accuracy are the two key factors for the public-oriented integrated information service platform. This paper discusses a mixed segmentation based on the inverted index and compression algorithm. By combining database and file storage, we achieved index storage, which can support multi-table & multi-field simultaneous inquiry effectively. The design think of search engine interface based on such index storage mode has been introduced in detail at last in the paper. According to the above, search engine we designed inquiries faster than commonly used SQL statements several times faster. I am sure that will make more practical value for those who are interested in it.
Keywords :
data compression; file organisation; indexing; information services; search engines; word processing; file storage; inverted index compression storage; inverted index design; mixed word segmentation; multifield simultaneous inquiry; multitable simultaneous inquiry; public oriented integrated information service; search engine interface; Cities and towns; Data mining; Databases; Geographic Information Systems; Indexes; Indexing; Information retrieval; Search engines; Search methods; Statistics;
Conference_Titel :
Knowledge Discovery and Data Mining, 2008. WKDD 2008. First International Workshop on
Conference_Location :
Adelaide, SA
Print_ISBN :
978-0-7695-3090-1
DOI :
10.1109/WKDD.2008.137