DocumentCode :
562583
Title :
Inverted index compression using Extended Golomb Code
Author :
Glory, V. ; Domnic, S.
Author_Institution :
Dept. of Comput. Applic., Nat. Inst. of Technol., Tiruchirappalli, India
fYear :
2012
fDate :
30-31 March 2012
Firstpage :
20
Lastpage :
25
Abstract :
Web Search Engines use inverted index structures for efficient query processing. But the size of the inverted index is extremely large due to rapid growth in the size of the text data in the web. In order to reduce the index size and increase the accessing speed, compression techniques are used. In this paper, we make use of a new integer compression technique, Extended Golomb Code (EGC), to reduce the size of the inverted index. We have tested the performance of EGC with other existing techniques. Experimental results show that EGC performs better than other existing techniques in compressing inverted index.
Keywords :
Internet; data compression; indexing; query processing; search engines; text analysis; EGC; Web search engine; accessing speed; extended Golomb code; index size; information retrieval system; integer compression technique; inverted index compression; inverted index structure; query processing; text data size; Indexes; D-Gap; Information Retrieval System; Inverted File; Inverted Index Compression; Search Engines;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advances in Engineering, Science and Management (ICAESM), 2012 International Conference on
Conference_Location :
Nagapattinam, Tamil Nadu
Print_ISBN :
978-1-4673-0213-5
Type :
conf
Filename :
6215567
Link To Document :
بازگشت