Title :
Inverted index compression using Extended Golomb Code
Author :
Glory, V. ; Domnic, S.
Author_Institution :
Dept. of Comput. Applic., Nat. Inst. of Technol., Tiruchirappalli, India
Abstract :
Web search engines use inverted index structures for efficient query processing. However, the inverted index becomes extremely large because of the rapid growth of text data on the web. Compression techniques are used to reduce the index size and increase access speed. In this paper, we use a new integer compression technique, Extended Golomb Code (EGC), to reduce the size of the inverted index. We compare the performance of EGC with that of other existing techniques. Experimental results show that EGC compresses the inverted index better than the other existing techniques.
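To make the setting concrete, the sketch below shows how a posting list is usually prepared for integer compression: document IDs are converted to d-gaps (differences between consecutive IDs), and each gap is then coded. The abstract does not describe how EGC itself works, so the example uses the classic Golomb code as a stand-in and is not the authors' method. The function names, the parameter m = 3, the sample document IDs, and the bit conventions (unary written as 1s terminated by a 0, gaps starting at 1) are all assumptions made for illustration.

```python
def to_dgaps(doc_ids):
    """Turn a strictly increasing list of document IDs into d-gaps."""
    gaps, prev = [], 0
    for d in doc_ids:
        gaps.append(d - prev)
        prev = d
    return gaps


def golomb_encode(n, m):
    """Classic Golomb code (not EGC) for n >= 1 with parameter m >= 1.

    Assumed convention: n - 1 is split into quotient and remainder; the
    quotient is written in unary (q ones, then a zero) and the remainder
    in truncated binary.
    """
    q, r = divmod(n - 1, m)
    bits = "1" * q + "0"
    if m == 1:
        return bits                      # remainder is always 0
    b = (m - 1).bit_length()             # ceil(log2(m))
    cutoff = (1 << b) - m                # remainders below this use b-1 bits
    if r < cutoff:
        bits += format(r, "0{}b".format(b - 1))
    else:
        bits += format(r + cutoff, "0{}b".format(b))
    return bits


def golomb_decode(bits, m, pos=0):
    """Decode one Golomb codeword starting at pos; return (value, next_pos)."""
    q = 0
    while bits[pos] == "1":              # unary quotient
        q += 1
        pos += 1
    pos += 1                             # skip the terminating zero
    r = 0
    if m > 1:
        b = (m - 1).bit_length()
        cutoff = (1 << b) - m
        if b > 1:
            r = int(bits[pos:pos + b - 1], 2)
            pos += b - 1
        if r >= cutoff:                  # long codeword: read one more bit
            r = ((r << 1) | int(bits[pos])) - cutoff
            pos += 1
    return q * m + r + 1, pos


def compress_postings(doc_ids, m):
    """Concatenate the Golomb codes of all d-gaps into one bit string."""
    return "".join(golomb_encode(g, m) for g in to_dgaps(doc_ids))


def decompress_postings(bits, m):
    """Recover the document IDs by decoding gaps and summing them."""
    doc_ids, pos, current = [], 0, 0
    while pos < len(bits):
        gap, pos = golomb_decode(bits, m, pos)
        current += gap
        doc_ids.append(current)
    return doc_ids


if __name__ == "__main__":
    postings = [3, 7, 8, 15, 16]         # hypothetical posting list
    encoded = compress_postings(postings, 3)
    print(to_dgaps(postings), encoded, len(encoded))
    print(decompress_postings(encoded, 3))
```

Small gaps get short codewords, which is why d-gap conversion comes first: frequent terms have dense posting lists and therefore many small gaps. In practice m is chosen from the gap distribution of each list; the fixed value here is only for demonstration.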
Keywords :
Internet; data compression; indexing; query processing; search engines; text analysis; EGC; Web search engine; accessing speed; extended Golomb code; index size; information retrieval system; integer compression technique; inverted index compression; inverted index structure; text data size; Indexes; D-Gap; Information Retrieval System; Inverted File; Inverted Index Compression; Search Engines
Conference_Title :
Advances in Engineering, Science and Management (ICAESM), 2012 International Conference on
Conference_Location :
Nagapattinam, Tamil Nadu
Print_ISBN :
978-1-4673-0213-5