Title :
Enhancing query retrieval efficiency using BGIT coding
Author :
Al-Jedady, Ameen A. ; Alsmadi, Izzat M. ; Al-Shawakfa, Emad M. ; Al-Kabi, Mohammed N.
Author_Institution :
CIS Dept., Yarmouk Univ., Irbid, Jordan
Abstract :
Data compression techniques are used to optimize time and space while sending and retrieving data. In information retrieval, data compression techniques are used by Search engines to reduce the size of their indexes which will result in optimizing the speed and performance of retrieving relevant information. The goal of this research project is to propose some enhancements on search engines indexing using Bigram index term coding. Evaluation of the improvements on search-engine performance resulting from encoding the terms of its index is also conducted. Our experiments showed a good reduction in the size of index terms which contributes to the overall index size. It also showed a significant reduction of the number of comparisons made to process the user queries as a result of reducing the number of symbols representing each index term.
Keywords :
data compression; indexing; natural language processing; query processing; search engines; Arabic language; BGIT coding; bigram based index term coding; data compression techniques; data retrieval; data sending; information retrieval; query retrieval efficiency enhancement; search engines indexing; search-engine performance improvement evaluation; Encoding; Indexing; Search engines; Standards; Information retrieval; bigram; fixed-length coding; index compression; indexing; search engine index;
Conference_Titel :
Computer, Information and Telecommunication Systems (CITS), 2012 International Conference on
Conference_Location :
Amman
Print_ISBN :
978-1-4673-1549-4
DOI :
10.1109/CITS.2012.6220367