Title :
Study on Efficiency of Full-Text Retrieval Based on Lucene
Author :
Li, Shengdong ; Lv, Xueqiang ; Ling, Feng ; Shi, Shuicai
Author_Institution :
Chinese Inf. Process. Res. Center, Beijing Inf. Sci. & Technol. Univ., Beijing, China
Abstract :
By studying and analyzing the structure of the Lucene package, we developed a full-text information retrieval system based on Lucene full-text retrieval. Building on an understanding of the index structure and its principles, we increase the size of the in-memory index buffer and, by means of a specific algorithm, reduce how often the index is written to disk. In addition, we optimize the index by merging it both in memory and on disk. As a result, the efficiency of index creation for 70,000 documents improved by 55.1% in the best case, and the efficiency of information retrieval over 70,000 documents improved by 15.6% in the best case.
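The buffering and merging idea described in the abstract can be illustrated with a minimal sketch. This is a toy inverted index in plain Python, not Lucene's API and not the authors' algorithm, which the abstract does not specify. The class `BufferedIndexer`, the parameter `max_buffered_docs`, and the helper `count_flushes` are hypothetical names introduced here for illustration, and a Python list stands in for on-disk segments.

```python
from collections import defaultdict


class BufferedIndexer:
    """Toy inverted index that buffers postings in memory before flushing."""

    def __init__(self, max_buffered_docs):
        # Hypothetical tuning knob: documents held in memory per flush.
        self.max_buffered_docs = max_buffered_docs
        self.buffer = defaultdict(list)  # term -> doc ids held in memory
        self.buffered_docs = 0
        self.segments = []               # stands in for on-disk segments
        self.flush_count = 0             # number of simulated disk writes

    def add_document(self, doc_id, text):
        # Index each distinct term once per document.
        for term in set(text.lower().split()):
            self.buffer[term].append(doc_id)
        self.buffered_docs += 1
        if self.buffered_docs >= self.max_buffered_docs:
            self.flush()

    def flush(self):
        # Write the in-memory buffer out as one new segment (simulated).
        if self.buffered_docs == 0:
            return
        self.segments.append(dict(self.buffer))
        self.buffer = defaultdict(list)
        self.buffered_docs = 0
        self.flush_count += 1

    def merge_segments(self):
        # Flush pending documents, then combine all segments into one.
        self.flush()
        merged = defaultdict(list)
        for segment in self.segments:
            for term, doc_ids in segment.items():
                merged[term].extend(doc_ids)
        self.segments = [{t: sorted(ids) for t, ids in merged.items()}]

    def search(self, term):
        # A query must visit every segment plus the in-memory buffer,
        # so fewer segments means fewer lookups per query.
        term = term.lower()
        hits = []
        for segment in self.segments:
            hits.extend(segment.get(term, []))
        hits.extend(self.buffer.get(term, []))
        return sorted(hits)


def count_flushes(num_docs, max_buffered_docs):
    """Return how many simulated disk writes indexing num_docs requires."""
    indexer = BufferedIndexer(max_buffered_docs)
    for doc_id in range(num_docs):
        indexer.add_document(doc_id, f"lucene document number {doc_id}")
    indexer.flush()
    return indexer.flush_count
```

In this sketch, `count_flushes(1000, 10)` performs 100 simulated writes while `count_flushes(1000, 100)` performs 10, which shows why a larger buffer lowers write frequency. The sketch models only the count of writes and segments; it says nothing about the 55.1% and 15.6% figures reported for the real system.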
Keywords :
indexing; information retrieval; text analysis; full-text information retrieval system; index buffer; Lucene full-text retrieval; Lucene package; mastering index structure; Access protocols; Indexing; Information analysis; Information processing; Information retrieval; Information science; Information technology; Packaging; Search engines; Writing
Conference_Title :
Information Engineering and Computer Science, 2009. ICIECS 2009. International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-4994-1
DOI :
10.1109/ICIECS.2009.5363389