DocumentCode :
2733791
Title :
Entropy-based clustering for improving document re-ranking
Author :
Teng, Chong ; He, Yanxiang ; Ji, DongHong ; Zhou, Cheng ; Geng, Yixuan ; Chen, Shu
Author_Institution :
Comput. Sch., Wuhan Univ., Wuhan, China
Volume :
3
fYear :
2009
fDate :
20-22 Nov. 2009
Firstpage :
662
Lastpage :
666
Abstract :
Document re-ranking locates between initial retrieval and query expansion in information retrieval system. In this paper, we propose entropy-based clustering approach for document re-ranking. The value of within-cluster entropy determines whether two classes should be merged, and the value of between-cluster entropy determines how many clusters are reasonable. What to do next is finding a suitable cluster from clustering result to construct pseudo labeled document, and conduct document re-ranking as our previous method. We focus clustering strategy for documents after initial retrieval. Experiment with NTCIR-5 data show that the approach can improve the performance of initial retrieval, and it is helpful for improving the quality of document re-ranking.
Keywords :
entropy; information retrieval; pattern clustering; between-cluster entropy; document re-ranking; entropy-based clustering approach; information retrieval system; initial document retrieval; query expansion; within-cluster entropy; Concrete; Entropy; Helium; Information retrieval; Large-scale systems; Mathematics; Statistics; Text analysis; Thesauri; Vocabulary; Clustering; Document re-ranking; Information Retrieval; between-cluster entropy; component; within-cluster entropy;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Computing and Intelligent Systems, 2009. ICIS 2009. IEEE International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-4754-1
Electronic_ISBN :
978-1-4244-4738-1
Type :
conf
DOI :
10.1109/ICICISYS.2009.5358089
Filename :
5358089
Link To Document :
بازگشت