Title :
BioCLink: A Probabilistic Approach for Improving Genomics Search with Citation Links
Author :
Yin, Xiaoshi ; Huang, Xiangji ; Li, Zhoujun
Author_Institution :
Coll. of Comput. Sci. & Eng., Beihang Univ., Beijing, China
Abstract :
Combination of multiple evidences has been shown to be effective in genomics literature retrieval. Citation information is an intuitive evidence for facilitating literature retrieval. Previous research on citation analysis has demonstrated that useful linkage information can be extracted from the citation graph. However, the question of how the combination of citation evidence and content evidence should be done to maximize retrieval accuracy still remains largely unanswered. In this paper, we propose BioCLink, a new probabilistic approach that integrates citation evidence into content-based weighting function for improving genomics literature retrieval performance. Based on findings of our previous study, a strategy for modeling citation evidence is proposed. BioCLink provides the combination of content and citation evidences with a theoretical support. Moreover, exhaustiveparameter tuning can be avoided using BioCLink. Extensive experiments on TREC 2006 and 2007 Genomics collections demonstrate the advantages and effectiveness of our proposed methods.
Keywords :
bioinformatics; citation analysis; content-based retrieval; genomics; BioCLink; citation analysis; citation graph; citation links; content-based weighting function; genomics literature retrieval; genomics search; information citation; linkage information; Bioinformatics; Biomedical engineering; Citation analysis; Computer science; Content based retrieval; Couplings; Educational institutions; Genomics; Information retrieval; Information technology; citation link analysis; genomics literature retrieval; probabilistic model;
Conference_Titel :
Bioinformatics and Biomedicine, 2009. BIBM '09. IEEE International Conference on
Conference_Location :
Washington, DC
Print_ISBN :
978-0-7695-3885-3
DOI :
10.1109/BIBM.2009.83