Title :
BMExpert: Mining MEDLINE for Finding Experts in Biomedical Domains Based on Language Model
Author :
Beichen Wang ; Xiaodong Chen ; Mamitsuka, Hiroshi ; Shanfeng Zhu
Author_Institution :
Shanghai Key Lab. of Intell. Inf. Process., Fudan Univ., Shanghai, China
Abstract :
With the rapid development of biomedical sciences, a great number of documents have been published to report new scientific findings and advance the process of knowledge discovery. By the end of 2013, the largest biomedical literature database, MEDLINE, has indexed over 23 million abstracts. It is thus not easy for scientific professionals to find experts on a certain topic in the biomedical domain. In contrast to the existing services that use some ad hoc approaches, we developed a novel solution to biomedical expert finding, BMExpert, based on the language model. For finding biomedical experts, who are the most relevant to a specific topic query, BMExpert mines MEDLINE documents by considering three important factors: relevance of documents to the query topic, importance of documents, and associations between documents and experts. The performance of BMExpert was evaluated on a benchmark dataset, which was built by collecting the program committee members of ISMB in the past three years (2012-2014) on 14 different topics. Experimental results show that BMExpert outperformed three existing biomedical expert finding services: JANE, GoPubMed, and eTBLAST, with respect to both MAP (mean average precision) and P@50 (Precision). BMExpert is freely accessed at http://datamining-iip.fudan.edu.cn/service/BMExpert/.
Keywords :
data mining; medical expert systems; medical information systems; query processing; text analysis; BMExpert; ISMB program committee members; MEDLINE documents; MEDLINE mining; benchmark dataset; biomedical domains; biomedical expert finding services; biomedical literature database; biomedical sciences; knowledge discovery process; language model; query topic; Benchmark testing; Bibliometrics; Bioinformatics; Biological system modeling; Computational modeling; Genomics; Proteins; Biomedical text mining; expert finding; information retrieval; language model;
Journal_Title :
Computational Biology and Bioinformatics, IEEE/ACM Transactions on
DOI :
10.1109/TCBB.2015.2430338