• DocumentCode
    3602152
  • Title

    BMExpert: Mining MEDLINE for Finding Experts in Biomedical Domains Based on Language Model

  • Author

    Beichen Wang ; Xiaodong Chen ; Mamitsuka, Hiroshi ; Shanfeng Zhu

  • Author_Institution
    Shanghai Key Lab. of Intell. Inf. Process., Fudan Univ., Shanghai, China
  • Volume
    12
  • Issue
    6
  • fYear
    2015
  • Firstpage
    1286
  • Lastpage
    1294
  • Abstract
    With the rapid development of biomedical sciences, a great number of documents have been published to report new scientific findings and advance the process of knowledge discovery. By the end of 2013, the largest biomedical literature database, MEDLINE, has indexed over 23 million abstracts. It is thus not easy for scientific professionals to find experts on a certain topic in the biomedical domain. In contrast to the existing services that use some ad hoc approaches, we developed a novel solution to biomedical expert finding, BMExpert, based on the language model. For finding biomedical experts, who are the most relevant to a specific topic query, BMExpert mines MEDLINE documents by considering three important factors: relevance of documents to the query topic, importance of documents, and associations between documents and experts. The performance of BMExpert was evaluated on a benchmark dataset, which was built by collecting the program committee members of ISMB in the past three years (2012-2014) on 14 different topics. Experimental results show that BMExpert outperformed three existing biomedical expert finding services: JANE, GoPubMed, and eTBLAST, with respect to both MAP (mean average precision) and P@50 (Precision). BMExpert is freely accessed at http://datamining-iip.fudan.edu.cn/service/BMExpert/.
  • Keywords
    data mining; medical expert systems; medical information systems; query processing; text analysis; BMExpert; ISMB program committee members; MEDLINE documents; MEDLINE mining; benchmark dataset; biomedical domains; biomedical expert finding services; biomedical literature database; biomedical sciences; knowledge discovery process; language model; query topic; Benchmark testing; Bibliometrics; Bioinformatics; Biological system modeling; Computational modeling; Genomics; Proteins; Biomedical text mining; expert finding; information retrieval; language model;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2015.2430338
  • Filename
    7102707