• DocumentCode
    1815382
  • Title

    Investigation into biomedical literature classification using support vector machines

  • Author

    Polavarapu, Nalini ; Navathe, Shamkant B. ; Ramnarayanan, Ramprasad ; Haque, Abrar Ul ; Sahay, Saurav ; Liu, Ying

  • Author_Institution
    Sch. of Biol., Georgia Inst. of Technol., Atlanta, GA, USA
  • fYear
    2005
  • fDate
    8-11 Aug. 2005
  • Firstpage
    366
  • Lastpage
    374
  • Abstract
    Specific topic search in the PubMed Database, one of the most important information resources for scientific community, presents a big challenge to the users. The researcher typically formulates boolean queries followed by scanning the retrieved records for relevance, which is very time consuming and error prone. We applied Support Vector Machines (SVM) for automatic retrieval of PubMed articles related to Human genome epidemiological research at CDC (Center for disease Control and Prevention). In this paper, we discuss various investigations into biomedical literature classification and analyze the effect of various issues related to the choice of keywords, training sets, kernel functions and parameters for the SVM technique. We report on the various factors above to show that SVM is a viable technique for automatic classification of biomedical literature into topics of interest such as epidemiology, cancer, birth defects etc. In all our experiments, we achieved high values of PPV, sensitivity and specificity.
  • Keywords
    cancer; medical computing; medical information systems; operating system kernels; support vector machines; tumours; PubMed database; SVM technique; automatic retrieval; biomedical literature classification; birth defects; boolean query; cancer; human genome epidemiological research; information resource; kernel function; scientific community; support vector machines; training sets; Automatic control; Bioinformatics; Databases; Diseases; Genomics; Humans; Information resources; Kernel; Support vector machine classification; Support vector machines;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computational Systems Bioinformatics Conference, 2005. Proceedings. 2005 IEEE
  • Print_ISBN
    0-7695-2344-7
  • Type

    conf

  • DOI
    10.1109/CSB.2005.36
  • Filename
    1498038