• DocumentCode
    3673215
  • Title

    Automated classification of author's sentiments in citation using machine learning techniques: A preliminary study

  • Author

    In Cheol Kim;George R. Thoma

  • Author_Institution
    Lister Hill National Center for Biomedical Communications, National Library of Medicine, 8600 Rockville Pike, Bethesda, MD 20894
  • fYear
    2015
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    Scientific papers generally include citations to external sources such as journal articles, books, or Web links to refer to works that are related in an important way to the research. The reason for the citation appears within the sentences surrounding the citation tag in the body text, and represents the relationship between the citation and cited works as supportive, contrastive, corrective, etc. This could be an important clue for researchers seeking relevant previous work or approaches for a certain research purpose. We propose to develop an automated method to identify the citing author's sentiments toward the cited external sources expressed in citation sentences using machine-learning techniques and linguistic cues. As a preliminary study, this paper presents a support vector machine (SVM)-based text categorization technique to classify the author's sentiments specifically toward Comment-on (CON) articles. CON, a MEDLINE citation field, indicates previously published articles commented on by authors of a given article expressing possibly complimentary or contradictory opinions. An SVM with a radial basis function (RBF) kernel is implemented, and input feature vectors for the SVM are created based on n-gram word statistics representing the distribution of words in CON sentences. Experiments conducted on a set of CON sentences collected from 414 different online biomedical journal titles show that the SVM with an RBF kernel yields the best result for an input feature vector combining uni-gram and bi-gram word statistics.
  • Keywords
    "Support vector machines","Dictionaries","Accuracy","Text categorization","Kernel","Citation analysis","Training"
  • Publisher
    ieee
  • Conference_Titel
    Computational Intelligence in Bioinformatics and Computational Biology (CIBCB), 2015 IEEE Conference on
  • Type

    conf

  • DOI
    10.1109/CIBCB.2015.7300319
  • Filename
    7300319
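
The abstract mentions feature vectors built from combined uni-gram and bi-gram word statistics and an SVM with an RBF kernel. The following is a minimal illustrative sketch of those two ingredients only, not the authors' implementation: the whitespace tokenization, the raw-count weighting, the `gamma` value, the example sentences, and the function names `ngram_counts` and `rbf_kernel` are all assumptions made here for illustration, and no SVM training is shown.

```python
import math
from collections import Counter


def ngram_counts(sentence, n_values=(1, 2)):
    """Count word n-grams for each n in n_values.

    Assumption: lowercase + whitespace tokenization; the paper's actual
    preprocessing is not described in the abstract.
    """
    tokens = sentence.lower().split()
    counts = Counter()
    for n in n_values:
        for i in range(len(tokens) - n + 1):
            counts[" ".join(tokens[i:i + n])] += 1
    return counts


def rbf_kernel(a, b, gamma=0.5):
    """RBF kernel exp(-gamma * ||a - b||^2) on sparse count vectors.

    Assumption: gamma=0.5 is an arbitrary illustrative value.
    """
    keys = set(a) | set(b)
    sq_dist = sum((a.get(k, 0) - b.get(k, 0)) ** 2 for k in keys)
    return math.exp(-gamma * sq_dist)


# Two hypothetical citation sentences with opposite sentiment.
s1 = ngram_counts("these results support the earlier findings")
s2 = ngram_counts("these results contradict the earlier findings")

# Kernel value an RBF-kernel SVM would use to compare the two sentences.
similarity = rbf_kernel(s1, s2)
print(similarity)
```

Combining uni-grams and bi-grams means a single changed word also changes the bi-grams around it, so the two sentences above differ in more features than a uni-gram-only representation would show.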