• DocumentCode
    3048192
  • Title

    Two Approaches for Biomedical Text Classification

  • Author

    Li, Yanpeng ; Lin, Honfei ; Yang, Zhihao

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Dalian Univ. of Technol., Dalian
  • fYear
    2007
  • fDate
    6-8 July 2007
  • Firstpage
    310
  • Lastpage
    313
  • Abstract
    Automatic text classification systems can be especially valuable to biomedical researchers who seek to discover knowledge from terabyte-scale biomedical literatures. Different from the general domain, biomedical literatures contain a large number of named entities, complicated session structures and rich ontology resources. Taking these features into account, two approaches for biomedical text classification are presented, i.e., concept expansion and Meta-classification. Concept expansion is a method that introduces concept features using biomedical named entity recognition. Meta-classification is to combine the classification results of different parts of the full-text article and ontology resources using a Logistic regression model. The experiment results on the test set of TREC 2005 genomics track categorization task show that these techniques can improve the performance of the classification system consistently for all the classes.
  • Keywords
    biology computing; regression analysis; text analysis; Biomedical Text Classification; concept expansion; full-text article; logistic regression model; meta-classification; ontology resources; Bioinformatics; Biomedical engineering; Computer science; Genomics; Knowledge engineering; Logistics; Mice; Ontologies; System testing; Text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Bioinformatics and Biomedical Engineering, 2007. ICBBE 2007. The 1st International Conference on
  • Conference_Location
    Wuhan
  • Print_ISBN
    1-4244-1120-3
  • Type

    conf

  • DOI
    10.1109/ICBBE.2007.83
  • Filename
    4272567