• DocumentCode
    1259835
  • Title

    Automatic Identification and Classification of Noun Argument Structures in Biomedical Literature

  • Author

    Ozyurt, I.B.

  • Author_Institution
    Dept. of Psychiatry, Univ. of California, San Diego, La Jolla, CA, USA
  • Volume
    9
  • Issue
    6
  • fYear
    2012
  • Firstpage
    1639
  • Lastpage
    1648
  • Abstract
    The accelerating increase in the biomedical literature makes keeping up with recent advances challenging for researchers thus making automatic extraction and discovery of knowledge from this vast literature a necessity. Building such systems requires automatic detection of lexico-semantic event structures governed by the syntactic and semantic constraints of human languages in sentences of biomedical texts. The lexico-semantic event structures in sentences are centered around the predicates and most semantic role labeling (SRL) approaches focus only on the arguments of verb predicates and neglect argument taking nouns which also convey information in a sentence. In this article, a noun argument structure (NAS) annotated corpus named BioNom and a SRL system to identify and classify these structures is introduced. Also, a genetic algorithm-based feature selection (GAFS) method is introduced and global inference is applied to significantly improve the performance of the NAS Bio SRL system.
  • Keywords
    bioinformatics; classification; data mining; feature extraction; genetic algorithms; programming language semantics; text detection; BioNom; GAFS method; NAS Bio SRL system; automatic detection; automatic extraction; automatic identification; biomedical literature; biomedical text sentences; genetic algorithm-based feature selection method; human languages; knowledge discovery; lexico-semantic event structures; noun argument structure classification; semantic constraints; semantic role labeling approach; syntactic constraints; verb predicates; Biological cells; Genetic algorithms; Natural language processing; Semantics; Support vector machines; Syntactics; Text mining; Natural language processing; biomedical text mining; genetic algorithms; nominalizations; semantic role labeling; Computational Biology; Data Mining; Databases, Factual; Models, Genetic; Natural Language Processing; Semantics; Software; Support Vector Machines;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2012.111
  • Filename
    6261510