• DocumentCode
    228144
  • Title

    A hybrid model for named entity recognition using unstructured medical text

  • Author

    Keretna, Sara ; Chee Peng Lim ; Creighton, Douglas

  • Author_Institution
    Centre for Intell. Syst. Res., Deakin Univ., Geelong, VIC, Australia
  • fYear
    2014
  • fDate
    9-13 June 2014
  • Firstpage
    85
  • Lastpage
    90
  • Abstract
    Named entity recognition (NER) is an essential step in the process of information extraction within text mining. This paper proposes a technique to extract drug named entities from unstructured and informal medical text using a hybrid model of lexicon-based and rule-based techniques. In the proposed model, a lexicon is first used as the initial step to detect drug named entities. Inference rules are then deployed to further extract undetected drug names. The designed rules employ part of speech tags and morphological features for drug name detection. The proposed hybrid model is evaluated using a benchmark data set from the i2b2 2009 medication challenge, and is able to achieve an f-score of 66.97%.
  • Keywords
    data mining; drugs; inference mechanisms; information retrieval; knowledge based systems; medical information systems; text analysis; benchmark data set; biomedical named entity recognition; drug name detection; drug named entity extraction; f-score; hybrid model; inference rules; informal medical text; information extraction; lexicon-based techniques; medication challenge; morphological features; rule-based techniques; speech tags; text mining; unstructured medical text; Biological system modeling; Biomedical imaging; Databases; Dictionaries; Discharges (electric); Drugs; Feature extraction; Association rules; biomedical named entity recognition; information extraction; medical text mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    System of Systems Engineering (SOSE), 2014 9th International Conference on
  • Conference_Location
    Adelade, SA
  • Type

    conf

  • DOI
    10.1109/SYSOSE.2014.6892468
  • Filename
    6892468