Title :
A hybrid model for named entity recognition using unstructured medical text
Author :
Keretna, Sara ; Chee Peng Lim ; Creighton, Douglas
Author_Institution :
Centre for Intell. Syst. Res., Deakin Univ., Geelong, VIC, Australia
Abstract :
Named entity recognition (NER) is an essential step in the process of information extraction within text mining. This paper proposes a technique to extract drug named entities from unstructured and informal medical text using a hybrid model of lexicon-based and rule-based techniques. In the proposed model, a lexicon is first used as the initial step to detect drug named entities. Inference rules are then deployed to further extract undetected drug names. The designed rules employ part of speech tags and morphological features for drug name detection. The proposed hybrid model is evaluated using a benchmark data set from the i2b2 2009 medication challenge, and is able to achieve an f-score of 66.97%.
Keywords :
data mining; drugs; inference mechanisms; information retrieval; knowledge based systems; medical information systems; text analysis; benchmark data set; biomedical named entity recognition; drug name detection; drug named entity extraction; f-score; hybrid model; inference rules; informal medical text; information extraction; lexicon-based techniques; medication challenge; morphological features; rule-based techniques; speech tags; text mining; unstructured medical text; Biological system modeling; Biomedical imaging; Databases; Dictionaries; Discharges (electric); Drugs; Feature extraction; Association rules; biomedical named entity recognition; information extraction; medical text mining;
Conference_Titel :
System of Systems Engineering (SOSE), 2014 9th International Conference on
Conference_Location :
Adelade, SA
DOI :
10.1109/SYSOSE.2014.6892468