DocumentCode
228144
Title
A hybrid model for named entity recognition using unstructured medical text
Author
Keretna, Sara ; Chee Peng Lim ; Creighton, Douglas
Author_Institution
Centre for Intell. Syst. Res., Deakin Univ., Geelong, VIC, Australia
fYear
2014
fDate
9-13 June 2014
Firstpage
85
Lastpage
90
Abstract
Named entity recognition (NER) is an essential step in the process of information extraction within text mining. This paper proposes a technique to extract drug named entities from unstructured and informal medical text using a hybrid model of lexicon-based and rule-based techniques. In the proposed model, a lexicon is first used as the initial step to detect drug named entities. Inference rules are then deployed to further extract undetected drug names. The designed rules employ part of speech tags and morphological features for drug name detection. The proposed hybrid model is evaluated using a benchmark data set from the i2b2 2009 medication challenge, and is able to achieve an f-score of 66.97%.
Keywords
data mining; drugs; inference mechanisms; information retrieval; knowledge based systems; medical information systems; text analysis; benchmark data set; biomedical named entity recognition; drug name detection; drug named entity extraction; f-score; hybrid model; inference rules; informal medical text; information extraction; lexicon-based techniques; medication challenge; morphological features; rule-based techniques; speech tags; text mining; unstructured medical text; Biological system modeling; Biomedical imaging; Databases; Dictionaries; Discharges (electric); Drugs; Feature extraction; Association rules; biomedical named entity recognition; information extraction; medical text mining;
fLanguage
English
Publisher
ieee
Conference_Titel
System of Systems Engineering (SOSE), 2014 9th International Conference on
Conference_Location
Adelade, SA
Type
conf
DOI
10.1109/SYSOSE.2014.6892468
Filename
6892468
Link To Document