DocumentCode :
3195316
Title :
Patient information extraction in noisy tele-health texts
Author :
Mi-Young Kim ; Ying Xu ; Zaiane, Osmar ; Goebel, R.
Author_Institution :
Dept. of Comput. Sci., Univ. of Alberta, Edmonton, AB, Canada
fYear :
2013
fDate :
18-21 Dec. 2013
Firstpage :
326
Lastpage :
329
Abstract :
We explore methods for effectively extracting information from clinical narratives, which are captured in a public health consulting phone service called HealthLink. The currently available data consist of dialogues constructed by nurses while consulting patients on the phone. Since the data are interviews transcribed by nurses during phone conversations, they include a significant volume and variety of noise. The first kind is explicit noise, which includes spelling errors, unfinished sentences, omission of sentence delimiters, variants of terms, etc. The second kind is implicit noise, which includes non-patient information and negation of patient information. To filter explicit noise, we propose a biomedical term detection/normalization method: it resolves misspellings, term variations, and arbitrary abbreviations of terms by nurses. To detect temporal terms and other types of named entities (which show patients' personal information such as age and sex), we propose bootstrapping-based pattern learning that detects arbitrary variations of the named entities. To address implicit noise, we propose a dependency path-based filtering method. The result of our denoising is the extraction of normalized patient information. The experimental results show that we achieve reasonable performance with our noise reduction methods.
Keywords :
electronic health records; information filtering; learning (artificial intelligence); personal information systems; telemedicine; text analysis; biomedical term variations; bootstrapping-based pattern learning; de-noising; detection-normalization method; filter explicit noise; healthlink; misspelling; noise reduction methods; noisy telehealth texts; nonpatient information; normalized patient information extraction; nurses; path-based filtering method; patient personal information; phone conversations; public health consulting phone service; sentence delimiter omission; spelling errors; unfinished sentences; Data mining; Diseases; Information retrieval; Noise; Semantics; Syntactics; Unified modeling language;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Bioinformatics and Biomedicine (BIBM), 2013 IEEE International Conference on
Conference_Location :
Shanghai
Type :
conf
DOI :
10.1109/BIBM.2013.6732511
Filename :
6732511