Title :
Clinical Report Classification Using Natural Language Processing and Topic Modeling
Author :
Sarioglu, E. ; Hyeong-Ah Choi ; Yadav, Kuldeep
Author_Institution :
Dept. of Comput. Sci., George Washington Univ., Washington, DC, USA
Abstract :
Large amount of electronic clinical data encompass important information in free text format. To be able to help guide medical decision-making, text needs to be efficiently processed and coded. In this research, we investigate techniques to improve classification of Emergency Department computed topography (CT) reports. The proposed system uses Natural Language Processing (NLP) to generate structured output from patient reports and then applies machine learning techniques to code for the presence of clinically important injuries for traumatic orbital fracture victims. Topic modeling of the corpora is also utilized as an alternative representation of the patient reports. Our results show that both NLP and topic modeling improve raw text classification results. Within NLP features, filtering the codes using modifiers produces the best performance. Topic modeling, on the other hand, shows mixed results. Topic vectors provide good dimensionality reduction and get comparable classification results as with NLP features. However, binary topic classification fails to improve upon raw text classification.
Keywords :
computerised tomography; decision making; medical information systems; natural language processing; pattern classification; text analysis; CT; NLP; clinical report classification; electronic clinical data; emergency department computed tomography reports; free text format; medical decision-making; natural language processing; text classification results; topic modeling; traumatic orbital fracture victims; Biomedical imaging; Computed tomography; Decision trees; Feature extraction; Support vector machines; Unified modeling language; Vectors; NLP; clinical text classification; topic modeling;
Conference_Titel :
Machine Learning and Applications (ICMLA), 2012 11th International Conference on
Conference_Location :
Boca Raton, FL
Print_ISBN :
978-1-4673-4651-1
DOI :
10.1109/ICMLA.2012.173