Title :
Clinical Documents Clustering Based on Medication/Symptom Names Using Multi-View Nonnegative Matrix Factorization
Author :
Yuan Ling ; Xuelian Pan ; Guangrong Li ; Xiaohua Hu
Author_Institution :
Coll. of Comput. & Inf., Drexel Univ., Philadelphia, PA, USA
fDate :
7/1/2015 12:00:00 AM
Abstract :
Clinical documents are rich free-text data sources containing valuable medication and symptom information, which have a great potential to improve health care. In this paper, we build an integrating system for extracting medication names and symptom names from clinical notes. Then we apply nonnegative matrix factorization (NMF) and multi-view NMF to cluster clinical notes into meaningful clusters based on sample-feature matrices. Our experimental results show that multi-view NMF is a preferable method for clinical document clustering. Moreover, we find that using extracted medication/symptom names to cluster clinical documents outperforms just using words.
Keywords :
document handling; matrix decomposition; medical information systems; clinical document clustering; medication names; multiview nonnegative matrix factorization; sample-feature matrices; symptom names; Accuracy; Data mining; Design automation; Diseases; Hospitals; Informatics; Medical diagnostic imaging; Clinical document; clinical notes; document clustering; multi-view; nonnegative matrix factorization;
Journal_Title :
NanoBioscience, IEEE Transactions on
DOI :
10.1109/TNB.2015.2422612