Title :
Temporal Properties of Diagnosis Code Time Series in Aggregate
Author :
Perotte, A. ; Hripcsak, G.
Author_Institution :
Dept. of Biomed. Inf., Columbia Univ., New York, NY, USA
Abstract :
Time series are essential to health data research and data mining. We aim to study the properties of one of the more commonly available but historically unreliable types of data: administrative diagnoses in the form of the International Classification of Diseases, Ninth Revision (ICD9) codes. We use differential entropy of ICD9 code time series as a surrogate measure for disease time course and also explore Gaussian kernel smoothing to characterize the time course of diseases in a more fine-grained way. Compared to a gold standard created by a panel of clinicians, the first model classified diseases into acute and chronic groups with a receiver operating characteristic area under curve of 0.83. In the second model, several characteristic temporal profiles were observed including permanent, chronic, and acute. In addition, condition dynamics such as the refractory period for giving birth following childbirth were observed. These models demonstrate that ICD9 codes, despite well-documented concerns, contain valid and potentially valuable temporal information.
Keywords :
data mining; diseases; medical information systems; patient diagnosis; Gaussian kernel smoothing; ICD9 code time series differential entropy; ICD9 codes; International Classification of Diseases Ninth Revision; ROC area under curve; acute diseases; administrative diagnoses; chronic diseases; data mining; diagnosis code time series temporal properties; disease time course surrogate measure; health data research; receiver operating characteristic; temporal information; Diabetes; Diseases; Documentation; Entropy; Sociology; Time series analysis; Biomedical informatics; clinical diagnosis; medical information systems; time series analysis; Data Mining; Electronic Health Records; Entropy; Female; Humans; International Classification of Diseases; Male; Medical Informatics Applications; ROC Curve; Time Factors;
Journal_Title :
Biomedical and Health Informatics, IEEE Journal of
DOI :
10.1109/JBHI.2013.2244610