Title :
Incorporating Spontaneous Reporting System Data to Aid Causal Inference in Longitudinal Healthcare Data
Author :
Reps, Jenna M. ; Aickelin, Uwe
Author_Institution :
Sch. of Comput. Sci., Univ. of Nottingham, Nottingham, UK
Abstract :
Inferring causality using longitudinal observational databases is challenging due to the passive way the data are collected. The majority of associations found within longitudinal observational data are often non-causal and occur due to confounding. The focus of this paper is to investigate incorporating information from additional databases to complement the longitudinal observational database analysis. We investigate the detection of prescription drug side effects as this is an example of a causal relationship. In previous work a framework was proposed for detecting side effects only using longitudinal data. In this paper we combine a measure of association derived from mining a spontaneous reporting system database to previously proposed analysis that extracts domain expertise features for causal analysis of a UK general practice longitudinal database. The results show that there is a significant improvement to the performance of detecting prescription drug side effects when the longitudinal observation data analysis is complemented by incorporating additional drug safety sources into the framework. The area under the receiver operating characteristic curve (AUC) for correctly classifying a side effect when other data were considered was 0.967, whereas without it the AUC was 0.923 However, the results of this paper may be biased by the evaluation and future work should overcome this by developing an unbiased reference set.
Keywords :
data acquisition; data mining; drugs; feature extraction; health care; inference mechanisms; medical information systems; pattern classification; AUC; UK general practice longitudinal database; causal analysis; causal inference; causal relationship; causality; data collection; domain expertise feature extraction; drug safety sources; longitudinal healthcare data; longitudinal observational database analysis; longitudinal observational databases; mining; prescription drug side effects detection; receiver operating characteristic curve; side effect classification; spontaneous reporting system database; Data mining; Databases; Drugs; Feature extraction; Medical diagnostic imaging; causal inference; drug safety; pharmacovigilance;
Conference_Titel :
Data Mining Workshop (ICDMW), 2014 IEEE International Conference on
Conference_Location :
Shenzhen
Print_ISBN :
978-1-4799-4275-6
DOI :
10.1109/ICDMW.2014.54