DocumentCode :
2531509
Title :
Mining Clinical Data with a Temporal Dimension: A Case Study
Author :
Berlingerio, Michele ; Bonchi, Francesco ; Giannotti, Fosca ; Turini, Franco
Author_Institution :
IMT Lucca Inst. for Adv. Studies, Lucca
fYear :
2007
fDate :
2-4 Nov. 2007
Firstpage :
429
Lastpage :
436
Abstract :
Clinical databases store large amounts of information about patients and their medical conditions. Data mining techniques can extract relationships and patterns holding in this wealth of data, and thus be helpful in understanding the progression of diseases and the efficacy of the associated therapies. A typical structure of medical data is a sequence of observations of clinical parameters taken at different time moments. In this kind of contexts, the temporal dimension of data is a fundamental variable that should be taken in account in the mining process and returned as part of the extracted knowledge. Therefore, the classical and well established framework of sequential pattern mining is not enough, because it only focuses on the sequentiality of events, without extracting the typical time elapsing between two particular events. Time-annotated sequences (IAS), is a novel mining paradigm that solves this problem. Recently defined in our laboratory together with an efficient algorithm for extracting them, IAS are sequential patterns where each transition between two events is annotated with a typical transition time that is found frequent in the data. In this paper we report a real-world medical case study, in which the IAS mining paradigm is applied to clinical data regarding a set of patients in the follow-up of a liver transplantation. The aim of the data analysis is that of assessing the effectiveness of the extracorporeal photopheresis (ECP) as a therapy to prevent rejection in solid organ transplantation. For each patient, a set of biochemical variables is recorded at different time moments after the transplantation. The IAS patterns extracted show the values of interleukins and other clinical parameters at specific dates, from which it is possible for the physician to assess the effectiveness of the ECP therapy. We believe that this case study does not only show the interestingness of extracting IAS patterns in this particular context but, more ambitiously, - it suggests a general methodology for clinical data mining, whenever the time dimension is an important variable of the problem in analysis.
Keywords :
biochemistry; data mining; diseases; liver; medical information systems; molecular biophysics; patient treatment; proteins; biochemical variables; clinical data mining; clinical databases; diseases; extracorporeal photopheresis; interleukins; liver transplantation; patient therapy; sequential pattern mining; solid organ transplantation; time-annotated sequences; Data analysis; Data mining; Databases; Diseases; Laboratories; Liver; Medical conditions; Medical treatment; Sequences; Solids;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Bioinformatics and Biomedicine, 2007. BIBM 2007. IEEE International Conference on
Conference_Location :
Fremont, CA
Print_ISBN :
978-0-7695-3031-4
Type :
conf
DOI :
10.1109/BIBM.2007.42
Filename :
4413087
Link To Document :
بازگشت