DocumentCode :
889511
Title :
Association rule discovery with the train and test approach for heart disease prediction
Author :
Ordonez, Carlos
Author_Institution :
Teradata (NCR), San Diego, CA
Volume :
10
Issue :
2
fYear :
2006
fDate :
4/1/2006 12:00:00 AM
Firstpage :
334
Lastpage :
343
Abstract :
Association rules represent a promising technique to improve heart disease prediction. Unfortunately, when association rules are applied on a medical data set, they produce an extremely large number of rules. Most of such rules are medically irrelevant and the time required to find them can be impractical. A more important issue is that, in general, association rules are mined on the entire data set without validation on an independent sample. To solve these limitations, we introduce an algorithm that uses search constraints to reduce the number of rules, searches for association rules on a training set, and finally validates them on an independent test set. The medical significance of discovered rules is evaluated with support, confidence, and lift. Association rules are applied on a real data set containing medical records of patients with heart disease. In medical terms, association rules relate heart perfusion measurements and risk factors to the degree of disease in four specific arteries. Search constraints and test set validation significantly reduce the number of association rules and produce a set of rules with high predictive accuracy. We exhibit important rules with high confidence, high lift, or both, that remain valid on the test set on several runs. These rules represent valuable medical knowledge
Keywords :
blood vessels; cardiology; data mining; diseases; medical computing; medical information systems; arteries; association rule discovery; heart disease prediction; heart perfusion measurement; medical data set; medical records; risk factor; search constraint; Arteries; Association rules; Biomedical imaging; Cardiac disease; Cardiovascular diseases; Data mining; Heart; Medical diagnostic imaging; Stress; Testing; Association rules; heart disease; search constraint; train and test;
fLanguage :
English
Journal_Title :
Information Technology in Biomedicine, IEEE Transactions on
Publisher :
ieee
ISSN :
1089-7771
Type :
jour
DOI :
10.1109/TITB.2006.864475
Filename :
1613959
Link To Document :
بازگشت