DocumentCode :
3130158
Title :
Summarizing Contrasts by Recursive Pattern Mining
Author :
Soulet, Arnaud ; Crémilleux, Bruno ; Plantevit, Marc
Author_Institution :
LI, Univ. of Tours, Blois, France
fYear :
2011
fDate :
11-11 Dec. 2011
Firstpage :
1155
Lastpage :
1162
Abstract :
A lot of constrained patterns (e.g., emerging patterns, subgroup discovery, classification rules) emphasize the contrasts between data classes and are at the core of many classification techniques. Nevertheless, the extremely large collection of generated patterns hampers the end-user interpretation and the deep understanding of the knowledge revealed by the whole collection of patterns. The key idea of this paper is to summarize the contrasts of a dataset in order to provide understandable characterizations of data classes. We first introduce a novel framework, called recursive pattern mining, for only discovering few as well as relevant patterns. We demonstrate that this approach encompasses usual pattern mining framework and we study its key properties. Then, we use recursive pattern mining for extracting k recursive emerging patterns. Taken together, these patterns form a REP k-summary which summarizes the contrasts of the dataset. Finally, we validate our approach on benchmarks and real-world applications on the biological domain, showing the efficiency and the usefulness of the approach.
Keywords :
data mining; pattern classification; REP k-summary; contrast summarization; data classes; end-user interpretation; pattern classification technique; recursive pattern mining; Biology; Conferences; Context; Data mining; Educational institutions; Itemsets; contrasts; pattern mining; summary;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshops (ICDMW), 2011 IEEE 11th International Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
978-1-4673-0005-6
Type :
conf
DOI :
10.1109/ICDMW.2011.161
Filename :
6137511
Link To Document :
بازگشت