Title :
Turning Clusters into Patterns: Rectangle-Based Discriminative Data Description
Author :
Gao, Byron J. ; Ester, Martin
Author_Institution :
Sch. of Comput. Sci., Simon Fraser Univ., Burnaby, BC
Abstract :
The ultimate goal of data mining is to extract knowledge from massive data. Knowledge is ideally represented as human-comprehensible patterns from which end-users can gain intuitions and insights. Yet not all data mining methods produce such readily understandable knowledge, e.g., most clustering algorithms output sets of points as clusters. In this paper, we perform a systematic study of cluster description that generates interpretable patterns from clusters. We introduce and analyze novel description formats leading to more expressive power, motivate and define novel description problems specifying different trade-offs between interpretability and accuracy. We also present effective heuristic algorithms together with their empirical evaluations.
Keywords :
data mining; pattern clustering; cluster description; clustering algorithms; data mining methods; empirical evaluations; human-comprehensible patterns; knowledge extraction; rectangle-based discriminative data description; Clustering algorithms; Clustering methods; Content based retrieval; Data mining; Database systems; Heuristic algorithms; Iterative algorithms; Pareto optimization; Shape; Turning;
Conference_Titel :
Data Mining, 2006. ICDM '06. Sixth International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2701-7
DOI :
10.1109/ICDM.2006.163