Title :
Concept Clustering of Evolving Data
Author :
Chen, Shixi ; Wang, Haixun ; Zhou, Shuigeng
Author_Institution :
Fudan Univ.
fDate :
March 29 2009-April 2 2009
Abstract :
Much work has focused on mining evolving data, and most approaches learn the latest model from the latest data. The problem with these approaches is that the learned model is always of low quality. In this paper, we propose a clustering approach to find hidden concepts that control data generation. Unlike traditional clustering methods that are based on data similarity (measured by Euclidean distance, e.g.), we devise a new similarity metric for concept similarity. We propose a two step algorithm, which uses dynamic programming and hierarchical clustering to find concepts in the data.
Keywords :
data mining; dynamic programming; pattern clustering; Euclidean distance; concept clustering; data generation control; data similarity; dynamic programming; evolving data mining; hierarchical clustering; Data engineering; Euclidean distance; History; Scattering; Supervised learning; Training data; USA Councils; Unsupervised learning; Venture capital; Web search;
Conference_Titel :
Data Engineering, 2009. ICDE '09. IEEE 25th International Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-1-4244-3422-0
Electronic_ISBN :
1084-4627
DOI :
10.1109/ICDE.2009.232