DocumentCode :
3124281
Title :
Concept Clustering of Evolving Data
Author :
Chen, Shixi ; Wang, Haixun ; Zhou, Shuigeng
Author_Institution :
Fudan Univ.
fYear :
2009
fDate :
March 29 2009-April 2 2009
Firstpage :
1327
Lastpage :
1330
Abstract :
Much work has focused on mining evolving data, and most approaches learn the latest model from the latest data. The problem with these approaches is that the learned model is always of low quality. In this paper, we propose a clustering approach to find hidden concepts that control data generation. Unlike traditional clustering methods that are based on data similarity (measured by Euclidean distance, e.g.), we devise a new similarity metric for concept similarity. We propose a two step algorithm, which uses dynamic programming and hierarchical clustering to find concepts in the data.
Keywords :
data mining; dynamic programming; pattern clustering; Euclidean distance; concept clustering; data generation control; data similarity; dynamic programming; evolving data mining; hierarchical clustering; Data engineering; Euclidean distance; History; Scattering; Supervised learning; Training data; USA Councils; Unsupervised learning; Venture capital; Web search;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering, 2009. ICDE '09. IEEE 25th International Conference on
Conference_Location :
Shanghai
ISSN :
1084-4627
Print_ISBN :
978-1-4244-3422-0
Electronic_ISBN :
1084-4627
Type :
conf
DOI :
10.1109/ICDE.2009.232
Filename :
4812532
Link To Document :
بازگشت