DocumentCode
3124281
Title
Concept Clustering of Evolving Data
Author
Chen, Shixi ; Wang, Haixun ; Zhou, Shuigeng
Author_Institution
Fudan Univ.
fYear
2009
fDate
March 29 2009-April 2 2009
Firstpage
1327
Lastpage
1330
Abstract
Much work has focused on mining evolving data, and most approaches learn the latest model from the latest data. The problem with these approaches is that the learned model is always of low quality. In this paper, we propose a clustering approach to find hidden concepts that control data generation. Unlike traditional clustering methods that are based on data similarity (measured by Euclidean distance, e.g.), we devise a new similarity metric for concept similarity. We propose a two step algorithm, which uses dynamic programming and hierarchical clustering to find concepts in the data.
Keywords
data mining; dynamic programming; pattern clustering; Euclidean distance; concept clustering; data generation control; data similarity; dynamic programming; evolving data mining; hierarchical clustering; Data engineering; Euclidean distance; History; Scattering; Supervised learning; Training data; USA Councils; Unsupervised learning; Venture capital; Web search;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Engineering, 2009. ICDE '09. IEEE 25th International Conference on
Conference_Location
Shanghai
ISSN
1084-4627
Print_ISBN
978-1-4244-3422-0
Electronic_ISBN
1084-4627
Type
conf
DOI
10.1109/ICDE.2009.232
Filename
4812532
Link To Document