• DocumentCode
    3124281
  • Title

    Concept Clustering of Evolving Data

  • Author

    Chen, Shixi ; Wang, Haixun ; Zhou, Shuigeng

  • Author_Institution
    Fudan Univ.
  • fYear
    2009
  • fDate
    March 29 2009-April 2 2009
  • Firstpage
    1327
  • Lastpage
    1330
  • Abstract
    Much work has focused on mining evolving data, and most approaches learn the latest model from the latest data. The problem with these approaches is that the learned model is always of low quality. In this paper, we propose a clustering approach to find hidden concepts that control data generation. Unlike traditional clustering methods that are based on data similarity (measured by Euclidean distance, e.g.), we devise a new similarity metric for concept similarity. We propose a two step algorithm, which uses dynamic programming and hierarchical clustering to find concepts in the data.
  • Keywords
    data mining; dynamic programming; pattern clustering; Euclidean distance; concept clustering; data generation control; data similarity; dynamic programming; evolving data mining; hierarchical clustering; Data engineering; Euclidean distance; History; Scattering; Supervised learning; Training data; USA Councils; Unsupervised learning; Venture capital; Web search;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 2009. ICDE '09. IEEE 25th International Conference on
  • Conference_Location
    Shanghai
  • ISSN
    1084-4627
  • Print_ISBN
    978-1-4244-3422-0
  • Electronic_ISBN
    1084-4627
  • Type

    conf

  • DOI
    10.1109/ICDE.2009.232
  • Filename
    4812532