• DocumentCode
    3123359
  • Title

    On High Dimensional Projected Clustering of Uncertain Data Streams

  • Author

    Aggarwal, Charu C.

  • Author_Institution
    IBM T. J. Watson Res. Center, Hawthorne, NY
  • fYear
    2009
  • fDate
    March 29 2009-April 2 2009
  • Firstpage
    1152
  • Lastpage
    1154
  • Abstract
    In this paper, we will study the problem of projected clustering of uncertain data streams. The use of uncertainty is especially important in the high dimensional scenario, because the sparsity property of high dimensional data is aggravated by the uncertainty. The uncertainty information is important for not only the determination of the assignment of data points to clusters, but also that of the valid projections across which the data is naturally clustered. The problem is especially challenging in the case where the data is not available on disk and arrives in the form of a fast stream. In such cases, the one-pass constraint in data stream computation poses special challenges to the algorithmic sophistication required for incorporating uncertainty information into the high dimensional computations. We will show that the projected clustering problem can be effectively solved in the context of uncertain data streams.
  • Keywords
    data mining; pattern clustering; data stream computation; high dimensional projected clustering; uncertain data streams; uncertainty information; Clustering algorithms; Data engineering; Data mining; Design methodology; Fading; Probability density function; Quality management; Statistics; USA Councils; Uncertainty; high dimensionality; projected clustering; uncertain data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 2009. ICDE '09. IEEE 25th International Conference on
  • Conference_Location
    Shanghai
  • ISSN
    1084-4627
  • Print_ISBN
    978-1-4244-3422-0
  • Electronic_ISBN
    1084-4627
  • Type

    conf

  • DOI
    10.1109/ICDE.2009.188
  • Filename
    4812488