• DocumentCode
    866747
  • Title
    A Survey of Uncertain Data Algorithms and Applications
  • Author
    Aggarwal, Charu C. ; Yu, Philip S.
  • Author_Institution
    IBM T. J. Watson Res. Center, Hawthorne, NY
  • Volume
    21
  • Issue
    5
  • fYear
    2009
  • fDate
    5/1/2009 12:00:00 AM
  • Firstpage
    609
  • Lastpage
    623
  • Abstract
    In recent years, a number of indirect data collection methodologies have led to the proliferation of uncertain data. Such data points are often represented in the form of a probabilistic function, since the corresponding deterministic value is not known. This increases the challenge of mining and managing uncertain data, since the precise behavior of the underlying data is no longer known. In this paper, we provide a survey of uncertain data mining and management applications. In the field of uncertain data management, we will examine traditional methods such as join processing, query processing, selectivity estimation, OLAP queries, and indexing. In the field of uncertain data mining, we will examine traditional mining problems such as classification and clustering. We will also examine a general transform-based technique for mining uncertain data. We discuss the models for uncertain data, and how they can be leveraged in a variety of applications. We discuss different methodologies to process and mine uncertain data in a variety of forms.
  • Keywords
    data mining; database management systems; uncertain systems; database management; indirect data collection; probabilistic information; uncertain data algorithms; uncertain data management; uncertain data mining; Mining methods and algorithms;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1041-4347
  • Type
    jour
  • DOI
    10.1109/TKDE.2008.190
  • Filename
    4626956