• DocumentCode
    3576388
  • Title

    A naïve Bayesian classifier in categorical uncertain data streams

  • Author

    Jiaqi Ge ; Yuni Xia ; Jian Wang

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Purdue Univ. Indianapolis, Indianapolis, IN, USA
  • fYear
    2014
  • Firstpage
    392
  • Lastpage
    398
  • Abstract
    This paper proposes a novel naïve Bayesian classifier in categorical uncertain data streams. Uncertainty in categorical data is usually represented by vector valued discrete pdf, which has to be carefully handled to guarantee the underlying performance in data mining applications. In this paper, we map the probabilistic attribute to deterministic points in the Euclidean space and design a distance based and a density based algorithms to measure the correlations between feature vectors and class labels. We also devise a new pre-binning approach to guarantee bounded computation and memory cost in uncertain data streams classification. Experimental results in real uncertain data streams prove that our density-based naive classifier is efficient, accurate, and robust to data uncertainty.
  • Keywords
    Bayes methods; data mining; pattern classification; Euclidean space; Naive Bayesian classifier; categorical uncertain data streams; data mining applications; data uncertainty; density based algorithms; density-based naive classifier; feature vectors; guarantee bounded computation; memory cost; prebinning approach; probabilistic attribute; uncertain data stream classification; vector valued discrete pdf; Accuracy; Bayes methods; Equations; Gold; Kernel; Mathematical model; Uncertainty;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Science and Advanced Analytics (DSAA), 2014 International Conference on
  • Type

    conf

  • DOI
    10.1109/DSAA.2014.7058102
  • Filename
    7058102