• DocumentCode
    3227996
  • Title

    An exponential mixture models for noise reduction in clustering

  • Author

    Ren, Dehao ; Li, Wenzao

  • Author_Institution
    Coll. of Commun. Eng., Chengdu Univ. of Inf. & Technol., Chengdu, China
  • fYear
    2011
  • fDate
    27-29 May 2011
  • Firstpage
    457
  • Lastpage
    461
  • Abstract
    Noise is an important factor that influences the performance of clustering or classifying. Many researches on noise reduction were reported in this eld. In this paper we assume that the noise data and the real data integrate with each other, and we design an exponential mixture model to reduce noise data for improvement of performance of clustering. This model is based on the mathematic theory which is that two exponential family distributions multiply with each other to get another exponential family distribution. If the noise data is Gaussian distribution and real data is Gaussian distribution, the observed data also is Gaussian distribution. In the real case the observed data can be measured, but the real data can not be known. This exponential mixture model´s goal is to reduce the noise data from the observed data. Finally the real data can be obtained. We use this model to preprocess the observed data for clustering, we found that the performance is improved much, which shows that the model works well. Noise is an important factor that influences the performance of clustering or classifying. Many researches on noise reduction were reported in this eld. In this paper we assume that the noise data and the real data integrate with each other, and we design an exponential mixture model to reduce noise data for improvement of performance of clustering. This model is based on the mathematic theory which is that two exponential family distributions multiply with each other to get another exponential family distribution. If the noise data is Gaussian distribution and real data is Gaussian distribution, the observed data also is Gaussian distribution. In the real case the observed data can be measured, but the real data can not be known. This exponential mixture model´s goal is to reduce the noise data from the observed data. Finally the real data can be obtained. We use this model to preprocess the observed data for clustering, we found that the performance is improv- - ed much, which shows that the model works well.
  • Keywords
    Gaussian distribution; exponential distribution; pattern classification; pattern clustering; Gaussian distribution; clustering performance; exponential family distribution; exponential mixture model; mathematic theory; noise data reduction; Data models; Clustering; Exponential Mixture model; Noise Reduction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communication Software and Networks (ICCSN), 2011 IEEE 3rd International Conference on
  • Conference_Location
    Xi´an
  • Print_ISBN
    978-1-61284-485-5
  • Type

    conf

  • DOI
    10.1109/ICCSN.2011.6014091
  • Filename
    6014091