DocumentCode :
3102315
Title :
The fitting of binned data clustering to imprecise data
Author :
Hamdan, Hani ; Govaert, Gérard
Author_Institution :
Centre de Recherches de Royallieu, Univ. de Technologie de Compiegne, France
fYear :
2004
fDate :
19-23 April 2004
Firstpage :
557
Lastpage :
558
Abstract :
This paper addresses the problem of taking into account the data imprecision in the clustering of binned data using mixture models and binned EM algorithm. Within the framework of a defects detection problem by acoustic emission control, we were brought to treat a set of points using the EM algorithm applied to a diagonal Gaussian mixture model. This one provides a satisfactory solution but the real time constraints imposed in our problem make its application impossible when the number of points becomes too big. As data sets become larger, data processing becomes increasingly complex and as a result, the data analysis is expensive in computation time. The solution that we propose is to group data and available data thus takes the form of a histogram. Such data are also called binned data. We fit the binning data procedure to imprecise data. We model imprecise data by multivariate uncertainty zones and we propose to assign each uncertainty zone to several bins with percentages proportional to its overlapping surfaces with the bins. The experimental results compare this binning procedure with the classical one (applied to imprecise points) and with the interval EM algorithm considered here as a reference, using simulated data.
Keywords :
Gaussian distribution; acoustic emission; data analysis; maximum likelihood estimation; pattern clustering; KL distance; acoustic emission control; binned EM algorithm; binned data fitting; data analysis; data histogram; data imprecision; diagonal Gaussian mixture model; fuzzy clustering; interval EM algorithm; multivariate uncertainty zones; Acoustic emission; Acoustic signal detection; Clustering algorithms; Data analysis; Data processing; Histograms; Machine learning; Machine learning algorithms; Maximum likelihood estimation; Uncertainty;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information and Communication Technologies: From Theory to Applications, 2004. Proceedings. 2004 International Conference on
Print_ISBN :
0-7803-8482-2
Type :
conf
DOI :
10.1109/ICTTA.2004.1307883
Filename :
1307883
Link To Document :
بازگشت