DocumentCode :
2759280
Title :
Clustering Analysis Using Data Range Aware Seeding and Agglomerative Expectation Maximization
Author :
Zhu, Hongwei ; Zhu, Honglei
Author_Institution :
Dept. of lT/DS, Old Dominion Univ., Norfolk, VA
fYear :
2007
fDate :
16-18 Dec. 2007
Firstpage :
904
Lastpage :
911
Abstract :
Expectation maximization (EM) is a local maximization method of the mixture model. When applied to clustering analysis, it generates good results only with reasonably good initialization, which can be produced by hierarchical agglomeration. However, hierarchical agglomeration has poor scalability due to its computational complexity. This paper presents a novel method, called ISOEM, to overcome this limitation. It uses a data range aware seeding algorithm to create an initial classification to initialize an iterative self-organizing process. The process alternates between EM and agglomeration coupled with classification EM. Evaluation using two imagery datasets showed the method had very good performance. The paper also presents the results of using a skewness measure and a separation-cohesion index as indicators for determining the number of clusters in the data.
Keywords :
computational complexity; data handling; expectation-maximisation algorithm; pattern clustering; agglomerative expectation maximization; clustering analysis; computational complexity; data range aware seeding; hierarchical agglomeration; iterative self-organizing process; separation-cohesion index; Clustering algorithms; Computational complexity; Data analysis; Internet; Iterative algorithms; Iterative methods; Merging; Partitioning algorithms; Pattern analysis; USA Councils; Bayesian estimation; Gaussian distribution; Model-based clustering; expectation maximization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal-Image Technologies and Internet-Based System, 2007. SITIS '07. Third International IEEE Conference on
Conference_Location :
Shanghai
Print_ISBN :
978-0-7695-3122-9
Type :
conf
DOI :
10.1109/SITIS.2007.61
Filename :
4618870
Link To Document :
بازگشت