DocumentCode :
1097517
Title :
Centroid Neural Network With a Divergence Measure for GPDF Data Clustering
Author :
Park, Dong-Chul ; Kwon, Oh-Hyun ; Chung, Jio
Author_Institution :
Dept. of Inf. Eng., Myong Ji Univ., Yongin
Volume :
19
Issue :
6
fYear :
2008
fDate :
6/1/2008 12:00:00 AM
Firstpage :
948
Lastpage :
957
Abstract :
An unsupervised competitive neural network for efficient clustering of Gaussian probability density function (GPDF) data of continuous density hidden Markov models (CDHMMs) is proposed in this paper. The proposed unsupervised competitive neural network, called the divergence-based centroid neural network (DCNN), employs the divergence measure as its distance measure and utilizes the statistical characteristics of observation densities in the HMM for speech recognition problems. While the conventional clustering algorithms used for the vector quantization (VQ) codebook design utilize only the mean values of the observation densities in the HMM, the proposed DCNN utilizes both the mean and the covariance values. When compared with other conventional unsupervised neural networks, the DCNN successfully allocates more code vectors to the regions where GPDF data are densely distributed while it allocates fewer code vectors to the regions where GPDF data are sparsely distributed. When applied to Korean monophone recognition problems as a tool to reduce the size of the codebook, the DCNN reduced the number of GPDFs used for code vectors by 65.3% while preserving recognition accuracy. Experimental results with a divergence-based k-means algorithm and a divergence-based self-organizing map algorithm are also presented in this paper for a performance comparison.
Keywords :
Gaussian processes; hidden Markov models; neural nets; pattern clustering; speech recognition; vector quantisation; Gaussian probability density function; Korean monophone recognition problem; continuous density hidden Markov model; conventional clustering algorithm; divergence-based centroid neural network; divergence-based k-means algorithm; divergence-based self-organizing map algorithm; speech recognition problem; unsupervised competitive neural network; vector quantization codebook; Clustering method; Gaussian distributions; hidden Markov models (HMM); neural networks; speech recognition; Algorithms; Cluster Analysis; Female; Humans; Male; Markov Chains; Neural Networks (Computer); Pattern Recognition, Automated; Pattern Recognition, Physiological; Pattern Recognition, Visual; Recognition (Psychology);
fLanguage :
English
Journal_Title :
Neural Networks, IEEE Transactions on
Publisher :
ieee
ISSN :
1045-9227
Type :
jour
DOI :
10.1109/TNN.2007.2000051
Filename :
4470005
Link To Document :
بازگشت