DocumentCode :
2608592
Title :
Efficient Gaussian Mixture for Speech Recognition
Author :
Zouari, Lilia ; Chollet, Gérard
Author_Institution :
GET-ENST, Paris
Volume :
4
fYear :
0
fDate :
0-0 0
Firstpage :
294
Lastpage :
297
Abstract :
This article presents a clustering algorithm to determine the optimal number of components in a Gaussian mixture. The principle is to start from an important number of mixture components then group the multivariate normal distributions into clusters using the divergence, a weighted symmetric, distortion measure based on the Kullback-Leibler distance. The optimal cut in the tree, i.e. the clustering, satisfies criteria based on either the minimum amount of available training data or dissimilarities between clusters. The performance of this algorithm is compared favorably against a reference system and a likelihood loss based clustering system. The tree cutting criteria are also discussed. About an hour of Ester, a French broadcast news database is used for the recognition experiments. Performance is significantly improved and the word error rate decreases by about4.8%, where the confidence interval is 1%
Keywords :
Gaussian processes; normal distribution; pattern clustering; speech recognition; French broadcast news database; Gaussian mixture; Kullback-Leibler distance; clustering algorithm; multivariate normal distributions; speech recognition; Broadcasting; Clustering algorithms; Databases; Distortion measurement; Error analysis; Gaussian distribution; Performance loss; Speech recognition; Training data; Weight measurement;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition, 2006. ICPR 2006. 18th International Conference on
Conference_Location :
Hong Kong
ISSN :
1051-4651
Print_ISBN :
0-7695-2521-0
Type :
conf
DOI :
10.1109/ICPR.2006.475
Filename :
1699838
Link To Document :
بازگشت