DocumentCode :
3782997
Title :
Agglomerative vs. tree-based clustering for the definition of multilingual set of triphones
Author :
B. Imperl;Z. Kacic;B. Horvat;A. Zgank
Author_Institution :
Fac. of Electr. Eng. & Comput. Sci., Maribor Univ., Slovenia
Volume :
3
fYear :
2000
Firstpage :
1273
Abstract :
The paper addresses the problem of multilingual acoustic modelling for the design of multilingual speech recognisers. Two different approaches for the definition of multilingual set of triphones (bottom-up and a top-down) are investigated. A new clustering algorithm for the definition of multilingual set of triphones is proposed. The agglomerative clustering algorithm (bottom-up) is based on a definition of a distance measure for triphones defined as a weighted sum of explicit estimates of the context similarity on a monophone level. The monophone similarity estimation method is based on the algorithm of Houtgast. The second type of system uses tree-based clustering (top-down) with a common decision tree. The experiments were based on the SpeechDat II databases (Slovenian, Spanish and German 1000 FDB SpeechDat II). Experiments have shown that the use of the agglomerative clustering algorithm results in a significant reduction of the number of triphones with minor degradation of word accuracy.
Keywords :
"Context modeling","Clustering algorithms","Speech recognition","Signal processing algorithms","Decision trees","Degradation","Laboratories","Digital signal processing","Computer science","Databases"
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2000. ICASSP ´00. Proceedings. 2000 IEEE International Conference on
ISSN :
1520-6149
Print_ISBN :
0-7803-6293-4
Type :
conf
DOI :
10.1109/ICASSP.2000.861809
Filename :
861809
Link To Document :
بازگشت