DocumentCode :
2703689
Title :
Modelling Pronunciation Variation using Multi-Path HMMS for Syllables
Author :
Hamalainen, A. ; Bosch, L. ; Boves, L.
Author_Institution :
Centre for Language & Speech Technol., Radboud Univ. Nijmegen, Netherlands
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
Recent research suggests that it is more appropriate to model pronunciation variation with syllable-length acoustic models than with triphones. Due to the large number of factors contributing to pronunciation variation at the syllable level, the creation of multi-path model topologies appears necessary. In this paper, we construct multi-path models using phonetic knowledge to initialise the parallel paths, and a data-driven solution for their reestimation. When applied to 94 frequent syllables in a Dutch read speech recognition task, the approach leads to improved recognition performance when compared with a much more complex triphone recogniser. A detailed analysis of the pronunciation variation captured by the parallel paths pinpoints the deficiencies of the approach, and provides insights into how these may be overcome.
Keywords :
hidden Markov models; speech recognition; Dutch read speech recognition task; modelling pronunciation variation; multi-path HMM; phonetic knowledge; syllable-length acoustic models; triphone recogniser; Appropriate technology; Automatic speech recognition; Data mining; Displays; Hidden Markov models; Libraries; Natural languages; Speech recognition; Topology; Training data; Speech recognition; hidden Markov models;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.367029
Filename :
4218217
Link To Document :
بازگشت