Title :
Modelling Pronunciation Variation using Multi-Path HMMs for Syllables
Author :
Hamalainen, A. ; Bosch, L. ; Boves, L.
Author_Institution :
Centre for Language & Speech Technol., Radboud Univ. Nijmegen, Netherlands
Abstract :
Recent research suggests that it is more appropriate to model pronunciation variation with syllable-length acoustic models than with triphones. Due to the large number of factors contributing to pronunciation variation at the syllable level, the creation of multi-path model topologies appears necessary. In this paper, we construct multi-path models using phonetic knowledge to initialise the parallel paths, and a data-driven solution for their reestimation. When applied to 94 frequent syllables in a Dutch read speech recognition task, the approach leads to improved recognition performance when compared with a much more complex triphone recogniser. A detailed analysis of the pronunciation variation captured by the parallel paths pinpoints the deficiencies of the approach, and provides insights into how these may be overcome.
Keywords :
hidden Markov models; speech recognition; Dutch read speech recognition task; modelling pronunciation variation; multi-path HMM; phonetic knowledge; syllable-length acoustic models; triphone recogniser; Appropriate technology; Automatic speech recognition; Data mining; Displays; Libraries; Natural languages; Topology; Training data
Conference_Title :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
Print_ISBN :
1-4244-0727-3
DOI :
10.1109/ICASSP.2007.367029