DocumentCode :
1623075
Title :
Nonnative speech recognition based on bilingual model modification
Author :
Zhang, Qingqing ; Pan, Jielin ; Chan, Shui-duen ; Yan, Yonghong
Author_Institution :
ThinkIT Speech Lab., Chinese Acad. of Sci., Beijing, China
fYear :
2009
Firstpage :
110
Lastpage :
114
Abstract :
This paper presents a novel bilingual model modification approach to improve nonnative speech recognition accuracy when the variations of accented pronunciations occur. Each state of baseline nonnative acoustic model is modified with several candidate states from the auxiliary acoustic model, which is trained on speakers´ mother language. State mapping criterion and n-best candidates are investigated, and different numbers of Gaussian mixtures of the auxiliary acoustic model are compared based on a grammar-constrained speech recognition system. Using this bilingual model modification approach, compared to the nonnative acoustic model which has already been well trained by adaptation technique MAP, the Phrase Error Rate further achieves a 5.83% relative reduction, while only a small relative increase on Real Time Factor occurs.
Keywords :
Gaussian processes; natural language processing; speech recognition; Gaussian mixtures; auxiliary acoustic model; baseline nonnative acoustic model; bilingual model modification approach; grammar-constrained speech recognition system; n-best candidates; nonnative speech recognition; phrase error rate; real time factor; state mapping criterion; Acoustic testing; Automatic speech recognition; Error analysis; Loudspeakers; Maximum likelihood linear regression; Merging; Natural languages; Robustness; Speech recognition; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Fuzzy Systems, 2009. FUZZ-IEEE 2009. IEEE International Conference on
Conference_Location :
Jeju Island
ISSN :
1098-7584
Print_ISBN :
978-1-4244-3596-8
Electronic_ISBN :
1098-7584
Type :
conf
DOI :
10.1109/FUZZY.2009.5277103
Filename :
5277103
Link To Document :
بازگشت