Title :
An acoustic model adaptation using HMM-based speech synthesis
Author :
Tanaka, Koji ; Kuroiwa, Shingo ; Tsuge, Satoru ; Ren, Fuji
Author_Institution :
Dept. of Inf. Sci. & Intelligent Syst., Tokushima Univ., Japan
Abstract :
Recently, personal digital assistants like cellular phones are shifting to the IP terminal. The encoding-decoding process utilized for transmitting over IP networks deteriorates the quality of the speech data. This deterioration causes degradation in speech recognition performance. Acoustic model adaptations could improve recognition performance. However, the current adaptation methods usually require a large amount of adaptation data. A novel adaptation method using speech synthesis based on HMM (hidden Markov model) is proposed. This method does not require speech data for adaptation because speech data is generated by speech synthesis from the acoustic model. Experimental results on G.723.1 coded speech recognition show that the proposed method improves speech recognition performance. A relative improvement in word accuracy of approximately 2% was observed.
Keywords :
Internet telephony; hidden Markov models; speech recognition; speech synthesis; G.723.1 coded speech recognition; HMM-based speech synthesis; IP networks; acoustic model adaptation; cellular phone; encoding-decoding process; hidden Markov model; personal digital assistants; speech recognition performance; Acoustic distortion; Adaptation model; Cellular phones; Hidden Markov models; Personal digital assistants; Speech analysis; Speech coding; Speech recognition; Speech synthesis; Telephony;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on
Conference_Location :
Beijing, China
Print_ISBN :
0-7803-7902-0
DOI :
10.1109/NLPKE.2003.1275933