مرکز منطقه ای اطلاع رساني علوم و فناوري - An acoustic model adaptation using HMM-based speech synthesis

DocumentCode :

2665105

Title :

An acoustic model adaptation using HMM-based speech synthesis

Author :

Tanaka, Koji ; Kuroiwa, Shingo ; Tsuge, Satoru ; Ren, Fuji

Author_Institution :

Dept. of Inf. Sci. & Intelligent Syst., Tokushima Univ., Japan

fYear :

2003

fDate :

26-29 Oct. 2003

Firstpage :

368

Lastpage :

373

Abstract :

Recently, personal digital assistants like cellular phones are shifting to the IP terminal. The encoding-decoding process utilized for transmitting over IP networks deteriorates the quality of the speech data. This deterioration causes degradation in speech recognition performance. Acoustic model adaptations could improve recognition performance. However, the current adaptation methods usually require a large amount of adaptation data. A novel adaptation method using speech synthesis based on HMM (hidden Markov model) is proposed. This method does not require speech data for adaptation because speech data is generated by speech synthesis from the acoustic model. Experimental results on G.723.1 coded speech recognition show that the proposed method improves speech recognition performance. A relative improvement in word accuracy of approximately 2% was observed.

Keywords :

Internet telephony; hidden Markov models; speech recognition; speech synthesis; G.723.1 coded speech recognition; HMM-based speech synthesis; IP networks; acoustic model adaptation; cellular phone; encoding-decoding process; hidden Markov model; personal digital assistants; speech recognition performance; Acoustic distortion; Adaptation model; Cellular phones; Hidden Markov models; Personal digital assistants; Speech analysis; Speech coding; Speech recognition; Speech synthesis; Telephony;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003 International Conference on

Conference_Location :

Beijing, China

Print_ISBN :

0-7803-7902-0

Type :

conf

DOI :

10.1109/NLPKE.2003.1275933

Filename :

1275933

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2665105