DocumentCode :
763636
Title :
Modeling, estimating, and compensating low-bit rate coding distortion in speech recognition
Author :
Yoma, Néstor Becerra ; Molina, Carlos ; Silva, Jorge ; Busso, Carlos
Author_Institution :
Electr. Eng. Dept., Univ. of Chile, Santiago, Chile
Volume :
14
Issue :
1
Year :
2006
Firstpage :
246
Lastpage :
255
Abstract :
This paper presents a solution to the problem of speech recognition with signals distorted by low-bit-rate coders. It proposes a model for the coding-decoding distortion, an HMM compensation method that incorporates this model, and an EM-based adaptation algorithm to estimate the distortion. Medium-vocabulary, continuous-speech, speaker-independent recognition experiments with 8 kbps G.729 (CS-CELP), 13 kbps RPE-LTP (GSM), 5.3 kbps G.723.1, 4.8 kbps FS-1016, and 32 kbps G.726 (ADPCM) coders show that the approach dramatically reduces the effect of the coding distortion and, in some cases, gives a word accuracy higher than the baseline system with uncoded speech. Finally, the EM estimation algorithm requires only one adapting utterance, which makes the approach suitable for dialogue systems where just a few adapting utterances are available.
Keywords :
distortion; hidden Markov models; interactive systems; linear predictive coding; speech codecs; speech coding; speech recognition; 13 kbit/s; 32 kbit/s; 4.8 kbit/s; 5.3 kbit/s; 8 kbit/s; CELP; GSM; HMM; coders; coding-decoding distortion; continuous-speech speaker-independent recognition; dialogue systems; hidden Markov model compensation method; low-bit rate coding distortion; Acoustic distortion; Cepstral analysis; Error analysis; Helium; Rate distortion theory; Vocabulary; Coding distortion; EM estimation algorithm; HMM compensation; low-bit rate coders
Language :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
IEEE
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TSA.2005.852994
Filename :
1561281