Title :
Mixed-excited phonetic vocoding at 265 bps
Author :
Maia, R. Da S ; Cirigliano, R. J da R ; Rojtenberg, D. ; Resende, E. C V, Jr.
Author_Institution :
Electr. Eng. Program/COPPE, Fed. Univ. of Rio de Janeiro, Brazil
Abstract :
In this paper a phonetic vocoder which synthesizes speech using mixed excitation is presented. The encoder carries out HMM-based speech recognition and pitch analysis, whereas the decoder performs parameter extraction from HMM and builds a mixed excitation using pitch and bandpass voicing strengths. The vocoder at an average bit rate of 265 bit/s reaches good degree of intelligibility, while the use of mixed excitation significantly improves the speech quality with no increase of bit rate when compared with the conventional binary excitation pulse train/random noise.
Keywords :
band-pass filters; frequency estimation; hidden Markov models; speech coding; speech intelligibility; speech processing; speech recognition; speech synthesis; vocoders; 256 bit/s; HMM-based speech recognition; bandpass voicing strength; intelligibility; mixed excitation; mixed-excited phonetic vocoding; parameter extraction; phonetic vocoder; pitch analysis; speech quality; speech synthesis; Bit rate; Decoding; Hidden Markov models; Parameter extraction; Performance analysis; Speech analysis; Speech enhancement; Speech recognition; Speech synthesis; Vocoders;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1198901