DocumentCode
394357
Title
Mixed-excited phonetic vocoding at 265 bps
Author
Maia, R. Da S ; Cirigliano, R. J da R ; Rojtenberg, D. ; Resende, E. C V, Jr.
Author_Institution
Electr. Eng. Program/COPPE, Fed. Univ. of Rio de Janeiro, Brazil
Volume
1
fYear
2003
fDate
6-10 April 2003
Abstract
In this paper a phonetic vocoder which synthesizes speech using mixed excitation is presented. The encoder carries out HMM-based speech recognition and pitch analysis, whereas the decoder performs parameter extraction from HMM and builds a mixed excitation using pitch and bandpass voicing strengths. The vocoder at an average bit rate of 265 bit/s reaches good degree of intelligibility, while the use of mixed excitation significantly improves the speech quality with no increase of bit rate when compared with the conventional binary excitation pulse train/random noise.
Keywords
band-pass filters; frequency estimation; hidden Markov models; speech coding; speech intelligibility; speech processing; speech recognition; speech synthesis; vocoders; 256 bit/s; HMM-based speech recognition; bandpass voicing strength; intelligibility; mixed excitation; mixed-excited phonetic vocoding; parameter extraction; phonetic vocoder; pitch analysis; speech quality; speech synthesis; Bit rate; Decoding; Hidden Markov models; Parameter extraction; Performance analysis; Speech analysis; Speech enhancement; Speech recognition; Speech synthesis; Vocoders;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
ISSN
1520-6149
Print_ISBN
0-7803-7663-3
Type
conf
DOI
10.1109/ICASSP.2003.1198901
Filename
1198901
Link To Document