• DocumentCode
    394357
  • Title

    Mixed-excited phonetic vocoding at 265 bps

  • Author

    Maia, R. Da S ; Cirigliano, R. J da R ; Rojtenberg, D. ; Resende, E. C V, Jr.

  • Author_Institution
    Electr. Eng. Program/COPPE, Fed. Univ. of Rio de Janeiro, Brazil
  • Volume
    1
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    In this paper a phonetic vocoder which synthesizes speech using mixed excitation is presented. The encoder carries out HMM-based speech recognition and pitch analysis, whereas the decoder performs parameter extraction from HMM and builds a mixed excitation using pitch and bandpass voicing strengths. The vocoder at an average bit rate of 265 bit/s reaches good degree of intelligibility, while the use of mixed excitation significantly improves the speech quality with no increase of bit rate when compared with the conventional binary excitation pulse train/random noise.
  • Keywords
    band-pass filters; frequency estimation; hidden Markov models; speech coding; speech intelligibility; speech processing; speech recognition; speech synthesis; vocoders; 256 bit/s; HMM-based speech recognition; bandpass voicing strength; intelligibility; mixed excitation; mixed-excited phonetic vocoding; parameter extraction; phonetic vocoder; pitch analysis; speech quality; speech synthesis; Bit rate; Decoding; Hidden Markov models; Parameter extraction; Performance analysis; Speech analysis; Speech enhancement; Speech recognition; Speech synthesis; Vocoders;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.2003.1198901
  • Filename
    1198901