• DocumentCode
    2852343
  • Title

    Joint optimization of model and excitation in parametric speech coders

  • Author

    Lashkari, Khosrow ; Miki, Toshio

  • Author_Institution
    DoCoMo USA Laboratories, Inc., 181 Metro Drive, San Jose, California 95110, USA
  • Volume
    1
  • fYear
    2002
  • fDate
    13-17 May 2002
  • Abstract
    This paper presents a new Analysis-by-Synthesis (AbS) technique for joint optimization of the excitation and model parameters, based on minimizing the closed-loop synthesis error instead of the linear prediction error. Minimizing the synthesis error makes the analysis and synthesis stages more compatible. Using a gradient search in the root domain, the model parameters for a given excitation are optimized to minimize the error between the original and the synthesized speech. Since the optimization starts from the LPC solution, the synthesis error is guaranteed to be no higher than that obtained using the LPC coefficients. For multipulse LPC, there is a 0.5–1 dB improvement in segmental SNR for male and female speakers over sentences 4 to 6 seconds long. Listening tests and objective MOS scores confirm the improved speech quality. By adding an extra optimization step, the technique can be incorporated into LPC, multipulse LPC, and CELP-type speech coders.
  • Keywords
    Frequency synthesizers; Maximum likelihood detection; Nonlinear filters; Optimization; Production; Signal to noise ratio
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
  • Conference_Location
    Orlando, FL, USA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7402-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.2002.5743708
  • Filename
    5743708
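
The idea summarized in the abstract (start from the LPC solution, then adjust the model in the root domain to reduce the synthesis error for a fixed excitation) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a second-order all-pole model with a single conjugate pole pair (radius `r`, angle `theta`), a crude multipulse stand-in that keeps the largest LPC residual samples, a finite-difference gradient, and a backtracking step rule. All function names (`synthesize`, `lpc2`, `lpc_to_root`, `optimize_roots`) and the toy signal are hypothetical.

```python
import math
import random

def synthesize(r, theta, excitation):
    """All-pole synthesis 1/A(z) with one conjugate pole pair r*exp(+/-j*theta)."""
    c1, c2 = 2.0 * r * math.cos(theta), -r * r
    out, y1, y2 = [], 0.0, 0.0
    for u in excitation:
        y = u + c1 * y1 + c2 * y2
        out.append(y)
        y1, y2 = y, y1
    return out

def synthesis_error(params, speech, excitation):
    """Squared error between the original and the synthesized signal."""
    synth = synthesize(params[0], params[1], excitation)
    return sum((s - y) ** 2 for s, y in zip(speech, synth))

def lpc2(x):
    """Order-2 autocorrelation-method LPC: x[n] ~ a1*x[n-1] + a2*x[n-2]."""
    R = [sum(x[n] * x[n - k] for n in range(k, len(x))) for k in range(3)]
    det = R[0] * R[0] - R[1] * R[1]
    return (R[1] * R[0] - R[1] * R[2]) / det, (R[0] * R[2] - R[1] * R[1]) / det

def lpc_to_root(a1, a2):
    """Map (a1, a2) to (r, theta). Assumption: if the LPC roots are real,
    this projects onto the nearest conjugate-pair form instead of failing."""
    r = min(math.sqrt(max(-a2, 1e-6)), 0.999)
    cos_t = max(-1.0, min(1.0, a1 / (2.0 * r)))
    return [r, math.acos(cos_t)]

def optimize_roots(speech, excitation, start, iters=50, h=1e-5):
    """Gradient search over (r, theta); a step is accepted only if it lowers
    the synthesis error, so the result is never worse than the start."""
    params = list(start)
    err = synthesis_error(params, speech, excitation)
    for _ in range(iters):
        grad = []
        for i in range(2):  # forward finite-difference gradient
            bumped = list(params)
            bumped[i] += h
            grad.append((synthesis_error(bumped, speech, excitation) - err) / h)
        step = 1.0
        while step > 1e-12:  # backtracking: halve the step until error drops
            trial = [p - step * g for p, g in zip(params, grad)]
            trial[0] = min(max(trial[0], 0.0), 0.999)  # keep the pole stable
            trial_err = synthesis_error(trial, speech, excitation)
            if trial_err < err:
                params, err = trial, trial_err
                break
            step *= 0.5
        else:
            break  # no improving step found; stop
    return params, err

# Toy "speech": a damped resonance driven by sparse pulses plus weak noise.
random.seed(0)
true_exc = [1.0 if n % 40 == 0 else 0.01 * random.gauss(0, 1) for n in range(160)]
speech = synthesize(0.9, 0.6, true_exc)

# Step 1: conventional LPC analysis and its prediction residual.
a1, a2 = lpc2(speech)
residual = [speech[n]
            - a1 * (speech[n - 1] if n >= 1 else 0.0)
            - a2 * (speech[n - 2] if n >= 2 else 0.0)
            for n in range(len(speech))]

# Step 2: crude multipulse excitation (keep the 8 largest residual samples).
order = sorted(range(len(residual)), key=lambda n: abs(residual[n]), reverse=True)
keep = set(order[:8])
excitation = [residual[n] if n in keep else 0.0 for n in range(len(residual))]

# Step 3: refine the pole pair for this fixed excitation, starting from LPC.
start = lpc_to_root(a1, a2)
err_lpc = synthesis_error(start, speech, excitation)
(r_opt, theta_opt), err_opt = optimize_roots(speech, excitation, start)
print("synthesis error at LPC start:", err_lpc)
print("synthesis error after search:", err_opt)
```

In this sketch the accept-only-if-lower step rule is what ensures the final synthesis error cannot exceed the error at the LPC starting point. Working on the pole radius and angle rather than on the polynomial coefficients also makes it easy to keep the synthesis filter stable by bounding `r` below 1.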