DocumentCode
2852343
Title
Joint optimization of model and excitation in parametric speech coders
Author
Lashkari, Khosrow ; Miki, Toshio
Author_Institution
DoCoMo USA Laboratories, Inc., 181 Metro Drive, San Jose, California 95110, USA
Volume
1
fYear
2002
fDate
13-17 May 2002
Abstract
This paper presents a new Analysis-by-Synthesis (AbS) technique for joint optimization of the excitation and model parameters based on minimizing the closed loop synthesis error instead of the linear prediction error. By minimizing the synthesis error, the analysis and synthesis stages become more compatible. Using a gradient search in the root domain, model parameters for a given excitation are optimized to minimize the error between the original and the synthesized speech. Since the optimization starts from the LPC solution, the synthesis error is guaranteed to be lower than that obtained using the LPC coefficients. For multipulse LPC, there is a 0.5–1 dB improvement in the segmental SNR for male and female speakers over 4 to 6 second long sentences. Listening tests and objective MOS scores confirm the improved speech quality. By adding an extra optimization step, the technique can be incorporated into the LPC, multi-pulse LPC and CELP-type speech coders.
Keywords
Frequency synthesizers; Maximum likelihood detection; Nonlinear filters; Optimization; Production; Signal to noise ratio;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location
Orlando, FL, USA
ISSN
1520-6149
Print_ISBN
0-7803-7402-9
Type
conf
DOI
10.1109/ICASSP.2002.5743708
Filename
5743708
Link To Document