• DocumentCode
    395210
  • Title
    Joint optimization of short-term and long-term predictors in CELP speech coders
  • Author
    Zarrinkoub, Houman ; Mermelstein, Paul
  • Author_Institution
    Inst. Nat. de la Recherche Scientifique, Quebec Univ., Montreal, Que., Canada
  • Volume
    2
  • fYear
    2003
  • fDate
    6-10 April 2003
  • Abstract
    The objective of this work is to investigate whether joint optimization of short-term and long-term predictors offers significant advantages over sequential optimization in speech coding. We propose a new joint optimization method based on Wiener filtering. The proposed analysis model resolves the pitch-bias problem of classical LPC analysis by accounting for the contribution of the long-term predictor while optimizing the short-term predictor. Our approach to joint optimization is based on analysis-by-synthesis and guarantees synthesis filter stability. Applying the proposed joint optimization approach to CELP coding yields superior objective and subjective performance relative to CELP coding with sequential optimization. To provide voice quality equivalent to that of sequentially optimized CELP, the jointly optimized coder needs fewer fixed-codebook (FCB) pulses and a smaller bit budget for LPC quantization. Our listening tests suggest that the JCELP coder at 4.25 kbps is equivalent in quality to G.729 at 8 kbps.
  • Keywords
    Wiener filters; data compression; filtering theory; linear predictive coding; optimisation; speech coding; speech intelligibility; speech synthesis; vector quantisation; 4.25 kbit/s; 8 kbit/s; CELP coding; CELP speech coders; FCB pulses; G.729; LPC analysis; LPC quantization; Wiener filtering; analysis-by-synthesis; bit budget; joint optimization; listening tests; long-term predictor; objective performance; pitch-bias problem; sequential optimization; short-term predictor; speech quality; subjective performance; synthesis filter stability; Optimization methods; Power harmonic filters; Predictive models; Quantization; Signal analysis; Signal synthesis; Wiener filter
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-7663-3
  • Type
    conf
  • DOI
    10.1109/ICASSP.2003.1202318
  • Filename
    1202318