• DocumentCode
    284633
  • Title

    Improved phonetically-segmented vector excitation coding at 3.4 kb/s

  • Author

    Wang, Shihua ; Gersho, Allen

  • Author_Institution
    Dept of Electr. & Comput. Eng., California Univ., Santa Barbara, CA, USA
  • Volume
    1
  • fYear
    1992
  • fDate
    23-26 Mar 1992
  • Firstpage
    349
  • Abstract
    Several major modifications to the phonetically segmented vector excitation coding (PS-VXC) coder by the authors (1989, 1990) reported previously have resulted in enhanced speech quality while reducing the delay, complexity, and bit rate. Speech is segmented into variable-length phonetic classes and a VXC coding module is tailored to each class. Coding techniques include adaptive linear predictive coding (LPC) analysis and interpolation, two-stage excitation coding of onsets, comb filtering, modified perceptual weighting, and pitch contour smoothing. The improved PS-VXC coder operates at a peak rate of 3.4 kb/s with an average rate of 3.0 kb/s and has a subjective performance closely matching that of the 4.8 kb/s DoD CELP coder
  • Keywords
    linear predictive coding; speech coding; 3.4 kbit/s; LPC analysis; adaptive linear predictive coding; bit rate; coding module; comb filtering; complexity; delay; interpolation; modified perceptual weighting; onsets; phonetically segmented vector excitation coding; pitch contour smoothing; speech quality; subjective performance; two-stage excitation coding; variable-length phonetic classes; Adaptive filters; Bit rate; Delay; Filtering; Interpolation; Linear predictive coding; Nonlinear filters; Speech coding; Speech enhancement; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on
  • Conference_Location
    San Francisco, CA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-0532-9
  • Type

    conf

  • DOI
    10.1109/ICASSP.1992.225900
  • Filename
    225900