• DocumentCode
    310676
  • Title
    Non-linear techniques for pitch and waveform enhancement in PWI coders
  • Author
    Li, Hui ; Lockhart, Gordon B.
  • Author_Institution
    Dept. of Electron. & Electr. Eng., Leeds Univ., UK
  • Volume
    2
  • fYear
    1997
  • fDate
    21-24 Apr 1997
  • Firstpage
    1563
  • Abstract
    Two non-linear interpolation techniques are introduced for enhancing speech reproduction in prototype waveform interpolation (PWI) and similar encoders. A temporal differential rate (TDR) vector is used to characterise the non-uniform evolution of pitch cycle temporal structure during interpolation. Experimental results show a clear improvement in the accuracy of decoded pitch cycle lengths and in the reproduction of periodicity in general. It is also shown that waveform reproduction can be significantly improved by vector quantising sets of optimal combination coefficients (OCC) aimed at maximising the similarity between interpolated and target signal segments. OCC derived from time-domain waveform similarity and from frequency-domain spectral envelope similarity are both tested. Subjective assessment suggests a general preference for the non-linear interpolation methods, with the scheme using frequency-domain-derived OCC and perceptual weighting being the most preferred.
  • Keywords
    decoding; frequency-domain analysis; interpolation; spectral analysis; speech coding; speech enhancement; speech processing; time-domain analysis; waveform analysis; PWI coders; decoded pitch cycle lengths; encoders; experimental results; frequency domain spectral envelope similarity; nonlinear interpolation techniques; nonuniform evolution; optimal combination coefficients; perceptual weighting; periodicity reproduction; pitch cycle temporal structure; pitch enhancement; prototype waveform interpolation; subjective assessment; subjective preference; temporal differential rate vector; time domain waveform similarity; vector quantising sets; waveform enhancement; waveform reproduction; Bit rate; Codecs; Decoding; Encoding; Frequency domain analysis; Interpolation; Prototypes; Speech coding; Speech enhancement; Testing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • Conference_Location
    Munich
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type
    conf
  • DOI
    10.1109/ICASSP.1997.596250
  • Filename
    596250