DocumentCode
310676
Title
Non-linear techniques for pitch and waveform enhancement in PWI coders
Author
Li, Hui ; Lockhart, Gordon B.
Author_Institution
Dept. of Electron. & Electr. Eng., Leeds Univ., UK
Volume
2
fYear
1997
fDate
21-24 Apr 1997
Firstpage
1563
Abstract
Two non-linear interpolation techniques are introduced for enhancing speech reproduction in prototype waveform interpolation (PWI) and similar encoders. A temporal differential rate (TDR) vector is used to characterise the non-uniform evolution of pitch cycle temporal structure during interpolation. Experimental results show a clear improvement in the accuracy of decoded pitch cycle lengths and in the reproduction of periodicity in general. It is also shown that waveform reproduction can be significantly improved by vector quantising sets of optimal combination coefficients (OCC) aimed at maximising the similarity between interpolated and target signal segments. OCC derived from time-domain waveform similarity and from frequency-domain spectral envelope similarity are both tested. Subjective assessment suggests a general preference for the non-linear interpolation methods; the scheme using frequency-domain-derived OCC with perceptual weighting was the most preferred.
Keywords
decoding; frequency-domain analysis; interpolation; spectral analysis; speech coding; speech enhancement; speech processing; time-domain analysis; waveform analysis; PWI coders; decoded pitch cycle lengths; encoders; experimental results; frequency domain spectral envelope similarity; nonlinear interpolation techniques; nonuniform evolution; optimal combination coefficients; perceptual weighting; periodicity reproduction; pitch cycle temporal structure; pitch enhancement; prototype waveform interpolation; subjective assessment; subjective preference; temporal differential rate vector; time domain waveform similarity; vector quantising sets; waveform enhancement; waveform reproduction; Bit rate; Codecs; Decoding; Encoding; Frequency domain analysis; Interpolation; Prototypes; Speech coding; Speech enhancement; Testing;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location
Munich
ISSN
1520-6149
Print_ISBN
0-8186-7919-0
Type
conf
DOI
10.1109/ICASSP.1997.596250
Filename
596250