Title :
Combined speech and audio coding using non-linear adaptations
Author :
Chan, Cheung-Fat
Author_Institution :
Dept. of Electron. Eng., City Univ. of Hong Kong, Kowloon, Hong Kong
Abstract :
A combined speech and audio coder is proposed. The coder structure resembles a low-delay CELP coder, however, the excitation gain is adapted non-linearly in a sample-by-sample fashion by using a trained neural network, and the spectral parameters are derived from backward non-linear prediction based on a second-order Volterra filter. A perceptual weighting filter derived from psychoacoustic analysis in the spectral domain is used to shape the coding noise. The proposed non-linear adaptation schemes significantly improve the effectiveness of using an analysis-by-synthesis model for coding audio signal. Simulation results show that transparent coding of wideband (7 kHz) speech and audio at 24 kbps is achieved
Keywords :
adaptive signal processing; audio coding; filtering theory; learning (artificial intelligence); neural nets; spectral analysis; speech coding; speech processing; speech synthesis; vocoders; 24 kbit/s; 7 kHz; analysis-by-synthesis model; audio coder; audio coding; backward nonlinear prediction; coding noise shaping; excitation gain; low-delay CELP coder; nonlinear adaptations; perceptual weighting filter; psychoacoustic analysis; second-order Volterra filter; simulation results; spectral domain; spectral parameters; speech coder; speech coding; trained neural network; transparent coding; wideband audio signals; wideband speech signals; Audio coding; Filters; Neural networks; Noise shaping; Psychoacoustic models; Psychology; Signal analysis; Spectral analysis; Speech coding; Wideband;
Conference_Titel :
Speech Coding For Telecommunications Proceeding, 1997, 1997 IEEE Workshop on
Conference_Location :
Pocono Manor, PA
Print_ISBN :
0-7803-4073-6
DOI :
10.1109/SCFT.1997.623918