DocumentCode
336755
Title
On speech coding in a perceptual domain
Author
Kubin, Gemot ; Kleijn, W. Bastiaan
Author_Institution
Tech. Univ. Wien, Austria
Volume
1
fYear
1999
fDate
15-19 Mar 1999
Firstpage
205
Abstract
For speech coders which fall within the class of waveform coders, the reconstructed signal approaches the original with increasing bit rate. In such coders, the distortion criterion generally operates on the speech signal or a signal obtained by adaptive linear filtering of the speech signal. To satisfy computational and delay constraints, the distortion criterion must be reduced to a very simple approximation of the auditory system. This drawback of conventional approaches motivates a new speech coding paradigm in which the coding is performed in a domain where the single-letter squared-error criterion forms an accurate representation of perception. The new paradigm requires a model of the auditory periphery which is accurate, can be be inverted with relatively low computational effort, and which represents the signal with relatively few parameters. We develop such a model of the auditory periphery and discuss its suitability for speech coding. The results indicate that the new paradigm in general and our auditory model in particular form a promising basis for the coding of both speech and audio at low bit rates
Keywords
adaptive filters; adaptive signal processing; audio coding; channel bank filters; filtering theory; hearing; signal reconstruction; signal representation; speech coding; vocoders; adaptive linear filtering; audio coding; auditory periphery model; auditory system approximation; computational constraint; delay constraint; distortion criterion; filterbank; invertible auditory model; low bit rate coding; perceptual domain; reconstructed signal; single-letter squared-error criterion; source coding; speech coders; speech coding; speech signal representation; waveform coders; Auditory system; Bit rate; Decorrelation; Delay; Distortion measurement; Maximum likelihood detection; Rate distortion theory; Signal design; Source coding; Speech coding;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location
Phoenix, AZ
ISSN
1520-6149
Print_ISBN
0-7803-5041-3
Type
conf
DOI
10.1109/ICASSP.1999.758098
Filename
758098
Link To Document