Title :
A new audio coding scheme using a forward masking model and perceptually weighted vector quantization
Author :
Huang, Yuan-Hao ; Chiueh, Tzi-Dar
Author_Institution :
Dept. of Electr. Eng., Nat. Taiwan Univ., Taipei, Taiwan
fDate :
7/1/2002 12:00:00 AM
Abstract :
This paper presents a new audio coder that includes two techniques to improve the sound quality of the audio coding system. First, a forward masking model is proposed. This model exploits adaptation of the peripheral sensory and neural elements in the auditory system, which is often deemed as the cause of forward masking. In the proposed audio coder, the forward masking is first modeled by a nonlinear analog circuit and then difference equations for finding the solution of this circuit are formulated. The parameters of the circuit are derived from several factors, including time difference between masker and maskee, masker level, masker frequency, and masker duration. Inclusion of this model in the coding process will remove more redundancy inaudible to humans and thus improves the coding efficiency. Secondly, we propose a new vector quantization technique, whose codebooks are generated by a perceptually weighted binary-tree self-organizing feature maps (PW-BTSOFM) algorithm. This vector quantization technique adopts a perceptually weighted error criterion to train and select codewords so that the quantization error is kept below the just-noticed distortion (JND) while using the smallest possible codebook, again reducing the required coded bit rate. Experimental objective and subjective sound quality measurements show that the proposed audio coding scheme requires about 30% less bits than the MPEG layer III audio coding standard.
Keywords :
analogue circuits; audio coding; data compression; difference equations; hearing; nonlinear network analysis; self-organising feature maps; vector quantisation; MPEG layer III audio coding standard; audio coder; audio coding system; audio signal compression; auditory system; binary-tree self-organizing feature maps; circuit parameters; codebook; codebooks; coded bit rate reduction; codewords; coding efficiency; difference equations; forward masking model; just-noticed distortion; masker duration; masker frequency; masker level; neural elements; nonlinear analog circuit; objective sound quality measurements; perceptually weighted VQ; perceptually weighted error criterion; perceptually weighted feature maps; perceptually weighted vector quantization; peripheral sensory elements; quantization error; sound quality; subjective sound quality measurements; Adaptation model; Analog circuits; Audio coding; Auditory system; Difference equations; Frequency; Humans; Nonlinear distortion; Redundancy; Vector quantization;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
DOI :
10.1109/TSA.2002.800559