DocumentCode :
2391494
Title :
A neural network based perceptual audio coder
Author :
Teh, Do-Hui ; Koh, Soo-Ngee ; Huang, Si-Jun ; Tan, Chee-Heng
Author_Institution :
Sch. of Electr. & Electron. Eng., Nanyang Technol. Inst., Singapore
fYear :
1994
fDate :
22-26 Aug 1994
Firstpage :
913
Abstract :
The implementations and performance results of a neural network based perceptual audio coder is reported. The coder uses the configuration of the ISOIMPEG audio layer II coder with the perceptual analysis block replaced by a 2 layer network trained to estimate the masking thresholds required for the bit allocation. The 2 layer network is trained by a back propagation algorithm using the energies in the subbands as inputs and the masking thresholds of obtained from psychoacoustic model II of the ISOIMPEG audio coder as the reference outputs. The result is a coder which performs favourably in quality against the ISOIMPEG audio layer 2 coder at bit rates of 256 kbit/s and 192 kbit/s stereo. Performance at a bit rate of 128 kbit/s stereo was however, found to be poorer
Keywords :
ISO standards; audio coding; backpropagation; feedforward neural nets; hearing; telecommunication computing; telecommunication standards; 2 layer network; ISOIMPEG audio coder; ISOIMPEG audio layer II coder; back propagation algorithm; bit allocation; masking thresholds; neural network based perceptual audio coder; psychoacoustic model II; reference outputs; subbands; Application software; Bit rate; Codecs; Frequency; Humans; ISO standards; Masking threshold; Neural networks; Quantization; Sampling methods;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
TENCON '94. IEEE Region 10's Ninth Annual International Conference. Theme: Frontiers of Computer Technology. Proceedings of 1994
Print_ISBN :
0-7803-1862-5
Type :
conf
DOI :
10.1109/TENCON.1994.369178
Filename :
369178
Link To Document :
بازگشت