Title :
Non-linear encoding of the excitation source using neural networks for transition mode coding in CELP
Author :
Joseph, M. Anand ; Yegnanarayana, B.
Author_Institution :
Int. Inst. of Inf. Technol., Hyderabad, India
Abstract :
When a frame suffers erasure, the adaptive codebook at the decoder is no longer in sync with the one at the encoder. When the frame that is erased is a frame following the voice-onset frame, this loss of synchronization of the codebooks severely degrades the quality of the decoded speech. This degradation is primarily because no meaningful excitation signal is present in the adaptive codebook. In this paper, an autoassociative neural network (AANN) with a compression layer is used to capture the characteristics of the excitation source around the GCIs. A transition mode frame that differs from the conventional CELP frame without altering the bit-rate is proposed to deal with this problem of frame drops during transition regions. In this transition mode frames, the compressed representation of the excitation source around the GCIs obtained through AANNs is used to reconstruct the adaptive codebook at the receiver. It is shown that the proposed method improves the quality of the decoded speech.
Keywords :
decoding; neural nets; nonlinear codes; speech coding; AANN; CELP frame; GCI; adaptive codebook synchronization; autoassociative neural network; decoded speech quality; excitation source; excitation source compressed representation; nonlinear encoding; transition mode coding; voice-onset frame; Bit rate; Decoding; Neural networks; Speech; Speech coding; Synchronization; CELP; GCI; neural network; speech coding; transition mode coding;
Conference_Titel :
Signal Processing and Communications (SPCOM), 2012 International Conference on
Conference_Location :
Bangalore
Print_ISBN :
978-1-4673-2013-9
DOI :
10.1109/SPCOM.2012.6290018