Title :
Joint Source Channel Speech Decoding using long-term residual redundancy
Author :
Pourmir, Arezou M. ; Lahouti, Farshad
Author_Institution :
Sch. of ECE, Univ. of Tehran, Tehran
Abstract :
In this paper, we present a minimum mean squared error decoding scheme for the reconstruction of encoded speech transmitted over a noisy channel, utilizing the residual redundancy due to the quasi-periodic property of voiced speech segments. The proposed algorithm is in general applicable to any speech codec, provided that the long-term memory is not exploited in the encoding process. An enabling step of the proposed solution is pitch extraction from the noisy source encoder output. This work is the first attempt within joint source channel decoding (JSCD) to use the long-term residual redundancy of the speech encoder output. Prior art has focused on the short-term redundancy, that is, the memory between successive symbols. Simulation results, based on the 32 kbps ADPCM codec of the ITU-T G.726 standard, indicate enhanced reconstruction performance compared to the standard decoder.
Keywords :
combined source-channel coding; decoding; feature extraction; mean square error methods; signal reconstruction; speech codecs; ITU-T G.726 standard; bit rate 32 kbit/s; joint source channel speech decoding; long-term residual redundancy; minimum mean squared error decoding scheme; pitch extraction; quasi-periodic property; speech codec; AWGN channels; Art; Code standards; Decoding; Educational institutions; Multimedia communication; Redundancy; Speech codecs; Speech enhancement; Wireless communication;
Conference_Title :
Software, Telecommunications and Computer Networks, 2008. SoftCOM 2008. 16th International Conference on
Conference_Location :
Split
Print_ISBN :
978-953-6114-97-9
Electronic_ISBN :
978-953-290-009-5
DOI :
10.1109/SOFTCOM.2008.4669505