Title :
Signal modification for robust speech coding
Author :
Kim, Nam Soo ; Cha, Joon-Hyuk
Author_Institution :
Sch. of Electr. Eng. & INMC, Seoul Nat. Univ., South Korea
Abstract :
Usually, the performance of a low-bit-rate speech coder degrades seriously in the presence of various interfering signals such as the background noise, acoustic echo, co-talkers´ speech and other unwanted signals. This comes from the mismatch between the input signal and the assumed speech production model on which the design of the given speech coder is based. In this paper, we present an approach to modify the input signal such that it can be coded more effectively within the generalized analysis-by-synthesis framework. Signal modification in the presented approach is performed according to a criterion which makes a compromise between the modification and coder quantization errors. The coder-decoder (CODEC) characteristic is described in terms of a transfer matrix, and an on-line method using the recursive least square (RLS) technique is proposed to estimate it. Since each part of the speech signal is differently affected by the modification, we also devise an adaptive method based on the signal-to-quantization noise ratio (SQNR). In contrast to the conventional modification techniques, our approach can be implemented as a simple front-end for any analysis-by-synthesis type coders.
Keywords :
least squares approximations; quantisation (signal); recursive estimation; speech codecs; speech coding; speech synthesis; RLS technique; acoustic echo; analysis-by-synthesis type coders; background noise; codec; coder quantization error; coder-decoder characteristics; input signals; interfering signals; low bit rate speech coder; recursive least square technique; signal modification; signal-to-quantization noise ratio; speech coding; speech production model; transfer matrix; unwanted signals; Background noise; Codecs; Degradation; Least squares approximation; Noise robustness; Quantization; Signal analysis; Signal design; Speech coding; Speech enhancement;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on
DOI :
10.1109/TSA.2003.819946