Title :
A novel transcoding algorithm for AMR and EVRC speech codecs via direct parameter transformation
Author :
Lee, Sunil ; Seo, Seongho ; Jang, Dalwon ; Yoo, Chang D.
Author_Institution :
Dept. of Electr. Eng. & Comput. Sci., Korea Adv. Inst. of Sci. & Technol., Daejeon, South Korea
Abstract :
A novel transcoding algorithm for the adaptive multi rate (AMR) codec and the enhanced variable rate codec (EVRC) is proposed. In contrast to the conventional tandem transcoding algorithm, the proposed algorithm transcodes the parameters of one codec to the other without synthesizing the speech. The proposed algorithm decodes the parameters of source codec from the input bitstream, and based on frame classification and mode decision, it appropriately transforms the parameters of source codec to those of the target codec in the parametric domain. Finally, the transformed parameters are encoded into a bitstream that is decodable by the target codec. The parameters transcoded by the proposed algorithm are line-spectral pair (LSP), pitch delay, fixed codevector, codebook gains, and frame energy. Evaluation results show that while reducing both the computational complexity and delay by 50%, the proposed algorithm produces speech quality equivalent to that of produced by the tandem transcoding algorithm. The general idea is not restricted to the AMR and EVRC but is applicable to various other code-excited linear prediction (CELP) based codecs.
Keywords :
adaptive codes; computational complexity; decoding; delays; spectral analysis; speech codecs; AMR speech codec; CELP based codecs; EVRC speech codec; adaptive multi rate codec; code-excited linear prediction based codecs; codebook gains; computational complexity reduction; direct parameter transformation; enhanced variable rate codec; fixed codevector; frame classification; frame energy; input bitstream; line-spectral pair; mode decision; parametric domain; pitch delay; source codec parameters decoding; speech quality; tandem transcoding algorithm; transcoding algorithm; Computational complexity; Computer science; Decoding; Delay; Encoding; Speech analysis; Speech codecs; Speech coding; Speech synthesis; Transcoding;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1202323