Title :
Scalable speech coding at rates below 900 BPS
Author :
Jahangiri, Ehsan ; Ghaemmaghami, Shahrokh
Author_Institution :
Electron. Res. Center, Sharif Univ. of Technol., Tehran
fDate :
June 23 2008-April 26 2008
Abstract :
This paper introduces a novel scalable speech coding scheme based on embedded matrix quantization of LSF parameters in an LPC model. In the proposed quantizer, codewords are organized based on a tree structure through a cell-merging process, which leads to a fine-grain scalable coder at rates below 900 bps. Near natural sounding is achieved at very low rates by employing an efficient adaptive dual-band scheme to approximate the LPC excitation signals. Evaluation results, obtained from both overall quality measurement and intelligibility assessment, show that the proposed coder could be a reasonable choice for improving the bottom-line speech quality in low bit rates.
Keywords :
linear predictive coding; quantisation (signal); speech coding; LPC model; LSF parameters; adaptive dual-band scheme; bottom-line speech quality; cell-merging process; embedded matrix quantization; fine-grain scalable coder; scalable speech coding; tree structure; Bit rate; Dual band; Encoding; Frequency; Linear predictive coding; Quantization; Speech analysis; Speech coding; Speech synthesis; Tree data structures; Very low rate speech coding; dual-band excitation; embedded matrix quantization; scalable speech coding;
Conference_Titel :
Multimedia and Expo, 2008 IEEE International Conference on
Conference_Location :
Hannover
Print_ISBN :
978-1-4244-2570-9
Electronic_ISBN :
978-1-4244-2571-6
DOI :
10.1109/ICME.2008.4607377