Title :
Personal speech coding
Author :
Wenhui Jin ; Chan, Wai-Yip
Author_Institution :
Dept. of Electr. & Comput. Eng., Illinois Inst. of Technol., Chicago, IL, USA
Abstract :
In existing speech coding systems, all quantizer codebooks are designed to suit the statistical and perceptual characteristics of speech signals of a population of speakers. However, an individual´s speech signal does not exhibit, even over a long time, the entire range of characteristics of the population. With the advent of the personal communication systems, personal information might become available and be used to improve the rate-distortion performance of speech coders. We assess the potential gain of personal speech coding by designing codebooks for individual speakers. Spectral quantisation, excitation and pitch lag codebooks of existing CELP coders are redesigned. The gains appear to be modest, suggesting that we need to use a different coding framework, which can model personal characteristics explicitly. Amongst the components, the spectral quantizer seems to be most amenable to personalization
Keywords :
linear predictive coding; personal communication networks; quantisation (signal); rate distortion theory; spectral analysis; speech coding; speech synthesis; vocoders; CELP coders; VSELP platform; excitation codebook; linear prediction based analysis-by-synthesis; perceptual characteristics; personal communication systems; personal information; personal speech coding; pitch lag codebook; population; quantizer codebooks; rate-distortion performance; spectral quantisation; speech coders; speech signals; statistical characteristics; Databases; Decoding; Distortion measurement; Explosives; Personal communication networks; Quantization; Rate-distortion; Signal design; Speech coding; Wireless communication;
Conference_Titel :
Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7803-4428-6
DOI :
10.1109/ICASSP.1998.674368