Title :
A variable-rate natural-quality parametric speech coder
Author :
Das, Amitava ; Gersho, Allen
Author_Institution :
Dept. of Electr. & Comput. Eng., California Univ., Santa Barbara, CA, USA
Abstract :
A variable bit rate speech coder operating below 2 kb/s is presented which combines phonetic classification and frequency domain modeling. Each input frame is identified as one of the following classes: mixed voiced, fully voiced, unvoiced, noise, and silence. Based on the class parameter, a suitable parametric coding scheme (spectral analysis, modeling, quantization, and synthesis) is selected. The coder thereby adaptively matches a suitable coding scheme to the character of the input speech leading to variable rates ranging from 0.15 kb/s (for silence) to 2.6 kb/s (for mixed voiced). For typical conversational speech, the coder operates at an average rate of 1.4 kb/s while delivering speech quality that is subjectively preferred over Federal Standard 1016 CELP at 4.8 kb/s by a majority of listeners
Keywords :
frequency-domain analysis; spectral analysis; speech coding; speech synthesis; variable rate codes; vocoders; conversational speech; frequency domain modeling; fully voiced; input frame; mixed voiced; noise; parametric coding; phonetic classification; quantization; silence; spectral analysis; speech quality; synthesis; unvoiced; variable-rate natural-quality parametric speech coder; Bit rate; Frequency domain analysis; Information processing; Quantization; Space technology; Spectral shape; Speech coding; Speech enhancement; Speech synthesis; Vocoders;
Conference_Titel :
Communications, 1994. ICC '94, SUPERCOMM/ICC '94, Conference Record, 'Serving Humanity Through Communications.' IEEE International Conference on
Conference_Location :
New Orleans, LA
Print_ISBN :
0-7803-1825-0
DOI :
10.1109/ICC.1994.369057