Title :
TTS based very low bit rate speech coder
Author :
Lee, Ki-Seung ; Cox, Richard V.
Author_Institution :
AT&T Labs.-Res., Florham Park, NJ, USA
Abstract :
This paper addresses a speech coder which uses a text-to-speech (TTS) synthesis system to achieve very low bit rates (sub 1 kbps). The main issue of the work is the accurate coding of the pitch (f0) and gain contours which are principle components of prosody. This is of paramount interest since the correct prosody will increase naturalness and an efficient coding scheme will provide high coding gain. Together with the phonetic transcription, the f0 and gain contour constitute the parameters that are necessary for the TTS system to synthesize the speech signal. Piecewise linear approximation is used to code the f0 parameter. A technique which minimizes the bit rate while maintaining f0 error below a given threshold are described. To obtain both high compression and smoothly changing gain contours, the variance of the signal is averaged over each half phoneme length is transmitted as gain information. With single speaker stimuli, and a priori text transcription information, we obtained natural sounding speech at an average bit rate of about 300 bps
Keywords :
data compression; speech coding; speech intelligibility; speech synthesis; vocoders; 30 bit/s; TTS; TTS system; a priori text transcription information; average bit rate; bit rate minimisation; gain contour coding; gain information; half phoneme length; high coding gain; high compression; natural sounding speech; phonetic transcription; piecewise linear approximation; pitch coding; prosody; single speaker stimuli; text-to-speech synthesis system; very low bit rate speech coder; Approximation error; Bit rate; Delay; Loudspeakers; Piecewise linear approximation; Signal synthesis; Speech coding; Speech synthesis; Synthesizers; Vocoders;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference on
Conference_Location :
Phoenix, AZ
Print_ISBN :
0-7803-5041-3
DOI :
10.1109/ICASSP.1999.758092