Title :
A formant vocoder based on mixtures of Gaussians
Author :
Zolfaghari, Parham ; Robinson, Tony
Author_Institution :
Dept. of Eng., Cambridge Univ., UK
Abstract :
This paper describes a new low bit-rate formant vocoder. The formant parameters are represented by Gaussian mixture distributions, which are estimated from the discrete Fourier transform (DFT) magnitude spectrum of the speech signal. A voiced/unvoiced classification mechanism has been developed based on the harmonic nature of each formant in the DFT spectrum modulated by the Gaussian mixture distribution. Using a magnitude-only sinusoidal synthesiser, intelligible synthetic speech has been obtained. Vector quantisation of the vocal tract parameters enables this formant vocoder to operate at a bit-rate of 1248 bps
Keywords :
Gaussian distribution; discrete Fourier transforms; spectral analysis; speech coding; speech intelligibility; speech synthesis; vector quantisation; vocoders; 1248 bit/s; DFT magnitude spectrum; DFT spectrum modulation; Gaussian mixture distributions; discrete Fourier transform; formant parameters; formant vocoder; intelligible synthetic speech; low bit rate formant vocoder; magnitude only sinusoidal synthesiser; speech signal; vector quantisation; vocal tract parameters; voiced/unvoiced classification; Data mining; Discrete Fourier transforms; Gaussian distribution; Gaussian processes; Resonance; Signal synthesis; Speech analysis; Speech synthesis; Vector quantization; Vocoders;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
Print_ISBN :
0-8186-7919-0
DOI :
10.1109/ICASSP.1997.596253