Title :
Parametric models of the magnitude/phase spectrum for harmonic speech coding
Author :
Thomson, David L.
Author_Institution :
AT&T Bell Lab., Naperville, IL, USA
Abstract :
A method is described for representing magnitude and phase in a sinusoidal transform coder. Instead of transmitting individual sinusoids, the entire speech spectrum is transmitted. The synthesizer estimates the frequency, amplitude, and phase of each harmonic from the spectrum. Relatively high-quality speech in the 4.8-9.6 kb/s range is obtained by modeling the magnitude/phase spectrum with a combination of pole-zero analysis, phase prediction, and vector quantization. A window subtraction method ensures proper synthesis of unvoiced speech. The system is robust because it does not depend on pitch estimates or voicing decisions.
Keywords :
encoding; speech analysis and processing; speech synthesis; 4.8 to 9.6 kbit/s; harmonic speech coding; magnitude/phase spectrum; parametric models; phase prediction; pole-zero analysis; sinusoidal transform coder; speech spectrum; unvoiced speech synthesis; vector quantization; window subtraction method; Amplitude estimation; Frequency estimation; Frequency synthesizers; Parametric statistics; Phase estimation; Predictive models; Robustness; Speech analysis; Speech synthesis; Vector quantization
Conference_Title :
Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on
Conference_Location :
New York, NY
DOI :
10.1109/ICASSP.1988.196596