Title :
Phase coherence in speech reconstruction for enhancement and coding applications
Author :
Quatieri, Thomas F. ; McAulay, Robert J.
Author_Institution :
MIT Lincoln Lab., Lexington, MA, USA
Abstract :
It has been shown that an analysis-synthesis system based on a sinusoidal representation leads to synthetic speech that is essentially perceptually indistinguishable from the original. A change in speech quality has been observed, however, when the phase relation of the sine waves is altered. This occurs in practice when sine waves are processed for speech enhancement and for speech coding. A description is given of a zero-phase sinusoidal analysis-synthesis system which generates natural-sounding speech without the requirement of vocal tract phase. The method provides a basis for improving sound quality by providing different levels of phase coherence in speech reconstruction for time-scale modification, for a baseline system for coding, and for reducing the peak-to-RMS ratio by dispersion.
Keywords :
encoding; speech analysis and processing; speech synthesis; baseline system; dispersion; natural-sounding speech; original; peak-to-RMS ratio; perceptually indistinguishable; phase coherence; phase relation; sine waves; sinusoidal representation; sound quality; speech coding; speech enhancement; speech quality; speech reconstruction; synthetic speech; time-scale modification; zero-phase sinusoidal analysis-synthesis system; Coherence; Frequency; Laboratories; Pulse shaping methods; Shape control; Speech analysis; Speech coding; Speech enhancement; Speech synthesis; Time varying systems
Conference_Title :
Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on
Conference_Location :
Glasgow
DOI :
10.1109/ICASSP.1989.266401