Title :
Pitch estimation and voicing detection based on a sinusoidal speech model
Author :
McAulay, Robert J. ; Quatieri, Thomas F.
Author_Institution :
Lincoln Lab., MIT, Lexington, MA, USA
Abstract :
A technique for estimating the pitch of a speech waveform is developed. It fits a harmonic set of sine waves to the input data using a mean-squared-error (MSE) criterion. By exploiting a sinusoidal model for the input speech waveform, a pitch estimation criterion is derived that is inherently unambiguous, uses pitch-adaptive resolution, uses small-signal suppression to provide enhanced discrimination, and uses amplitude compression to eliminate the effects of pitch-formant interaction. The normalized minimum mean squared error proves to be a powerful discriminant for estimating the likelihood that a given frame of speech is voiced.
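
The core idea in the abstract, fitting a harmonic set of sine waves under an MSE criterion and reading the normalized minimum error as a voicing cue, can be illustrated with a minimal sketch. This is not the authors' criterion: it is a plain least-squares harmonic fit over a grid of candidate pitches, and it omits the pitch-adaptive resolution, small-signal suppression, and amplitude compression described above. The function names, the candidate grid, the number of harmonics, and the test signals are all assumptions made for illustration. The candidate range is deliberately kept narrower than one octave below the true pitch, because a plain fit like this one does not resolve subharmonic ambiguity on its own.

```python
import numpy as np

def harmonic_fit_error(frame, f0, fs, n_harmonics):
    """Normalized residual energy after a least-squares fit of
    n_harmonics sine/cosine pairs at multiples of f0 (hypothetical helper)."""
    n = np.arange(len(frame))
    k = np.arange(1, n_harmonics + 1)
    phase = 2.0 * np.pi * np.outer(n, k) * f0 / fs
    # Cosine and sine columns let the fit absorb arbitrary amplitude and phase.
    basis = np.hstack([np.cos(phase), np.sin(phase)])
    coeffs, *_ = np.linalg.lstsq(basis, frame, rcond=None)
    residual = frame - basis @ coeffs
    return float(residual @ residual) / float(frame @ frame)

def estimate_pitch(frame, fs, candidates, n_harmonics=3):
    """Return the candidate pitch with the smallest normalized error,
    together with that error (small error suggests a voiced frame)."""
    errors = np.array(
        [harmonic_fit_error(frame, f0, fs, n_harmonics) for f0 in candidates]
    )
    best = int(np.argmin(errors))
    return float(candidates[best]), float(errors[best])

# Synthetic data (assumed, not from the paper): a 40 ms frame at 8 kHz.
fs = 8000.0
t = np.arange(320) / fs
voiced = sum(np.sin(2.0 * np.pi * 200.0 * k * t + 0.3 * k) / k for k in (1, 2, 3))
noise = np.random.default_rng(0).standard_normal(320)

candidates = np.arange(150.0, 351.0, 5.0)  # Hz, assumed search grid
f0_voiced, err_voiced = estimate_pitch(voiced, fs, candidates)
f0_noise, err_noise = estimate_pitch(noise, fs, candidates)
```

On the synthetic harmonic frame the normalized error at the true pitch is close to zero, while on the noise frame it stays close to one for every candidate, which is the sense in which the minimum error can serve as a voicing discriminant. A threshold on that error would be a further assumption and is left out here.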
Keywords :
acoustic variables measurement; speech analysis and processing; MSE criterion; amplitude compression; enhanced discrimination; harmonic set; input data; minimum mean squared error; pitch estimation criterion; pitch-adaptive resolution; pitch-formant interaction; sine waves; sinusoidal speech model; small-signal suppression; speech waveform; voicing detection; Amplitude estimation; Frequency measurement; Harmonic analysis; Laboratories; Phase measurement; Robustness; Speech analysis; Speech coding; Speech enhancement; Speech synthesis
Conference_Title :
Acoustics, Speech, and Signal Processing, 1990. ICASSP-90., 1990 International Conference on
Conference_Location :
Albuquerque, NM
DOI :
10.1109/ICASSP.1990.115585