DocumentCode :
3031484
Title :
Maximum likelihood pitch estimation using state-variable techniques
Author :
McAulay, Robert J.
Author_Institution :
MIT Lincoln Laboratory
Volume :
3
fYear :
1978
fDate :
28581
Firstpage :
12
Lastpage :
14
Abstract :
The problem of estimating the pitch period of a speech waveform contaminated by acoustically coupled background noise is formulated to include the properties of the spectral envelope by postulating a state-variable model for the speech generation process. Applying the maximum likelihood estimation technique, the optimum processor uses a Kalman filter preprocessor to flatten the spectrum. The resulting signal is then passed through a bank of comb filters and the optimum pitch corresponds to the comb filter for which the output energy is smallest. The Kalman prefilter reduces to an LPC filter only when the speech is generated by an all-pole process and the signal-to-noise ratio is large. For the low signal-to-noise ratio case, a parallel formant speech generation model is more likely to lead to practical numerical algorithms for estimating the spectral coefficients.
Keywords :
Background noise; Filter bank; Kalman filters; Linear predictive coding; Maximum likelihood estimation; Signal generators; Signal to noise ratio; Speech enhancement; Speech processing; State estimation;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '78.
Type :
conf
DOI :
10.1109/ICASSP.1978.1170436
Filename :
1170436
Link To Document :
بازگشت