Title :
Autoregressive modeling of voiced speech
Author :
Berezina, Maria A. ; Rudoy, Daniel ; Wolfe, Patrick J.
Author_Institution :
Div. of Health Sci.&Technol., Harvard-MIT, MA, USA
Abstract :
It is well known that the classical linear predictive model for speech fails to take into account the quasi-periodic nature of the glottal flow typical of voiced speech. In this article we describe how to incorporate an estimate of the glottal flow directly into the traditional linear prediction framework, through the use of flexible basis function expansions that admit efficient estimation procedures. As we show, this not only allows for improved estimation of vocal tract transfer function parameters in a manner that is robust to pitch variation, but also precludes the need for nonlinear optimization procedures typically required in glottal waveform estimation. We illustrate our approach with experiments using synthesized and real speech waveforms, and show how it may be used to directly estimate the relative degree of voicing and aspiration present in a given utterance.
Keywords :
acoustic signal processing; autoregressive processes; speech; speech processing; autoregressive modeling; classical linear predictive model; glottal flow; nonlinear optimization; speech waveforms; vocal tract transfer function parameters; voiced speech; Additive noise; Auditory system; Filters; Gaussian processes; Parameter estimation; Predictive models; Robustness; Speech analysis; Speech synthesis; Transfer functions; Glottal flow; linear prediction; source harmonics-to-noise ratio; spectral estimation; wavelet regression;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495058