Title :
Speech Spectrum Modeling for Joint Estimation of Spectral Envelope and Fundamental Frequency
Author :
Kameoka, Hirokazu ; Ono, Nobutaka ; Sagayama, Shigeki
Author_Institution :
Media Inf. Lab., NTT Commun. Sci. Labs., Atsugi, Japan
Abstract :
Although considerable effort has been devoted to both fundamental frequency (F0) and spectral envelope estimation in the field of speech processing, the problem of determining F0 and spectral envelopes has largely been tackled independently. If F0 were known in advance, then the spectral envelope could be estimated very reliably. On the other hand, if the spectral envelope were known in advance, then we could obtain a reliable F0 estimate. F0 and the spectral envelope, each of which is a prerequisite of the other, should thus be estimated jointly rather than independently in succession. On this basis, we develop a parametric speech spectrum model that allows us to estimate the F0 and spectral envelope simultaneously. We confirmed experimentally the significant advantage of this joint estimation approach for both F0 estimation and spectral envelope estimation.
Keywords :
expectation-maximisation algorithm; spectral analysis; speech processing; expectation-maximization algorithm; fundamental frequency; joint estimation; spectral envelope estimation; speech processing; speech spectrum modeling; $F_{0}$ estimation; Expectation–maximization (EM) algorithm; spectral envelope estimation; speech analysis;
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2009.2036287