Title :
Decomposition of speech into voiced and unvoiced components based on a state-space signal model
Author :
Thomson, Mark ; Boland, Sinion ; Wu, Mike ; Epps, Julien ; Smithers, M.
Author_Institution :
Motorola Labs, Botany, NSW, Australia
Abstract :
We present a novel method for decomposing speech into voiced and unvoiced components. After demodulating the variations in the spectral envelope, energy and pitch, the method involves applying a bank of Kalman filters to separate the harmonic and non-harmonic components of the signal. This approach relies on a state-space representation of the composite signal, and provides a way to estimate accurately the harmonic component without the large delay required by a linear phase comb filter. However it also requires prior knowledge of the variance of the unvoiced component and the state transition parameters. We present a novel method to determine these parameters accurately based on a variant of the expectation-maximization algorithm. Modifications for dealing with unvoiced segments and voicing onset are also described.
Keywords :
Kalman filters; channel bank filters; demodulation; harmonic analysis; harmonics; optimisation; parameter estimation; speech coding; state-space methods; Kalman filter bank; demodulation; expectation-maximization algorithm; harmonic components; linear phase comb filter; nonharmonic components; signal representation; spectral envelope; speech coding; speech decomposition; state-space signal model; unvoiced components; voiced components; voicing onset; Australia; Delay estimation; Laboratories; Low pass filters; Nonlinear filters; Phase estimation; Power harmonic filters; Random variables; Speech coding; Stochastic processes;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International Conference on
Print_ISBN :
0-7803-7663-3
DOI :
10.1109/ICASSP.2003.1198741