Title :
Model-Based Speech Enhancement With Improved Spectral Envelope Estimation via Dynamics Tracking
Author :
Chen, Ruofei ; Chan, Cheung-Fat ; So, Hing Cheung
Author_Institution :
Dept. of Electron. Eng., City Univ. of Hong Kong, Kowloon, China
fDate :
5/1/2012 12:00:00 AM
Abstract :
In this work, we present a model-based approach to enhance noisy speech using an analysis-synthesis framework. Target speech is reconstructed with model parameters estimated from noisy observations. In particular, spectral envelope is estimated by tracking its temporal trajectories in order to improve the noise-distorted short-time spectral amplitude. Initially, we propose an analysis-synthesis framework for speech enhancement based on harmonic noise model (HNM). Acoustic parameters such as pitch, spectral envelope, and spectral gain are extracted from HNM analysis. Spectral envelope estimation is improved by tracking its line spectrum frequency trajectories through Kalman filtering. System identification of Kalman filter is achieved via a combined design of codebook mapping scheme and maximum-likelihood estimator with parallel training data. Complete system design and experimental validations are given in details. Through performance evaluation based on a study of spectrogram, objective measures and a subjective listening test, it is demonstrated that the proposed approach achieves significant improvement over conventional methods in various conditions. A distinct advantage of the proposed method is that it successfully tackles the “musical tones” problem.
Keywords :
Kalman filters; acoustic signal processing; maximum likelihood estimation; performance evaluation; signal reconstruction; spectral analysis; speech enhancement; speech synthesis; target tracking; HNM analysis; Kalman filtering; acoustic parameters; analysis-synthesis framework; codebook mapping scheme; dynamics tracking; harmonic noise model; maximum likelihood estimator; model parameter estimation; model-based approach; model-based speech enhancement; musical tones problem; noise-distorted short-time spectral amplitude; noisy speech enhancement; performance evaluation; spectral envelope estimation; speech reconstruction; system identification; temporal trajectory tracking; Harmonic analysis; Kalman filters; Noise; Noise measurement; Speech; Speech enhancement; Codebook mapping; Kalman filter; harmonic noise model (HNM); speech analysis; speech synthesis; vector quantization (VQ);
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2011.2177821