Title :
Noisy speech enhancement based on prior knowledge about spectral envelope and harmonic structure
Author :
Yoshioka, Takuya ; Nakatani, Tomohiro ; Okuno, Hiroshi G.
Author_Institution :
NTT Commun. Sci. Labs., NTT Corp., Kyoto, Japan
Abstract :
This paper considers the enhancement of noisy speech. Earlier studies have revealed that an approach that enhances spectral envelopes by using prior knowledge about the all-pole (AP) model parameters of clean speech learnt from speech corpora is advantageous in terms of the amount of musical noise and speech distortion. This paper proposes a new speech enhancement method, in which harmonic structure enhancement is incorporated in learning-based spectral envelope enhancement to further improve performance. The harmonic structure is represented by using a harmonic Gaussian mixture model (GMM), which is parameterized by a voicing indicator and a fundamental frequency. The parameters of the AP model and the harmonic GMM are jointly estimated by maximum a posteriori estimation, thus enabling the enhancement of spectral envelopes and harmonic structures in a unified framework. The proposed method outperforms the spectral envelope enhancement approach by 0.85 dB in cepstral distance.
Keywords :
Gaussian processes; learning (artificial intelligence); maximum likelihood estimation; speech enhancement; all-pole model parameter; harmonic Gaussian mixture model; harmonic structure enhancement; learning-based spectral envelope enhancement; maximum a posteriori estimation; musical noise; noisy speech enhancement; spectral envelope enhancement approach; speech distortion; voicing indicator; Acoustic noise; Dictionaries; Frequency; Laboratories; Linear predictive coding; Parameter estimation; Power harmonic filters; Power system harmonics; Speech enhancement; Wiener filter; Speech enhancement; harmonic structure; learning; spectral envelope;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495681