DocumentCode :
23081
Title :
Speech Spectral Envelope Enhancement by HMM-Based Analysis/Resynthesis
Author :
Carmona, J.L. ; Barker, J. ; Gomez, Angel M. ; Ning Ma
Author_Institution :
Dept. of Teor. de la Senal, Univ. de Granada, Granada, Spain
Volume :
20
Issue :
6
fYear :
2013
fDate :
Jun-13
Firstpage :
563
Lastpage :
566
Abstract :
We propose a speech enhancement-by-resynthesis framework whose strength lies in a common statistical speech model that is shared by the analysis and synthesis stages. First, a spectro-temporal analysis is performed and masked spectro-temporal regions are identified using a noise model. Then, HMM synthesis is used to reconstruct the spectral envelope in masked regions in a manner which is conditioned on the reliable regions, preventing the resynthesis from regressing to the training data mean. As a demonstration we enhance noise-corrupted speech utterances from a small vocabulary corpus for which good statistical models are available. Perceptual evaluation of speech quality and log spectral distances demonstrate considerable performance improvements over baseline approaches that do not exploit strong speech knowledge. The letter is accompanied by audio examples.
Keywords :
hidden Markov models; speech enhancement; statistical analysis; HMM-based analysis-resynthesis; noise-corrupted speech utterance enhancement; spectral envelope reconstruction; spectro-temporal analysis; spectro-temporal regions; speech enhancement-by-resynthesis framework; speech quality perceptual evaluation; speech spectral envelope enhancement; statistical speech model; vocabulary corpus; Acoustics; Hidden Markov models; Reliability; Speech; Speech enhancement; Speech recognition; Vectors; Hidden Markov models; speech enhancement; statistical synthesis;
fLanguage :
English
Journal_Title :
Signal Processing Letters, IEEE
Publisher :
ieee
ISSN :
1070-9908
Type :
jour
DOI :
10.1109/LSP.2013.2255125
Filename :
6502679
Link To Document :
بازگشت