DocumentCode :
269307
Title :
Speech Enhancement with EMD and Hurst-Based Mode Selection
Author :
Zão, L. ; Coelho, Rui ; Flandrin, Patrick
Author_Institution :
Defense Eng., Mil. Inst. of Eng., Rio de Janeiro, Brazil
Volume :
22
Issue :
5
fYear :
2014
fDate :
May-14
Firstpage :
899
Lastpage :
911
Abstract :
This paper presents a speech enhancement technique for signals corrupted by nonstationary acoustic noises. The proposed approach applies the empirical mode decomposition (EMD) to the noisy speech signal and obtains a set of intrinsic mode functions (IMF). The main contribution of the proposed procedure is the adoption of the Hurst exponent in the selection of IMFs to reconstruct the speech. This EMD and Hurst-based (EMDH) approach is evaluated in speech enhancement experiments considering environmental acoustic noises with different indices of nonstationarity. The results show that the EMDH improves the segmental signal-to-noise ratio and an overall quality composite measure, encompassing the perceptual evaluation of speech quality (PESQ). Moreover, the short-time objective intelligibility (STOI) measure reinforces the superior performance of EMDH. Finally, the EMDH is also examined in a speaker identification task in noisy conditions. The proposed technique leads to the highest speaker identification rates when compared to the baseline speech enhancement algorithms and also to a multicondition training procedure.
Keywords :
speaker recognition; speech enhancement; EMD and Hurst based approach; Hurst based mode selection; Hurst exponent; IMF; PESQ; STOI; empirical mode decomposition; environmental acoustic noises; intrinsic mode functions; noisy speech signal; nonstationary acoustic noises; perceptual evaluation of speech quality; short time objective intelligibility; signal-to-noise ratio; speaker identification; speech enhancement technique; speech reconstruction; IEEE transactions; Noise measurement; Signal to noise ratio; Speech; Speech enhancement; Empirical mode decomposition; hurst exponent; index of nonstationarity; speaker identification; speech enhancement;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
Publisher :
ieee
ISSN :
2329-9290
Type :
jour
DOI :
10.1109/TASLP.2014.2312541
Filename :
6775298
Link To Document :
بازگشت