DocumentCode
269307
Title
Speech Enhancement with EMD and Hurst-Based Mode Selection
Author
Zão, L. ; Coelho, Rui ; Flandrin, Patrick
Author_Institution
Defense Eng., Mil. Inst. of Eng., Rio de Janeiro, Brazil
Volume
22
Issue
5
fYear
2014
fDate
May-14
Firstpage
899
Lastpage
911
Abstract
This paper presents a speech enhancement technique for signals corrupted by nonstationary acoustic noises. The proposed approach applies the empirical mode decomposition (EMD) to the noisy speech signal and obtains a set of intrinsic mode functions (IMF). The main contribution of the proposed procedure is the adoption of the Hurst exponent in the selection of IMFs to reconstruct the speech. This EMD and Hurst-based (EMDH) approach is evaluated in speech enhancement experiments considering environmental acoustic noises with different indices of nonstationarity. The results show that the EMDH improves the segmental signal-to-noise ratio and an overall quality composite measure, encompassing the perceptual evaluation of speech quality (PESQ). Moreover, the short-time objective intelligibility (STOI) measure reinforces the superior performance of EMDH. Finally, the EMDH is also examined in a speaker identification task in noisy conditions. The proposed technique leads to the highest speaker identification rates when compared to the baseline speech enhancement algorithms and also to a multicondition training procedure.
Keywords
speaker recognition; speech enhancement; EMD and Hurst based approach; Hurst based mode selection; Hurst exponent; IMF; PESQ; STOI; empirical mode decomposition; environmental acoustic noises; intrinsic mode functions; noisy speech signal; nonstationary acoustic noises; perceptual evaluation of speech quality; short time objective intelligibility; signal-to-noise ratio; speaker identification; speech enhancement technique; speech reconstruction; IEEE transactions; Noise measurement; Signal to noise ratio; Speech; Speech enhancement; Empirical mode decomposition; hurst exponent; index of nonstationarity; speaker identification; speech enhancement;
fLanguage
English
Journal_Title
Audio, Speech, and Language Processing, IEEE/ACM Transactions on
Publisher
ieee
ISSN
2329-9290
Type
jour
DOI
10.1109/TASLP.2014.2312541
Filename
6775298
Link To Document