Title :
Combining sparse NMF with deep neural network: A new classification-based approach for speech enhancement
Author :
Hung-Wei Tseng ; Mingyi Hong ; Zhi-Quan Luo
Author_Institution :
Electr. & Comput. Eng., Univ. of Minnesota, Minneapolis, MN, USA
Abstract :
In this work, we consider enhancing a target speech from a single-channel noisy observation corrupted by non-stationary noises at low signal-to-noise ratios (SNRs). We take a classification-based approach, where the objective is to estimate an Ideal Binary Mask (IBM) that classifies each time-frequency (T-F) unit of the noisy observation into one of the two categories: speech-dominant unit or noise-dominant unit. The estimated mask is used to binary weight the noisy mixture to obtain the enhanced speech. In the proposed system, the sparse non-negative matrix factorization (NMF) is used to extract features from the noisy observation, followed by a Deep Neural Network (DNN) for classification. Compared with several existing classification-based systems, the proposed system uses minimal speech-specific domain knowledge, but is able to achieve better performance in certain low SNR regions. Moreover, the proposed system outperforms the traditional statistical method, especially in terms of improving the intelligibility.
Keywords :
feature extraction; matrix decomposition; neural nets; speech enhancement; time-frequency analysis; classification-based approach; deep neural network; feature extraction; ideal binary mask; minimal speech-specific domain knowledge; noise-dominant unit; noisy mixture; nonstationary noises; signal-to-noise ratio; single-channel noisy observation; sparse NMF; sparse nonnegative matrix factorization; speech enhancement; speech-dominant unit; time-frequency unit; Dictionaries; Encoding; Noise; Noise measurement; Speech; Speech enhancement; Training; Speech enhancement; deep neural network (DNN); non-negative matrix factorization (NMF); sparse coding;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7178350