Title :
Deep stacking networks with time series for speech separation
Author :
Shuai Nie ; Hui Zhang ; Xueliang Zhang ; Wenju Liu
Author_Institution :
Nat. Lab. of Patten Recognition (NLPR), Inst. of Autom., Beijing, China
Abstract :
In many present speech separation approaches, the separation task is formulated as a binary classification problem. Several classification-based approaches have been proposed and performed satisfactorily. However, they do not explicitly model the correlation in time and each time-frequency (T-F) unit is still classified individually. As we know, the speech signal has a very rich time series and temporal dynamic information that can be exploited for speech separation. In this study, we incorporate the correlation in time into classification. Compared with the previous approaches, the proposed approach achieves better separation and generalization performance by using deep stacking networks (DSN) with time series and re-threshold method.
Keywords :
correlation theory; signal classification; speech recognition; time series; time-frequency analysis; DSN; binary classification problem; correlation; deep stacking networks; re-threshold method; speech separation; speech signal; temporal dynamic information; time series; time-frequency unit; Correlation; Feature extraction; Signal to noise ratio; Speech; Time series analysis; Training; Binary Classification; Computational Auditory Scene Analysis (CASA); Deep Stacking Networks; Speech Separation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
DOI :
10.1109/ICASSP.2014.6854890