DocumentCode
2063493
Title
An algorithm combined with spectral subtraction and binary masking for monaural speech segregation
Author
Jiang, Yi ; Zhou, Hong
Author_Institution
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
fYear
2011
fDate
14-16 Sept. 2011
Firstpage
1
Lastpage
4
Abstract
Monaural speech segregation from complex concurrent noise is an extremely challenging problem; binary mask is a method to solve this problem, however, the performance of binary mask is limited by remaining the noise in the result. In this paper, an algorithm integrated Spectral Subtraction and binary masking for speech separation and enhancement was proposed. It follows the framework of computational auditory scene analysis (CASA). The energy of time-frequency (T-F) unit was used as the clue to generate the binary mask; then the spectral subtraction algorithm was used to eliminate noise energy in original speech and an interim speech was obtained, after covered the binary mask on the interim speech, the target speech can be achieved. Systematic evaluation shows that the combined algorithm can stably improve the SNR and voice quality for noisy speech. It performs better than existing binary masking systems in most situations, especially when the noise and the speech have the similar power spectrum.
Keywords
noise; source separation; speech intelligibility; speech processing; binary masking; complex concurrent noise; computational auditory scene analysis; energy of time frequency unit; interim speech; monaural speech segregation; noisy speech; spectral subtraction; voice quality; Algorithm design and analysis; Image analysis; Signal to noise ratio; Speech; Speech processing; Time frequency analysis; Computational Auditory Scene Analysis (CASA); Spectral Subtraction; binary masking; speech segregation; time-frequency units;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing, Communications and Computing (ICSPCC), 2011 IEEE International Conference on
Conference_Location
Xi´an
Print_ISBN
978-1-4577-0893-0
Type
conf
DOI
10.1109/ICSPCC.2011.6061563
Filename
6061563
Link To Document