Title :
Time-Frequency Masking Method Using Wavelet Transform for BSS Problem
Author :
Yashita, Masahiro ; Hamada, Nozomu
Author_Institution :
Sch. of Integrated Design Eng., Keio Univ., Yokohama
Abstract :
In this paper, we propose a novel speech separation method for blind source separation problem using complex wavelet transform. Sound source separation, especially blind source separation (BSS), is necessary for speech-based human-machine interfaces. It is because BSS needs no prior information and estimates source signals only from observed signals with microphones. Time-frequency masking is famous approach for BSS of speech mixtures. It assumes the sparseness property of speech that is called "W-disjoint orthogonality (WDO)". Wavelet transform is known to be useful for analyzing nonstationary signals. We investigate the sound source separation method combining time-frequency masking and complex-valued wavelet transform known as RI-spline wavelet. As a multi-resolution method, we perform the complex multi resolution analysis (CMRA) and subband decomposition (SD). It is shown that the proposed method realizes the high sound source separation ability through simulations
Keywords :
blind source separation; man-machine systems; microphones; signal resolution; speech processing; time-frequency analysis; wavelet transforms; RI-spline wavelet; W-disjoint orthogonality; WDO; blind source separation problem; microphone; multiresolution method; nonstationary signal; sound source separation; speech-based human-machine interface; subband decomposition; time-frequency masking; wavelet transform; Blind source separation; Man machine systems; Microphones; Signal analysis; Signal resolution; Source separation; Speech coding; Time frequency analysis; Wavelet analysis; Wavelet transforms;
Conference_Titel :
TENCON 2006. 2006 IEEE Region 10 Conference
Conference_Location :
Hong Kong
Print_ISBN :
1-4244-0548-3
Electronic_ISBN :
1-4244-0549-1
DOI :
10.1109/TENCON.2006.343856