DocumentCode :
3510777
Title :
Multimodal blind source separation for moving sources
Author :
Naqvi, S.M. ; Zhang, Y. ; Chambers, J.A.
Author_Institution :
Dept. of Electron. & Electr. Eng., Loughborough Univ., Loughborough
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
125
Lastpage :
128
Abstract :
A novel multimodal approach is proposed to solve the problem of blind source separation (BSS) of moving sources. The challenge of BSS for moving sources is that the mixing filters are time varying, thus the unmixing filters should also be time varying, which are difficult to track in real time. In the proposed approach, the visual modality is utilized to facilitate the separation for both stationary and moving sources. The movement of the sources is detected by a 3-D tracker based on particle filtering. The full BSS solution is formed by integrating a frequency domain blind source separation algorithm and beamforming: if the sources are identified as stationary, a frequency domain BSS algorithm is implemented with an initialization derived from the visual information. Once the sources are moving, a beamforming algorithm is used to perform real time speech enhancement and provide separation of the sources. Experimental results show that by utilizing the visual modality, the proposed algorithm can not only improve the performance of the BSS algorithm and mitigate the permutation problem for stationary sources, but also provide a good BSS performance for moving sources in a low reverberant environment.
Keywords :
blind source separation; particle filtering (numerical methods); speech enhancement; 3D tracker; beamforming algorithm; frequency domain blind source separation algorithm; mixing filter; moving sources; multimodal blind source separation; particle filtering; permutation problem; real time speech enhancement; visual modality; Array signal processing; Blind source separation; Cameras; Data mining; Filtering; Filters; Frequency domain analysis; Particle tracking; Signal processing algorithms; Source separation; 3-D tracking; BSS; FastICA; beamforming; multimodal signal processing; particle filtering;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4959536
Filename :
4959536
Link To Document :
بازگشت