Title :
Multimodal blind source separation with a circular microphone array and robust beamforming
Author :
Naqvi, Syed Mohsen ; Khan, Muhammad Salman ; Qingju Liu ; Wenwu Wang ; Chambers, Jonathon A.
Author_Institution :
Dept. of Electron. & Electr. Eng., Loughborough Univ., Loughborough, UK
fDate :
Aug. 29 2011-Sept. 2 2011
Abstract :
A novel multimodal (audio-visual) approach to the problem of blind source separation (BSS) is evaluated in room environments. The main challenges of BSS in realistic environments are: sources are moving in complex motions and the room impulse responses are long. For moving sources the unmixing filters to separate the audio signals are difficult to calculate from only statistical information available from a limited number of audio samples. For physically stationary sources measured in rooms with long impulse responses, the performance of audio only BSS methods is limited. Therefore, visual modality is utilized to facilitate the separation. The movement of the sources is detected with a 3-D tracker based on a Markov Chain Monte Carlo particle filter (MCMC-PF), and the direction of arrival information of the sources to the microphone array is estimated. A robust least squares frequency invariant data independent (RLSFIDI) beamformer is implemented to perform real time speech enhancement. The uncertainties in source localization and direction of arrival information are also controlled by using a convex optimization approach in the beamformer design. A 16 element circular array configuration is used. Simulation studies based on objective and subjective measures confirm the advantage of beamforming based processing over conventional BSS methods.
Keywords :
Markov processes; Monte Carlo methods; array signal processing; blind source separation; convex programming; direction-of-arrival estimation; filtering theory; least squares approximations; microphone arrays; statistical analysis; transient response; 16 element circular array configuration; 3D tracker; BSS; MCMC-PF; Markov chain Monte Carlo particle filter; RLSFIDI; audio signal separation; audio-visual approach; circular microphone array; convex optimization approach; direction of arrival information; multimodal blind source separation; robust beamforming; robust least squares frequency invariant data independent; room impulse response; source localization uncertainty; speech enhancement; statistical information; unmixing filter; Abstracts; Array signal processing; Arrays; Robustness;
Conference_Titel :
Signal Processing Conference, 2011 19th European
Conference_Location :
Barcelona