DocumentCode :
3644550
Title :
Source localization and separation using Random Sample Consensus with phase cues
Author :
Łukasz Litwic;Philip JB Jackson
Author_Institution :
Centre for Vision, Speech and Signal Processing, University of Surrey, UK
fYear :
2011
Firstpage :
337
Lastpage :
340
Abstract :
In this paper we present a system for localization and separation of multiple speech sources using phase cues. The novelty of this method is the use of the Random Sample Consensus (RANSAC) approach to find consistency of interaural phase differences (IPDs) across the whole frequency range. This approach is inherently free from phase ambiguity problems and enables all phase data to contribute to localization. Another property of RANSAC is its robustness against outliers, which enables multiple-source localization with phase data contaminated by reverberation noise. Results of RANSAC-based localization are fed into a mixture model to generate time-frequency binary masks for separation. System performance is compared against other well-known methods and shows similar or improved performance in reverberant conditions.
Keywords :
"Time frequency analysis","Data models","Speech","Histograms","Signal processing algorithms","Delay effects","Estimation"
Publisher :
IEEE
Conference_Title :
Applications of Signal Processing to Audio and Acoustics (WASPAA), 2011 IEEE Workshop on
ISSN :
1931-1168
Print_ISBN :
978-1-4577-0692-9
Type :
conf
DOI :
10.1109/ASPAA.2011.6082334
Filename :
6082334