DocumentCode :
3697405
Title :
Multi-stage rejection sampling (MSRS): A robust SRP-PHAT peak detection algorithm for localization of cocktail-party talkers
Author :
Sarthak Khanal;Harvey F. Silverman
Author_Institution :
LEMS, Brown University Box D, Providence, RI, 02906
fYear :
2015
Firstpage :
1
Lastpage :
5
Abstract :
The Steered Response Power using the Phase Transform weight (SRP-PHAT) has been shown to be robust in noisy and reverberant conditions. Also, volume contraction has been applied effectively to trap the global maximum for densely-hilly 3-D spaces like the SRP. However, previous methods have suffered from the presence of peaks representing multiple talkers in close proximity as is likely in a conversational cocktail-party setting. We present a volume contraction algorithm called Multi-Stage Rejection Sampling (MSRS) for detection of multiple peaks in the SRP-PHAT space. Our method not only circumvents sorting — a computationally expensive step in volume contraction algorithms — but also automatically divides a search volume into sub-volumes for robust detection of multiple peaks. We discuss some modifications to the standard SRP-PHAT functional and present results using all real-room data for baseline white-noise, an eight-speaker teleconferencing setup and a fully unconstrained cocktail-party situation containing about 21 persons in the room.
Keywords :
"Indexes","Robustness","Signal processing algorithms","Microphones","Correlation","Clustering algorithms","Acoustics"
Publisher :
ieee
Conference_Titel :
Applications of Signal Processing to Audio and Acoustics (WASPAA), 2015 IEEE Workshop on
Type :
conf
DOI :
10.1109/WASPAA.2015.7336887
Filename :
7336887
Link To Document :
بازگشت