DocumentCode :
147644
Title :
Computational strategy for accelerating robust sound source detection in dynamic scenes
Author :
Donohue, Kevin D. ; Griffioen, Paul M.
Author_Institution :
Electr. & Comput. Eng. Dept., Univ. of Kentucky, Lexington, KY, USA
fYear :
2014
fDate :
13-16 March 2014
Firstpage :
1
Lastpage :
8
Abstract :
Efficient sound source detection and location with microphone arrays is important for many applications, including teleconferencing, surveillance, and smart rooms. While the steered response power algorithms exhibit robust performance relative to other approaches, their applications are limited by the high computational load required. For dynamic auditory scenes, the entire space must be scanned at regular intervals due to moving sound sources switching between active and inactive states. This paper introduces a time segmentation and parallelization strategy to speed up the steered response power algorithm for dynamic auditory scenes with multiple speech sources. The primary application targeted by this work is for immersive arrays and off-line auditory scene analysis with beamforming for speaker separation in cocktail party environments. Results from a Monte Carlo simulation with 6 speech sources in a mildly reverberant environment demonstrate a speed-up factor of 45, with a modest loss in the number of detections and a significant reduction in anomalous detections. Experimental results with real recordings demonstrate a performance consistent with those of the simulation.
Keywords :
Monte Carlo methods; acoustic signal detection; microphone arrays; speech processing; Monte Carlo simulation; cocktail party environments; computational strategy; dynamic auditory scenes; dynamic scenes; immersive array analysis; microphone arrays; off-line auditory scene analysis; parallelization strategy; smart rooms; sound source detection; sound source location; speaker separation; steered response power algorithms; surveillance; teleconferencing; time segmentation; Acoustic arrays; Acoustics; Estimation; MATLAB; Teleconferencing; Transforms; Vectors; MATLAB; Steered Response Power; cocktail party; microphone arrays; parallel processing; sound source detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
SOUTHEASTCON 2014, IEEE
Conference_Location :
Lexington, KY
Type :
conf
DOI :
10.1109/SECON.2014.6950750
Filename :
6950750
Link To Document :
بازگشت