DocumentCode :
3143574
Title :
Enhanced multidimensional spatial functions for unambiguous localization of multiple sparse acoustic sources
Author :
Nesta, Francesco ; Omologo, Maurizio
Author_Institution :
Center of Inf. Technol., Fondazione Bruno Kessler-Irst, Trento, Italy
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
213
Lastpage :
216
Abstract :
The Steered Response Power with PHAT transform (SRP-PHAT), or Global Coherence Field (GCF), has become a standard method for acoustic source localization thanks to its simplicity, low computational cost, and robustness against mid-to-high reverberation. However, because it was originally formulated for the single-source case, it does not perform satisfactorily in the multiple-source case. In this paper, we analyze the structure of the spatial function and reshape it according to a generic multidimensional metric. We show that traditional functions are based on the L1 norm, which is prone to generating ambiguous locations with high likelihood (i.e., ghosts). We introduce a more generic multidimensional kernel, based on higher norms and on a partitioned representation of the cross-power spectrum, which better exploits source sparseness in the discrete time-frequency domain. Evaluation results on simulated data show that the new spatial functions considerably improve the detection of multiple competing sources in both the spatial and the multidimensional TDOA domains.
Keywords :
acoustic radiators; acoustic signal processing; source separation; PHAT transform; acoustic source localization; discrete time-frequency domain; generic multidimensional kernel; global coherence field; mid-high reverberation; multidimensional spatial functions; multiple sparse acoustic sources; steered response power; unambiguous localization; Coherence; Estimation; Kernel; Microphones; Time frequency analysis; Vectors; TDOA estimation; kernel methods; multidimensional signal processing; multiple speaker localization; sparse sources;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6287855
Filename :
6287855