Title :
Relative transfer function modeling for supervised source localization
Author :
Laufer, Bracha ; Talmon, Ronen ; Gannot, Sharon
Author_Institution :
Fac. of Eng., Bar-Ilan Univ., Ramat-Gan, Israel
Abstract :
Speaker localization is one of the most prevalent problems in speech processing. Despite significant efforts in the last decades, high reverberation level still limits the performance of localization algorithms. Furthermore, using conventional localization methods, the information that can be extracted from dual microphone measurements is restricted to the time difference of arrival (TDOA). Under far-field regime, this is equivalent to either azimuth or elevation angles estimation. Full description of speaker´s coordinates necessitates several microphones. In this contribution we tackle these two limitations by taking a manifold learning perspective for system identification. We present a training-based algorithm, motivated by the concept of diffusion maps, that aims at recovering the fundamental controlling parameters driving the measurements. This approach turns out to be more robust to reverberation, and capable of recovering the speech source location using merely two microphones signals.
Keywords :
learning (artificial intelligence); microphones; speech processing; transfer functions; TDOA; conventional localization methods; dual microphone measurements; manifold learning perspective; microphones signals; relative transfer function modeling; speaker localization; speech processing; speech source location; supervised source localization; time difference of arrival; Acoustics; Kernel; Microphones; Signal processing algorithms; Speech; Training; Vectors; acoustic source localization; diffusion kernel; manifold learning; relative transfer function;
Conference_Titel :
Applications of Signal Processing to Audio and Acoustics (WASPAA), 2013 IEEE Workshop on
Conference_Location :
New Paltz, NY
DOI :
10.1109/WASPAA.2013.6701829