Title :
Sound source localization using sparse coding and SOM
Author :
Kim, Hong-Shik ; Choi, Jong-Suk
Author_Institution :
Korea Inst. of Sci. & Technol., Seoul, South Korea
Abstract :
Many kinds of sound source localization systems have been developed for detecting a direction of sound source. They are commonly using time delay of arrival (TDOA) or interaural time difference (ITD) algorithm for sound source localization where, especially, the ITD is the difference in arrival time of a sound between two ears. It is largely changed depending on frequency components of sound even though the sound source is located in the same place. In this paper we propose a binaural sound localization system using sparse coding based ITD (S-ITD) and self-organizing map (SOM). The sparse coding is used for decomposing given sounds into three components: time, frequency and magnitude. Moreover we estimate the azimuth angle through the SOM. This localization system is installed in our robot that has two ears, head and body. We use PeopleBot as a body of the robot.
Keywords :
direction-of-arrival estimation; self-organising feature maps; time-of-arrival estimation; PeopleBot; binaural sound localization system; interaural time difference algorithm; self-organizing map; sound source localization systems; sparse coding; time delay of arrival algorithm; Application software; Azimuth; Delay effects; Ear; Frequency; Manufacturing; Mobile robots; Neck; Open source software; Wheels;
Conference_Titel :
Emerging Technologies & Factory Automation, 2009. ETFA 2009. IEEE Conference on
Conference_Location :
Mallorca
Print_ISBN :
978-1-4244-2727-7
Electronic_ISBN :
1946-0759
DOI :
10.1109/ETFA.2009.5347025