• DocumentCode
    399274
  • Title

    Applying scattering theory to robot audition system: robust sound source localization and extraction

  • Author

    Nakadai, Kazuhiro ; Matsuura, Daisuke ; Okuno, Hiroshi G. ; Kitano, Hrioaki

  • Author_Institution
    Honda Res. Inst., Saitama, Japan
  • Volume
    2
  • fYear
    2003
  • fDate
    27-31 Oct. 2003
  • Firstpage
    1147
  • Abstract
    Robot audition by its own ears (microphones) is essential for natural human-robot communication and interface. Since a microphone is embedded in the head of a robot, the head-related transfer function (HRTF) plays an important role in sound source localization and extraction. Usually, from binaural input, the interaural phase difference (IPD) and interaural intensity difference (IID) are calculated, and then the direction is determined by using IPD and IID with HRTF. The problem of HRTF-based sound source localization is that a HRTF should be measured for each robot in an anechoic chamber, because it depends on the shape of robot´s head; HRTF should be interpolated to manipulate a moving talker, because it is available only for discrete azimuth and elevation. To cope with these problems of HRTF, we proposed the auditory epipolar geometry as a continuous function of IPD and IID to dispense with HRTF and have developed a real-time multiple-talker tracking system. This auditory epipolar geometry, however, does not give a good approximation to IID of all range and IPD of peripheral areas. In this paper, the scattering theory in physics is employed to take into consideration the diffraction of sounds around robot´s head for better approximation of IID and IPD. The resulting system shows that it is efficient for localization and extraction of sound at higher frequency and from side directions.
  • Keywords
    acoustic signal processing; acoustic wave scattering; array signal processing; direction-of-arrival estimation; man-machine systems; microphones; robots; transfer functions; user interfaces; anechoic chamber; auditory epipolar geometry; binaural input; discrete azimuth; extraction; head-related transfer function; human-robot communication; interaural intensity difference; interaural phase difference; microphone; real-time multiple-talker tracking system; robot audition system; robust sound source localization; scattering theory; Acoustic scattering; Anechoic chambers; Azimuth; Computational geometry; Ear; Microphones; Robots; Robustness; Shape measurement; Transfer functions;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Robots and Systems, 2003. (IROS 2003). Proceedings. 2003 IEEE/RSJ International Conference on
  • Print_ISBN
    0-7803-7860-1
  • Type

    conf

  • DOI
    10.1109/IROS.2003.1248800
  • Filename
    1248800