• DocumentCode
    2559950
  • Title

    Real-time acoustic source localization in noisy environments for human-robot multimodal interaction

  • Author

    Trifa, V.M. ; Koene, Ansgar ; Morén, Jan ; Cheng, Gordon

  • Author_Institution
    ETH Zurich, Zurich
  • fYear
    2007
  • fDate
    26-29 Aug. 2007
  • Firstpage
    393
  • Lastpage
    398
  • Abstract
    Interaction between humans involves a plethora of sensory information, both in the form of explicit communication as well as more subtle unconsciously perceived signals. In order to enable natural human-robot interaction, robots will have to acquire the skills to detect and meaningfully integrate information from multiple modalities. In this article, we focus on sound localization in the context of a multi-sensory humanoid robot that combines audio and video information to yield natural and intuitive responses to human behavior, such as directed eye-head movements towards natural stimuli. We highlight four common sound source localization algorithms and compare their performance and advantages for real-time interaction. We also briefly introduce an integrated distributed control framework called DVC, where additional modalities such as speech recognition, visual tracking, or object recognition can easily be integrated. We further describe the way the sound localization module has been integrated in our humanoid robot, CB.
  • Keywords
    acoustic signal processing; distributed control; human computer interaction; humanoid robots; man-machine systems; noise; object recognition; speech recognition; tracking; CB humanoid robot; DVC integrated distributed control framework; eye-head movements; human behavior; human-robot multimodal interaction; multisensory humanoid robot; noisy environments; object recognition; real-time acoustic source localization; speech recognition; visual tracking; Acoustic noise; Acoustic signal detection; Context; Distributed control; Human robot interaction; Humanoid robots; Object recognition; Robot sensing systems; Speech recognition; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Robot and Human interactive Communication, 2007. RO-MAN 2007. The 16th IEEE International Symposium on
  • Conference_Location
    Jeju
  • Print_ISBN
    978-1-4244-1634-9
  • Electronic_ISBN
    978-1-4244-1635-6
  • Type

    conf

  • DOI
    10.1109/ROMAN.2007.4415116
  • Filename
    4415116