• DocumentCode
    2007122
  • Title

    Sound-event recognition with a companion humanoid

  • Author

    Janvier, Maxime ; Alameda-Pineda, Xavier ; Girinz, Laurent ; Horaud, Radu

  • Author_Institution
    INRIA Grenoble Rhone-Alpes, Grenoble, France
  • fYear
    2012
  • fDate
    Nov. 29 2012-Dec. 1 2012
  • Firstpage
    104
  • Lastpage
    111
  • Abstract
    In this paper we address the problem of recognizing everyday sound events in indoor environments with a consumer robot. Sounds are represented in the spectro-temporal domain using the stabilized auditory image (SAI) representation. The SAI is well suited for representing pulse-resonance sounds and has the interesting property of mapping a time-varying signal into a fixed-dimension feature vector space. This allows us to map the sound recognition problem into a supervised classification problem and to adopt a variety of classifications schemes. We present a complete system that takes as input a continuous signal, splits it into significant isolated sounds and noise, and classifies the isolated sounds using a catalogue of learned sound-event classes. The method is validated with a large set of audio data recorded with a humanoid robot in a house. Extended experiments show that the proposed method achieves state-of-the-art recognition scores with a twelve-class problem, while requiring extremely limited memory space and moderate computing power. A first real-time embedded implementation in a consumer robot show its ability to work in real conditions.
  • Keywords
    audio signal processing; humanoid robots; image representation; robot vision; signal classification; vectors; SAI representation; audio data recording; catalogue; companion humanoid; consumer robot; fixed-dimension feature vector space; humanoid robot; indoor environments; learned sound-event classes; pulse-resonance sound representation; sound-event recognition; spectro-temporal domain; stabilized auditory image representation; supervised classification problem; time-varying signal mapping; twelve-class problem; Humanoid robots; Indoor environments; Microphones; Prototypes; Training; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Humanoid Robots (Humanoids), 2012 12th IEEE-RAS International Conference on
  • Conference_Location
    Osaka
  • ISSN
    2164-0572
  • Type

    conf

  • DOI
    10.1109/HUMANOIDS.2012.6651506
  • Filename
    6651506