• DocumentCode
    663921
  • Title

    Nested iGMM recognition and multiple hypothesis tracking of moving sound sources for mobile robot audition

  • Author

    Sasaki, Yutaka ; Hatao, Naotaka ; Yoshii, Kazutomo ; Kagami, Satoshi

  • Author_Institution
    Digital Human Res. Center, Nat. Inst. of Adv. Ind. Sci. & Technol., Tokyo, Japan
  • fYear
    2013
  • fDate
    3-7 Nov. 2013
  • Firstpage
    3930
  • Lastpage
    3936
  • Abstract
    The paper proposes two modules for a mobile robot audition system: 1) recognizing surrounding acoustic event, 2) tracking moving sound sources. We propose nested infinite Gaussian mixture model (iGMM) for recognizing frame based feature vectors. The main advantage is that the number of classes is allowed to increase without bound, if necessary, to represent unknown audio input. The multiple hypothesis tracking module provides time-series of separated audio stream using localized directions and recognition results at each frame. Not only for continuous sounds, the proposed tracker automatically detects appearing and disappearing point of stream from multiple hypothesis. These two modules are connected to microphone array based sound localization and separation, and the combined robot audition system achieved tracking of multiple moving sounds including intermittent sound source.
  • Keywords
    Gaussian processes; audio signal processing; mobile robots; source separation; appearing point-of-stream; disappearing point-of-stream; frame based feature vectors recognition; infinite Gaussian mixture model; intermittent sound source; localized directions; microphone array based sound localization; microphone array based sound separation; mobile robot audition system; moving sound sources; multiple hypothesis tracking; nested iGMM recognition; surrounding acoustic event recognition; Arrays; Mathematical model; Microphones; Mobile robots; Noise; Vectors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Robots and Systems (IROS), 2013 IEEE/RSJ International Conference on
  • Conference_Location
    Tokyo
  • ISSN
    2153-0858
  • Type

    conf

  • DOI
    10.1109/IROS.2013.6696918
  • Filename
    6696918