• DocumentCode
    1084167
  • Title

    Robust Recognition of Simultaneous Speech by a Mobile Robot

  • Author

    Valin, Jean-Marc ; Yamamoto, Shunichi ; Rouat, Jean ; Michaud, François ; Nakadai, Kazuhiro ; Okuno, Hiroshi G.

  • Author_Institution
    Commonwealth Sci. & Ind. Res. Organ. Inf. & Commun. Technol. (CSIROICT) Centre, Sydney
  • Volume
    23
  • Issue
    4
  • fYear
    2007
  • Firstpage
    742
  • Lastpage
    752
  • Abstract
    This paper describes a system that gives a mobile robot the ability to perform automatic speech recognition with simultaneous speakers. A microphone array is used along with a real-time implementation of geometric source separation (GSS) and a postfilter that gives a further reduction of interference from other sources. The postfllter is also used to estimate the reliability of spectral features and compute a missing feature mask. The mask is used in a missing feature theory-based speech recognition system to recognize the speech from simultaneous Japanese speakers in the context of a humanoid robot. Recognition rates are presented for three simultaneous speakers located at 2 m from the robot. The system was evaluated on a 200-word vocabulary at different azimuths between sources, ranging from 10deg to 90deg. Compared to the use of the microphone array source separation alone, we demonstrate an average reduction in relative recognition error rate of 24% with the postfllter and of 42% when the missing features approach is combined with the postfllter. We demonstrate the effectiveness of our multisource microphone array postfilter and the improvement it provides when used in conjunction with the missing features theory.
  • Keywords
    geometry; humanoid robots; interference suppression; microphone arrays; mobile robots; natural language processing; speech recognition; automatic speech recognition; geometric source separation; humanoid robot; interference reduction; microphone array; missing feature theory; mobile robot; postfilter; simultaneous Japanese speakers; simultaneous speech recognition; Automatic speech recognition; Humanoid robots; Interference; Microphone arrays; Mobile robots; Robustness; Source separation; Speech coding; Speech recognition; Vocabulary; Cocktail party; geometric source separation (GSS); microphone array; missing feature theory; robot audition; speech recognition;
  • fLanguage
    English
  • Journal_Title
    Robotics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1552-3098
  • Type

    jour

  • DOI
    10.1109/TRO.2007.900612
  • Filename
    4285864