• DocumentCode
    565609
  • Title

    Multi-party human-robot interaction with distant-talking speech recognition

  • Author

    Gomez, Randy ; Nakamura, Keisuke ; Kawahara, Tatsuya ; Nakadai, Kazuhiro

  • Author_Institution
    Acad. Center for Comput. & Media Studies, Kyoto Univ., Kyoto, Japan
  • fYear
    2012
  • fDate
    5-8 March 2012
  • Firstpage
    439
  • Lastpage
    446
  • Abstract
    Speech is one of the most natural medium for human communication, which makes it vital to human-robot interaction. In real environments where robots are deployed, distant-talking speech recognition is difficult to realize due to the effects of reverberation. This leads to the degradation of speech recognition and understanding, and hinders a seamless human-robot interaction. To minimize this problem, traditional speech enhancement techniques optimized for human perception are adopted to achieve robustness in humanrobot interaction. However, human and machine perceive speech differently: An improvement in speech recognition performance may not automatically translate to an improvement in human-robot interaction experience (as perceived by the users). In this paper, we propose a method in optimizing speech enhancement techniques specifically to improve automatic speech recognition (ASR) with emphasis on the human-robot interaction experience. Experimental results using real reverberant data in a multi-party conversation, show that the proposed method improved human-robot interaction experience in severe reverberant conditions compared to the traditional techniques.
  • Keywords
    control engineering computing; human-robot interaction; speech enhancement; speech recognition; ASR; automatic speech recognition; distant-talking speech recognition; human communication; human perception; human-robot interaction experience; multiparty conversation; multiparty human-robot interaction; natural medium; real reverberant data; speech enhancement techniques; Humans; Robots; Robustness; Speech; Speech enhancement; Speech recognition; Dereverberation; Multi-party Interaction; Robot Audition; Robustness in Speech Recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Human-Robot Interaction (HRI), 2012 7th ACM/IEEE International Conference on
  • Conference_Location
    Boston, MA
  • ISSN
    2167-2121
  • Print_ISBN
    978-1-4503-1063-5
  • Electronic_ISBN
    2167-2121
  • Type

    conf

  • Filename
    6249603