Regional Information Center for Science and Technology - Compensating changes in speaker position for improved voice-based human-robot communication

DocumentCode :

3716895

Title :

Compensating changes in speaker position for improved voice-based human-robot communication

Author :

Randy Gomez;Keisuke Nakamura;Takeshi Mizumoto;Kazuhiro Nakadai

Author_Institution :

Honda Research Institute Japan Co., Ltd., Honcho, Wako-shi, Japan

Year :

2015

Firstpage :

977

Lastpage :

982

Abstract :

Acoustic perturbation due to reverberation and changes in speaker position is detrimental to seamless speech-based human-robot communication. These factors cause a mismatch between the speech features observed at runtime and the acoustic model (training condition). As a result, the performance of both Automatic Speech Recognition (ASR) and Spoken Language Understanding (SLU) degrades. Consequently, the robot fails to understand spoken commands, which negatively impacts the interaction experience. In this paper, we propose a framework for improving speech-based human-robot communication in various reverberant environments. The framework is based on robust robot audition that addresses the mismatch problem, striking a balance between the technology and the limitations of a real robot setting. Our method improves both ASR and SLU performance. Moreover, the proposed framework can evolve to minimize the mismatch without human supervision. We experiment with data collected under real environmental conditions.

Keywords :

"Robots","Speech","Reverberation","Adaptation models","Robustness","Microphones"

Publisher :

IEEE

Conference_Title :

Humanoid Robots (Humanoids), 2015 IEEE-RAS 15th International Conference on

Type :

conf

DOI :

10.1109/HUMANOIDS.2015.7363488

Filename :

7363488

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3716895