DocumentCode :
3716895
Title :
Compensating changes in speaker position for improved voice-based human-robot communication
Author :
Randy Gomez;Keisuke Nakamura;Takeshi Mizumoto;Kazuhiro Nakadai
Author_Institution :
Honda Research Institute Japan Ltd. Co., Honcho Wako-shi, Japan
fYear :
2015
Firstpage :
977
Lastpage :
982
Abstract :
Acoustic perturbation due to reverberation and the changes in speaker position are detrimental to seamless human-robot speech-based communication. These cause a mismatch between the speech features at runtime condition and the acoustic model (training condition). Then the degradation of the Automatic Speech Recognition (ASR) and the Spoken Language Understanding (SLU) performances is imminent. As a consequence, the robot fails to understand the spoken commands which will negatively impact interaction experience. In this paper, we propose a framework to improving speech-based human-robot communication in various reverberant environments. The framework is based on robust robot audition that addresses the mismatch problem, striking a balance between technology and the limitations in a real robot setting. Our method improves both ASR and SLU performances. Moreover, the proposed framework has the ability to evolve in minimizing mismatch without human supervision. We experiment with data collected in real environment conditions.
Keywords :
"Robots","Speech","Reverberation","Adaptation models","Robustness","Microphones"
Publisher :
ieee
Conference_Titel :
Humanoid Robots (Humanoids), 2015 IEEE-RAS 15th International Conference on
Type :
conf
DOI :
10.1109/HUMANOIDS.2015.7363488
Filename :
7363488
Link To Document :
بازگشت