Title :
Sound source separation and automatic speech recognition for moving sources
Author :
Nakadai, Kazuhiro ; Nakajima, Hirofumi ; Ince, öGkhan ; Hasegawa, Yuji
Author_Institution :
Honda Res. Inst. Japan Co., Ltd., Saitama, Japan
Abstract :
This paper addresses sound source separation and speech recognition for moving sound sources. Real-world applications such as robots should cope with both moving and stationary sound sources. However, most studies assume only stationary sound sources. We introduce three key techniques to cope with moving sources, that is, Adaptive Step-size control (AS), Optima Controlled Recursive Average (OCRA), and Separation Parameter Switching (SPS). We implemented a real-time robot audition system with these techniques for our humanoid robot with an 8ch microphone array by using HARK which is our open-source software for robot audition. Preliminary results show that the performance of recognition of moving sound sources improved drastically, and also the performance of the system is shown through two speech dialog scenarios which requires sound source separation and automatic speech recognition for moving sources.
Keywords :
adaptive control; blind source separation; hearing; humanoid robots; microphone arrays; optimal control; speech recognition; adaptive step-size control; automatic speech recognition; humanoid robot; microphone array; moving sound sources; open-source software; optimal controlled recursive average; robot audition system; separation parameter switching; sound source separation;
Conference_Titel :
Intelligent Robots and Systems (IROS), 2010 IEEE/RSJ International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4244-6674-0
DOI :
10.1109/IROS.2010.5651167