مرکز منطقه ای اطلاع رساني علوم و فناوري - Sound source separation and automatic speech recognition for moving sources

DocumentCode :

3328249

Title :

Sound source separation and automatic speech recognition for moving sources

Author :

Nakadai, Kazuhiro ; Nakajima, Hirofumi ; Ince, öGkhan ; Hasegawa, Yuji

Author_Institution :

Honda Res. Inst. Japan Co., Ltd., Saitama, Japan

fYear :

2010

fDate :

18-22 Oct. 2010

Firstpage :

976

Lastpage :

981

Abstract :

This paper addresses sound source separation and speech recognition for moving sound sources. Real-world applications such as robots should cope with both moving and stationary sound sources. However, most studies assume only stationary sound sources. We introduce three key techniques to cope with moving sources, that is, Adaptive Step-size control (AS), Optima Controlled Recursive Average (OCRA), and Separation Parameter Switching (SPS). We implemented a real-time robot audition system with these techniques for our humanoid robot with an 8ch microphone array by using HARK which is our open-source software for robot audition. Preliminary results show that the performance of recognition of moving sound sources improved drastically, and also the performance of the system is shown through two speech dialog scenarios which requires sound source separation and automatic speech recognition for moving sources.

Keywords :

adaptive control; blind source separation; hearing; humanoid robots; microphone arrays; optimal control; speech recognition; adaptive step-size control; automatic speech recognition; humanoid robot; microphone array; moving sound sources; open-source software; optimal controlled recursive average; robot audition system; separation parameter switching; sound source separation;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Intelligent Robots and Systems (IROS), 2010 IEEE/RSJ International Conference on

Conference_Location :

Taipei

ISSN :

2153-0858

Print_ISBN :

978-1-4244-6674-0

Type :

conf

DOI :

10.1109/IROS.2010.5651167

Filename :

5651167

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3328249