DocumentCode :
426222
Title :
Assessment of general applicability of robot audition system by recognizing three simultaneous speeches
Author :
Yamamoto, Shun´ichi ; Nakadai, Kazuhiro ; Tsujino, Hiroshi ; Okuno, Hiroshi G.
Author_Institution :
Graduate Sch. of Inf., Kyoto Univ., Japan
Volume :
3
fYear :
2004
fDate :
28 Sept.-2 Oct. 2004
Firstpage :
2111
Abstract :
Robot audition is a critical technology in creating an intelligent robot operating in daily environments. We have developed such a robot audition system by using a new interface between sound source separation and automatic speech recognition (ASR). A mixture of speeches captured with a pair of microphones installed in the ear positions of a humanoid is separated into each speech by using active direction-pass filter (ADPF). The ADPF extracts a sound source originating from a specific direction in real-time by using interaural phase and intensity differences. The separated speech is recognized by a speech recognizer based on the missing feature theory (MFT). By using a missing feature mask, the MFT based ASR neglects distorted and missing features caused during the speech separation. A missing feature mask for each separated speech is generated in speech separation and is sent to the ASR with the separated speech. Thus, this new integration improves the performance of ASR. However, the generality of this robot audition system has not been assessed so far. In this paper, we assess its general applicability by implementing it on the three humanoids, i.e., ASIMO of Honda, SIG2, and Replie of Kyoto University. By using three simultaneous speeches as benchmarks, the robot audition system improved the performance of ASR over 50% in every humanoid, and thus its general applicability was confirmed.
Keywords :
active filters; humanoid robots; intelligent robots; speech recognition; active direction-pass filter; automatic speech recognition; humanoid robot; intelligent robot; missing feature theory; robot audition system; sound source separation; Active filters; Automatic speech recognition; Ear; Humanoid robots; Intelligent robots; Microphones; Robotics and automation; Source separation; Speech coding; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Robots and Systems, 2004. (IROS 2004). Proceedings. 2004 IEEE/RSJ International Conference on
Print_ISBN :
0-7803-8463-6
Type :
conf
DOI :
10.1109/IROS.2004.1389721
Filename :
1389721
Link To Document :
بازگشت