مرکز منطقه ای اطلاع رساني علوم و فناوري - Improvement of robot audition by interfacing sound source separation and automatic speech recognition with Missing Feature Theory

DocumentCode :

414290

Title :

Improvement of robot audition by interfacing sound source separation and automatic speech recognition with Missing Feature Theory

Author :

Yamamoto, Seiichi ; Nakadai, Kazuhiro ; Tsujino, Hiroshi ; Yokoyama, Toshio ; Okuno, Hiroshi G.

Author_Institution :

Graduate Sch. of Inf., Kyoto Univ., Japan

Volume :

fYear :

2004

fDate :

April 26-May 1, 2004

Firstpage :

1517

Abstract :

We have been developed robot audition system using the active direction-pass filter (ADPF) with the Scattering Theory, and demonstrated that the humanoid SIG could separate and recognize three simultaneous speeches originating from different directions. This is the first result that a robot can listen to several things simultaneously. However, its general applicability to other robots is not yet confirmed. Since automatic speech recognition (ASR) requires direction- and speaker-dependent acoustic models, it is difficult to adapt various kinds of environments. In addition ASR with lots of acoustic models causes slow processing. In this paper, these three problems are resolved. First, we confirmed the generality of the ADPF by applying it to two humanoids, SIG2 and Replie, under different environments. Next, we present the new interface between ADPF and ASR based on the Missing Feature Theory, which masks broken features of separated sound to make them unavailable to ASR. This new interface improved the recognition performance of three simultaneous speeches up to about 90%. Finally, since the ASR uses only a single acoustic model that is direction- and speaker-independent and created under clean environments, the processing of the whole system was made very light and fast.

Keywords :

active filters; filtering theory; robots; source separation; speech processing; speech recognition; Replie humanoid robot; SIG2 humanoid robot; active direction pass filter; automatic speech recognition; direction dependent acoustic models; missing feature theory; robot audition system; sound source separation; speaker dependent acoustic models; Acoustic noise; Automatic speech recognition; Humanoid robots; Humans; Microphones; Mouth; Noise robustness; Robotics and automation; Source separation; Working environment noise;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Robotics and Automation, 2004. Proceedings. ICRA '04. 2004 IEEE International Conference on

ISSN :

1050-4729

Print_ISBN :

0-7803-8232-3

Type :

conf

DOI :

10.1109/ROBOT.2004.1308039

Filename :

1308039

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=414290