Title :
Barge-in-able robot audition based on ICA and missing feature theory under semi-blind situation
Author :
Takeda, Ryu ; Nakadai, Kazuhiro ; Komatani, Kazunori ; Ogata, Tetsuya ; Okuno, Hiroshi G.
Author_Institution :
Dept. of Intell. Sci., Kyoto Univ., Kyoto
Abstract :
This paper describes a robot audition system that allows the user to barge-in; that is, the user can speak simultaneously when the robot is speaking. Our ldquobarge-in-ablerdquo system consists of two stages: (1) cancellation of robot speech and (2) recognition of the separated user speech under the ldquosemi-blind situationrdquo. The semi-blind situation is where a robotpsilas speech signal is known but a userpsilas speech signal is not. The first stage is achieved by using an adaptive filter based on time-frequency domain Independent Component Analysis, because that can separate robot speech more robustly against noise than conventional echo cancellers. To improve performance in online processing, we utilized known source normalization and the exponentially weighted stepsize method. The second stage is achieved by automatic speech recognition (ASR) based on the missing feature theory which provides robust recognition by exploiting the reliability of speech features distorted due to noise and/or separation. The semi-blind situation simplifies the estimation of such reliabilities. Experiments demonstrated that our system improved word correctness of ASR by 10.0%.
Keywords :
adaptive filters; hearing; independent component analysis; robots; speech recognition; adaptive filter; automatic speech recognition; barge-in-able system; independent component analysis; missing feature theory; robot audition; robot speech cancellation; semiblind situation; time-frequency domain; Frequency domain analysis; Optical wavelength conversion; Reliability; Robots; Speech; Speech recognition; Time frequency analysis;
Conference_Titel :
Intelligent Robots and Systems, 2008. IROS 2008. IEEE/RSJ International Conference on
Conference_Location :
Nice
Print_ISBN :
978-1-4244-2057-5
DOI :
10.1109/IROS.2008.4650799