مرکز منطقه ای اطلاع رساني علوم و فناوري - An improvement in automatic speech recognition using soft missing feature masks for robot audition

DocumentCode :

3317761

Title :

An improvement in automatic speech recognition using soft missing feature masks for robot audition

Author :

Takahashi, Toru ; Nakadai, Kazuhiro ; Komatani, Kazunori ; Ogata, Tetsuya ; Okuno, Hiroshi G.

Author_Institution :

Dept. of Intell. & Sci. & Technol., Kyoto Univ., Kyoto, Japan

fYear :

2010

fDate :

18-22 Oct. 2010

Firstpage :

964

Lastpage :

969

Abstract :

We describe integration of preprocessing and automatic speech recognition based on Missing-Feature-Theory (MFT) to recognize a highly interfered speech signal, such as the signal in a narrow angle between a desired and interfered speakers. As a speech signal separated from a mixture of speech signals includes the leakage from other speech signals, recognition performance of the separated speech degrades. An important problem is estimating the leakage in time-frequency components. Once the leakage is estimated, we can generate missing feature masks (MFM) automatically by using our method. A new weighted sigmoid function is introduced for our MFM generation method. An experiment shows that a word correct rate improves from 66 % to 74 % by using our MFM generation method tuned by a search base approach in the parameter space.

Keywords :

hearing; human-robot interaction; humanoid robots; source separation; speech recognition; MFM; missing feature theory; robot audition; speech recognition; speech signal separation; time-frequency analysis; weighted sigmoid function;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Intelligent Robots and Systems (IROS), 2010 IEEE/RSJ International Conference on

Conference_Location :

Taipei

ISSN :

2153-0858

Print_ISBN :

978-1-4244-6674-0

Type :

conf

DOI :

10.1109/IROS.2010.5650540

Filename :

5650540

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3317761