DocumentCode :
3317761
Title :
An improvement in automatic speech recognition using soft missing feature masks for robot audition
Author :
Takahashi, Toru ; Nakadai, Kazuhiro ; Komatani, Kazunori ; Ogata, Tetsuya ; Okuno, Hiroshi G.
Author_Institution :
Dept. of Intell. & Sci. & Technol., Kyoto Univ., Kyoto, Japan
fYear :
2010
fDate :
18-22 Oct. 2010
Firstpage :
964
Lastpage :
969
Abstract :
We describe integration of preprocessing and automatic speech recognition based on Missing-Feature-Theory (MFT) to recognize a highly interfered speech signal, such as the signal in a narrow angle between a desired and interfered speakers. As a speech signal separated from a mixture of speech signals includes the leakage from other speech signals, recognition performance of the separated speech degrades. An important problem is estimating the leakage in time-frequency components. Once the leakage is estimated, we can generate missing feature masks (MFM) automatically by using our method. A new weighted sigmoid function is introduced for our MFM generation method. An experiment shows that a word correct rate improves from 66 % to 74 % by using our MFM generation method tuned by a search base approach in the parameter space.
Keywords :
hearing; human-robot interaction; humanoid robots; source separation; speech recognition; MFM; missing feature theory; robot audition; speech recognition; speech signal separation; time-frequency analysis; weighted sigmoid function;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Robots and Systems (IROS), 2010 IEEE/RSJ International Conference on
Conference_Location :
Taipei
ISSN :
2153-0858
Print_ISBN :
978-1-4244-6674-0
Type :
conf
DOI :
10.1109/IROS.2010.5650540
Filename :
5650540
Link To Document :
بازگشت