DocumentCode :
1694888
Title :
Robust Speech Recognition System for Communication Robots in Real Environments
Author :
Ishi, Carlos Toshinori ; Matsuda, Shigeki ; Kanda, Takayuki ; Jitsuhiro, Takatoshi ; Ishiguro, Hiroshi ; Nakamura, Satoshi ; Hagita, Norihiro
Author_Institution :
Intelligent Robotics & Commun. Labs., ATR, Kyoto
fYear :
2006
Firstpage :
340
Lastpage :
345
Abstract :
The application range of communication robots could be widely expanded by the use of an automatic speech recognition (ASR) system with improved robustness for noise and for speakers of different ages. In this paper, we describe an ASR system which can robustly recognize speech by adults and children in noisy environments. We evaluate the ASR system in a communication robot placed in a real noisy environment. Speech is captured using a twelve-element microphone array arranged in the robot chest. To suppress interference and noise and to attenuate reverberation, we implemented a multi-channel system consisting of an outlier-robust generalized sidelobe canceller (RGSC) technique and a feature-space noise suppression using MMSE criteria. Speech activity periods are detected using GMM-based end-point detection (GMM-EPD). Our ASR system has two decoders for adults´ and children´s speech. The final hypothesis is selected based on posterior probability. We then assign a generalized word posterior probability (GWPP)-based confidence measure to this hypothesis, and if it is higher than a threshold, we transfer it to a subsequent dialog processing module. The performance of each step was evaluated for adults´ and children´s speech, by adding different levels of real environment noise recorded in a cafeteria. Experimental results indicated that our ASR system could achieve over 80 % word accuracy in 70 dBA noise. Further evaluation of adult speech recorded in a real noisy environment resulted in 73 % word accuracy
Keywords :
Gaussian processes; feature extraction; least mean squares methods; microphone arrays; probability; reverberation; robots; signal denoising; speech recognition; Gaussian mixture model; MMSE criteria; acoustic noise; communication robots; dialog processing; end-point detection; feature-space noise suppression; generalized word posterior probability; interference suppression; microphone array; multichannel system; noisy environment; outlier-robust generalized sidelobe canceller; reverberation attenuation; robust speech recognition system; speech decoding; Automatic speech recognition; Interference suppression; Microphone arrays; Noise cancellation; Noise robustness; Robotics and automation; Robots; Speech analysis; Speech recognition; Working environment noise; Communication robots; acoustic noise; children speech; robustness; speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Humanoid Robots, 2006 6th IEEE-RAS International Conference on
Conference_Location :
Genova
Print_ISBN :
1-4244-0200-X
Electronic_ISBN :
1-4244-0200-X
Type :
conf
DOI :
10.1109/ICHR.2006.321294
Filename :
4115624
Link To Document :
بازگشت