DocumentCode
164820
Title
Improved hands-free automatic speech recognition in reverberant environment condition
Author
Gomez, Raquel ; Nakamura, Kentaro ; Mizumoto, Tetsuya ; Nakadai, Kazuhiro
Author_Institution
Honda Res. Inst. Japan Co., Ltd., Japan
fYear
2014
fDate
12-14 May 2014
Firstpage
67
Lastpage
71
Abstract
In this paper, we incorporate language information in the treatment of reverberation in a microphone array system. First, microphone array processing is used to spatially filter the observed multiple sources. Second, the separated signal is enhanced to remove the effects of reverberation. Then, we extract information concerning the impact of smearing to the language (word sequence) used in the automatic speech recognition (ASR) system. Consequently, we compute the language-inferred coefficients via offline training. During online testing, the language-inferred coefficients are employed in conjunction with an enhancement scheme. The coefficients reflect the realistic impact of smearing in conjunction with the language information. As a result, the enhanced signal is more effective in ASR application. Experimental results in a real environment condition show that the proposed method outperforms the conventional methods that exclude language information. Moreover, we show the recognition results comparing the proposed method against existing dereverberation methods.
Keywords
microphone arrays; natural language processing; reverberation; speech recognition; ASR system; hands free automatic speech recognition; language inferred coefficients; language information; microphone array processing; microphone array system; offline training; online testing; reverberant environment condition; separated signal; word sequence; Arrays; Hidden Markov models; Microphone arrays; Reverberation; Speech; ASR; Dereverberation; Speech Enhancement;
fLanguage
English
Publisher
ieee
Conference_Titel
Hands-free Speech Communication and Microphone Arrays (HSCMA), 2014 4th Joint Workshop on
Conference_Location
Villers-les-Nancy
Type
conf
DOI
10.1109/HSCMA.2014.6843253
Filename
6843253
Link To Document