DocumentCode :
1887815
Title :
Environmental warping for in-car speech recognition
Author :
Weifeng Li ; Itou, Koichi ; Takeda, Kenji ; Itakura, F.
Author_Institution :
Nagoya Univ., Japan
fYear :
2005
fDate :
18-20 May 2005
Firstpage :
35
Abstract :
Summary form only given. We present an environmental warping approach to reduce the mismatch between the acoustic conditions during training and recognition. The idea of this approach is to map the log mel-filter-bank (MFB) vector obtained from the speech in a test driving condition into the one in the target driving condition, in which the acoustical models are trained. The mapping function is obtained by training multilayer perceptron (MLP) based neural network. In our in-car isolated word recognition experiments under 12 real car environments, the proposed approach obtained an average relative word error rate (WER) reduction of 47.6% and 17.5%, compared to the original speech and conventional speech enhancement methods, respectively.
Keywords :
channel bank filters; error statistics; learning (artificial intelligence); multilayer perceptrons; speech enhancement; speech recognition; MLP neural network; acoustic conditions mismatch; acoustical model training; environmental warping; in-car speech recognition; isolated word recognition; log mel-filter-bank vector; multilayer perceptron neural network; speech enhancement; word error rate; Acoustic testing; Error analysis; Multi-layer neural network; Multilayer perceptrons; Neural networks; Speech enhancement; Speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Nonlinear Signal and Image Processing, 2005. NSIP 2005. Abstracts. IEEE-Eurasip
Conference_Location :
Sapporo
Print_ISBN :
0-7803-9064-4
Type :
conf
DOI :
10.1109/NSIP.2005.1502284
Filename :
1502284
Link To Document :
بازگشت