DocumentCode
1887815
Title
Environmental warping for in-car speech recognition
Author
Weifeng Li ; Itou, Koichi ; Takeda, Kenji ; Itakura, F.
Author_Institution
Nagoya Univ., Japan
fYear
2005
fDate
18-20 May 2005
Firstpage
35
Abstract
Summary form only given. We present an environmental warping approach to reduce the mismatch between the acoustic conditions during training and recognition. The idea of this approach is to map the log mel-filter-bank (MFB) vector obtained from the speech in a test driving condition into the one in the target driving condition, in which the acoustical models are trained. The mapping function is obtained by training multilayer perceptron (MLP) based neural network. In our in-car isolated word recognition experiments under 12 real car environments, the proposed approach obtained an average relative word error rate (WER) reduction of 47.6% and 17.5%, compared to the original speech and conventional speech enhancement methods, respectively.
Keywords
channel bank filters; error statistics; learning (artificial intelligence); multilayer perceptrons; speech enhancement; speech recognition; MLP neural network; acoustic conditions mismatch; acoustical model training; environmental warping; in-car speech recognition; isolated word recognition; log mel-filter-bank vector; multilayer perceptron neural network; speech enhancement; word error rate; Acoustic testing; Error analysis; Multi-layer neural network; Multilayer perceptrons; Neural networks; Speech enhancement; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Nonlinear Signal and Image Processing, 2005. NSIP 2005. Abstracts. IEEE-Eurasip
Conference_Location
Sapporo
Print_ISBN
0-7803-9064-4
Type
conf
DOI
10.1109/NSIP.2005.1502284
Filename
1502284
Link To Document