DocumentCode :
179575
Title :
Using neural network front-ends on far field multiple microphones based speech recognition
Author :
Yulan Liu ; Pengyuan Zhang ; Hain, Thomas
Author_Institution :
Speech & Hearing Res. Group, Univ. of Sheffield, Sheffield, UK
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
5542
Lastpage :
5546
Abstract :
This paper presents an investigation of far field speech recognition using beamforming and channel concatenation in the context of Deep Neural Network (DNN) based feature extraction. While speech enhancement with beamforming is attractive, the algorithms are typically signal-based with no information about the special properties of speech. A simple alternative to beamforming is concatenating multiple channel features. Results presented in this paper indicate that channel concatenation gives similar or better results. On average the DNN front-end yields a 25% relative reduction in Word Error Rate (WER). Further experiments aim at including relevant information in training adapted DNN features. Augmenting the standard DNN input with the bottleneck feature from a Speaker Aware Deep Neural Network (SADNN) shows a general advantage over the standard DNN based recognition system, and yields additional improvements for far field speech recognition.
Keywords :
array signal processing; feature extraction; microphones; neural nets; speech enhancement; speech recognition; DNN based recognition system; SADNN; WER; beamforming; far field multiple microphones; feature extraction; multiple channel feature concatenation; neural network front-ends; speaker aware deep neural network; speech enhancement; speech recognition; word error rate; Array signal processing; Hidden Markov models; Speech; Speech processing; Speech recognition; Standards; Training; beamforming; deep neural networks; multiple microphone; speech recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6854663
Filename :
6854663
Link To Document :
بازگشت