DocumentCode
180405
Title
Investigation of maxout networks for speech recognition
Author
Swietojanski, Pawel ; Jinyu Li ; Jui-Ting Huang
Author_Institution
Centre for Speech Technol. Res., Univ. of Edinburgh, Edinburgh, UK
fYear
2014
fDate
4-9 May 2014
Firstpage
7649
Lastpage
7653
Abstract
We explore the use of maxout neuron in various aspects of acoustic modelling for large vocabulary speech recognition systems; including low-resource scenario and multilingual knowledge transfers. Through the experiments on voice search and short message dictation datasets, we found that maxout networks are around three times faster to train and offer lower or comparable word error rates on several tasks, when compared to the networks with logistic nonlinearity. We also present a detailed study of the maxout unit internal behaviour suggesting the use of different nonlinearities in different layers.
Keywords
neural nets; speech recognition; acoustic modelling; deep neural networks; large vocabulary speech recognition systems; logistic nonlinearity; low-resource scenario; maxout neuron networks; maxout unit internal behaviour; multilingual knowledge transfers; short message dictation datasets; voice search; word error rates; Acoustics; Hidden Markov models; Neural networks; Neurons; Speech; Speech recognition; Training; deep neural networks; low-resource speech recognition; maxout networks; multitask learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6855088
Filename
6855088
Link To Document