DocumentCode :
180402
Title :
Fine context, low-rank, softplus deep neural networks for mobile speech recognition
Author :
Senior, Alan ; Xin Lei
Author_Institution :
Google Inc., Mountain View, CA, USA
fYear :
2014
fDate :
4-9 May 2014
Firstpage :
7644
Lastpage :
7648
Abstract :
We investigate the use of large state inventories and the softplus nonlinearity for on-device neural network based mobile speech recognition. Large state inventories are achieved by less aggressive context-dependent state tying, and made possible by using a bottleneck layer to contain the number of parameters. We investigate alternative approaches to the bottleneck layer, demonstrate the superiority of the softplus non-linearity and investigate alternatives for the final stages of the training algorithm. Overall we reduce the word error rate of the system by 9% relative. The techniques are also shown to work well for large acoustic models for cloud-based speech recognition.
Keywords :
mobile computing; neural nets; speech recognition; acoustic models; bottleneck layer; cloud based speech recognition; fine context; large state inventories; low-rank; mobile speech recognition; softplus deep neural networks; softplus nonlinearity; training algorithm; word error rate; Acoustics; Approximation methods; Error analysis; Neural networks; Speech; Speech recognition; Training; Deep neural networks; Voice Search; embedded recognizer; hybrid neural network speech recognition; low-rank approximation; mobile speech recognition; singular value decomposition; softplus nonlinearity;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location :
Florence
Type :
conf
DOI :
10.1109/ICASSP.2014.6855087
Filename :
6855087
Link To Document :
بازگشت