DocumentCode :
1665855
Title :
Direct product based deep belief networks for automatic speech recognition
Author :
Fousek, Petr ; Rennie, Steven ; Dognin, Pierre ; Goel, Vikas
Author_Institution :
IBM Thomas J. Watson Res. Center, Yorktown Heights, NY, USA
fYear :
2013
Firstpage :
3148
Lastpage :
3152
Abstract :
In this paper, we present new methods for parameterizing the connections of neural networks using sums of direct products. We show that low rank parameterizations of weight matrices are a subset of this set, and explore the theoretical and practical benefits of representing weight matrices using sums of Kronecker products. ASR results on a 50 hr subset of the English Broadcast News corpus indicate that the approach is promising. In particular, we show that a factorial network with more than 150 times less parameters in its bottom layer than its standard unconstrained counterpart suffers minimal WER degradation, and that by using sums of Kronecker products, we can close the gap in WER performance while maintaining very significant parameter savings. In addition, direct product DBNs consistently outperform standard DBNs with the same number of parameters. These results have important implications for research on deep belief networks (DBNs). They imply that we should be able to train neural networks with thousands of neurons and minimal restrictions much more rapidly than is currently possible, and that by using sums of direct products, it will be possible to train neural networks with literally millions of neurons tractably-an exciting prospect.
Keywords :
belief networks; matrix decomposition; neural nets; speech recognition; Kronecker product; WER degradation; automatic speech recognition; direct product DBN; direct product based deep belief network; factorial network; low rank parameterization; neural network; weight matrices; Acoustics; Biological neural networks; Bismuth; Matrix decomposition; Neurons; Training; Vectors; Back-Propagation; Deep Belief Networks; Kronecker Product; Matrix Factorization; Multi-Layer Perceptron;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
ISSN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2013.6638238
Filename :
6638238
Link To Document :
بازگشت