DocumentCode :
3152689
Title :
Scalable stacking and learning for building deep architectures
Author :
Deng, Li ; Yu, Dong ; Platt, John
Author_Institution :
Microsoft Res., Redmond, WA, USA
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
2133
Lastpage :
2136
Abstract :
Deep Neural Networks (DNNs) have shown remarkable success in pattern recognition tasks. However, parallelizing DNN training across computers has been difficult. We present the Deep Stacking Network (DSN), which overcomes the problem of parallelizing learning algorithms for deep architectures. The DSN provides a method of stacking simple processing modules in buiding deep architectures, with a convex learning problem in each module. Additional fine tuning further improves the DSN, while introducing minor non-convexity. Full learning in the DSN is batch-mode, making it amenable to parallel training over many machines and thus be scalable over the potentially huge size of the training data. Experimental results on both the MNIST (image) and TIMIT (speech) classification tasks demonstrate that the DSN learning algorithm developed in this work is not only parallelizable in implementation but it also attains higher classification accuracy than the DNN.
Keywords :
convex programming; learning (artificial intelligence); neural net architecture; pattern classification; MNIST classification task; TIMIT classification task; classification accuracy; convex learning problem; deep architecture; deep neural network training parallelization; deep stacking network learning algorithm; image classification task; learning algorithm parallelization; pattern recognition task; scalable stacking; speech classification task; Computer architecture; Error analysis; Speech; Stacking; Training; Tuning; Vectors; DNN; DSN; convexity; deep learning; stacking;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6288333
Filename :
6288333
Link To Document :
بازگشت