مرکز منطقه ای اطلاع رساني علوم و فناوري - Joint training of convolutional and non-convolutional neural networks

DocumentCode :

179586

Title :

Joint training of convolutional and non-convolutional neural networks

Author :

Soltau, Hagen ; Saon, George ; Sainath, Tara N.

Author_Institution :

IBM T. J. Watson Res. Center, Yorktown Heights, NY, USA

fYear :

2014

fDate :

4-9 May 2014

Firstpage :

5572

Lastpage :

5576

Abstract :

We describe a simple modification of neural networks which consists in extending the commonly used linear layer structure to an arbitrary graph structure. This allows us to combine the benefits of convolutional neural networks with the benefits of regular networks. The joint model has only a small increase in parameter size and training and decoding time are virtually unaffected. We report significant improvements over very strong baselines on two LVCSR tasks and one speech activity detection task.

Keywords :

convolutional codes; decoding; neural nets; LVCSR tasks; arbitrary graph structure; convolutional neural networks; decoding time; joint training; linear layer structure; nonconvolutional neural networks; regular networks; speech activity detection task; Acoustics; Error analysis; Hidden Markov models; Joints; Neural networks; Speech; Training; Acoustic Modeling; CNN; MLP; Neural Networks;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on

Conference_Location :

Florence

Type :

conf

DOI :

10.1109/ICASSP.2014.6854669

Filename :

6854669

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=179586