مرکز منطقه ای اطلاع رساني علوم و فناوري - Deep and Wide: Multiple Layers in Automatic Speech Recognition

DocumentCode :

1452362

Title :

Deep and Wide: Multiple Layers in Automatic Speech Recognition

Author :

Morgan, Nelson

Author_Institution :

Int. Comput. Sci. Inst., Berkeley, CA, USA

Volume :

Issue :

fYear :

2012

Firstpage :

Lastpage :

Abstract :

This paper reviews a line of research carried out over the last decade in speech recognition assisted by discriminatively trained, feedforward networks. The particular focus is on the use of multiple layers of processing preceding the hidden Markov model based decoding of word sequences. Emphasis is placed on the use of multiple streams of highly dimensioned layers, which have proven useful for this purpose. This paper ultimately concludes that while the deep processing structures can provide improvements for this genre, choice of features and the structure with which they are incorporated, including layer width, can also be significant factors.

Keywords :

hidden Markov models; speech recognition; automatic speech recognition; feedforward network; hidden Markov model based decoding; multiple streams; Acoustics; Artificial neural networks; Hidden Markov models; Speech; Speech recognition; Training; Vocabulary; Machine learning; multilayer perceptrons; speech recognition;

fLanguage :

English

Journal_Title :

Audio, Speech, and Language Processing, IEEE Transactions on

Publisher :

ieee

ISSN :

1558-7916

Type :

jour

DOI :

10.1109/TASL.2011.2116010

Filename :

5714717

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=1452362