مرکز منطقه ای اطلاع رساني علوم و فناوري - Hybrid DNN-Latent structured SVM acoustic models for continuous speech recognition

DocumentCode :

3744821

Title :

Hybrid DNN-Latent structured SVM acoustic models for continuous speech recognition

Author :

Suman Ravuri

Author_Institution :

International Computer Science Institute, Berkeley, CA, University of California - Berkeley, Berkeley, CA

fYear :

2015

Firstpage :

Lastpage :

Abstract :

In this work, we propose Deep Neural Network (DNN)-Latent Structured Support Vector Machine (LSSVM) Acoustic Models as replacement for more standard sequence-discriminative trained DNN-HMM hybrid acoustic models. Compared to existing methods, approaches based on margin maximization, as is considered in this work, enjoy better theoretical justification. In addition to a max-margin based criteria, we also extend the Structured SVM model to include latent variables in the model to account for uncertainty in state alignments. Introducing latent structure allows for better sample complexity, often requiring 33% to 66% fewer utterances to converge compared to alternate criteria. On an 8-hour independent test set of conversational speech, the proposed method decreases word error rate by 9% relative to a cross-entropy trained hybrid system, while the best existing system decreases the word error rate by 6.5% relative.

Keywords :

"Hidden Markov models","Support vector machines","Training","Acoustics","Decoding","Speech recognition","Neural networks"

Publisher :

ieee

Conference_Titel :

Automatic Speech Recognition and Understanding (ASRU), 2015 IEEE Workshop on

Type :

conf

DOI :

10.1109/ASRU.2015.7404771

Filename :

7404771

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3744821