مرکز منطقه ای اطلاع رساني علوم و فناوري - Evaluation of multi-level context-dependent acoustic model for large vocabulary speaker adaptation tasks

DocumentCode :

3163073

Title :

Evaluation of multi-level context-dependent acoustic model for large vocabulary speaker adaptation tasks

Author :

Chang, Hung-An ; Glass, James

Author_Institution :

MIT Comput. Sci. & Artificial Intell. Lab., Cambridge, MA, USA

fYear :

2012

fDate :

25-30 March 2012

Firstpage :

4313

Lastpage :

4316

Abstract :

In this paper, we investigate the ability of a recently proposed discriminatively trained, multi-level context-dependent acoustic model to adapt to a new speaker in both supervised and unsupervised adaptation scenarios. Speaker adaptive speech recognition experiments performed on a large-vocabulary spoken lecture task show that the multi-level model reduces word error rates by more than 10% in both cases as compared to the conventional clustering-based decision-tree context-dependent acoustic model approach.

Keywords :

speaker recognition; vocabulary; clustering-based decision-tree context-dependent acoustic model approach; large vocabulary speaker adaptation task; large-vocabulary spoken lecture task; multilevel context-dependent acoustic model; speaker adaptive speech recognition experiment; supervised adaptation scenario; unsupervised adaptation scenario; word error rate reduction; Acoustics; Adaptation models; Context modeling; Data models; Silicon; Training; Vocabulary; LVCSR; Multi-level acoustic model; context-dependent model; discriminative training; speaker adaptation;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

Conference_Location :

Kyoto

ISSN :

1520-6149

Print_ISBN :

978-1-4673-0045-2

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2012.6288873

Filename :

6288873

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3163073