Title :
Evaluation of multi-level context-dependent acoustic model for large vocabulary speaker adaptation tasks
Author :
Chang, Hung-An ; Glass, James
Author_Institution :
MIT Comput. Sci. & Artificial Intell. Lab., Cambridge, MA, USA
Abstract :
In this paper, we investigate the ability of a recently proposed discriminatively trained, multi-level context-dependent acoustic model to adapt to a new speaker in both supervised and unsupervised adaptation scenarios. Speaker adaptive speech recognition experiments performed on a large-vocabulary spoken lecture task show that the multi-level model reduces word error rates by more than 10% in both cases as compared to the conventional clustering-based decision-tree context-dependent acoustic model approach.
Keywords :
speaker recognition; vocabulary; clustering-based decision-tree context-dependent acoustic model approach; large vocabulary speaker adaptation task; large-vocabulary spoken lecture task; multilevel context-dependent acoustic model; speaker adaptive speech recognition experiment; supervised adaptation scenario; unsupervised adaptation scenario; word error rate reduction; Acoustics; Adaptation models; Context modeling; Data models; Silicon; Training; Vocabulary; LVCSR; Multi-level acoustic model; context-dependent model; discriminative training; speaker adaptation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6288873