Investigations on ensemble based unsupervised adaptation methods

Author

Kubota, Yu ; Shinozaki, Takahiro ; Furui, Sadaoki

Author_Institution

Grad. Sch. of Inf. Sci. & Eng., Tokyo Inst. of Technol., Tokyo, Japan

fYear

2010

fDate

14-19 March 2010

Firstpage

4874

Lastpage

4877

Abstract

We have previously proposed unsupervised cross-validation (CV) adaptation that introduces CV into an iterative unsupervised batch mode adaptation framework to suppress the influence of errors in an internally generated recognition hypothesis and have shown that it improves recognition performance. However, a limitation was that the experiments were performed using only a clean speech recognition task with a ML trained initial acoustic model. Another limitation was that only the CV method was investigated while there was a possibility of using other ensemble methods. In this study, we evaluate the CV method using a discriminatively trained baseline and a noisy speech recognition task. As an alternative to CV adaptation, unsupervised aggregated (Ag) adaptation is proposed and investigated that introduces a bagging like idea instead of CV. Experimental results show that CV and Ag adaptations consistently give larger improvements than the conventional batch adaptation but the former is more advantageous in terms of computational cost.

Keywords

iterative methods; speech recognition; unsupervised learning; ML trained initial acoustic model; clean speech recognition task; internally generated recognition hypothesis; iterative unsupervised batch mode adaptation framework; noisy speech recognition task; unsupervised aggregated adaptation; unsupervised cross validation; Acoustic noise; Bagging; Computational efficiency; Computer errors; Information science; Iterative decoding; Iterative methods; Maximum likelihood decoding; Parameter estimation; Speech recognition; Cross-validation; acoustic model; bagging; machine learning ensemble; unsupervised adaptation;

fLanguage

English

Publisher

ieee

Conference_Titel

Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on

Conference_Location

Dallas, TX

ISSN

1520-6149

Print_ISBN

978-1-4244-4295-9

Electronic_ISBN

1520-6149

Type

conf

DOI

10.1109/ICASSP.2010.5495118

Filename

5495118