DocumentCode
2791439
Title
Investigations on ensemble based unsupervised adaptation methods
Author
Kubota, Yu ; Shinozaki, Takahiro ; Furui, Sadaoki
Author_Institution
Grad. Sch. of Inf. Sci. & Eng., Tokyo Inst. of Technol., Tokyo, Japan
fYear
2010
fDate
14-19 March 2010
Firstpage
4874
Lastpage
4877
Abstract
We have previously proposed unsupervised cross-validation (CV) adaptation that introduces CV into an iterative unsupervised batch mode adaptation framework to suppress the influence of errors in an internally generated recognition hypothesis and have shown that it improves recognition performance. However, a limitation was that the experiments were performed using only a clean speech recognition task with a ML trained initial acoustic model. Another limitation was that only the CV method was investigated while there was a possibility of using other ensemble methods. In this study, we evaluate the CV method using a discriminatively trained baseline and a noisy speech recognition task. As an alternative to CV adaptation, unsupervised aggregated (Ag) adaptation is proposed and investigated that introduces a bagging like idea instead of CV. Experimental results show that CV and Ag adaptations consistently give larger improvements than the conventional batch adaptation but the former is more advantageous in terms of computational cost.
Keywords
iterative methods; speech recognition; unsupervised learning; ML trained initial acoustic model; clean speech recognition task; internally generated recognition hypothesis; iterative unsupervised batch mode adaptation framework; noisy speech recognition task; unsupervised aggregated adaptation; unsupervised cross validation; Acoustic noise; Bagging; Computational efficiency; Computer errors; Information science; Iterative decoding; Iterative methods; Maximum likelihood decoding; Parameter estimation; Speech recognition; Cross-validation; acoustic model; bagging; machine learning ensemble; unsupervised adaptation;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location
Dallas, TX
ISSN
1520-6149
Print_ISBN
978-1-4244-4295-9
Electronic_ISBN
1520-6149
Type
conf
DOI
10.1109/ICASSP.2010.5495118
Filename
5495118
Link To Document