DocumentCode :
3529504
Title :
On-line speaker adaptation on telephony speech data with adaptively trained acoustic models
Author :
Giuliani, Diego ; Gretter, Roberto ; Brugnara, Fabio
Author_Institution :
Human Language Technol. Res. Unit, FBK-irst - Fondazione Bruno Kessler, Povo
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
4385
Lastpage :
4388
Abstract :
This paper addresses speaker adaptive acoustic modeling, based on feature space maximum likelihood linear regression, in the context of on-line telephony applications. An adaptive acoustic modeling method, that we previously proved effective in off-line applications, is used to train acoustic models to be used in text-dependent and text-independent on-line adaptation. Experiments on telephony speech data indicate that feature space maximum a posteriori linear regression (fMAPLR) greatly helps to cope with sparse adaptation data when performing instantaneous and incremental adaptation with both baseline models and speaker adaptively trained models. The use of speaker adaptively trained models in conjunction with fMAPLR leads to the best recognition results in both instantaneous and incremental adaptation. The proposed text-independent adaptation approach, exploiting speaker adaptively trained models, is also proven effective.
Keywords :
speaker recognition; telephony; adaptively trained acoustic model; feature space maximum a posteriori linear regression; feature space maximum likelihood linear regression; incremental adaptation; online speaker adaptation; online telephony application; sparse adaptation data; speaker adaptive acoustic modeling; telephony speech data; text-independent adaptation; text-independent online adaptation; Acoustic applications; Acoustic testing; Context modeling; Hidden Markov models; Loudspeakers; Maximum likelihood linear regression; Parameter estimation; Space technology; Speech recognition; Telephony; automatic speech recognition; on-line adaptation; speaker adaptation; speaker adaptive training; telephony application;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4960601
Filename :
4960601
Link To Document :
بازگشت