مرکز منطقه ای اطلاع رساني علوم و فناوري - On-line speaker adaptation on telephony speech data with adaptively trained acoustic models

DocumentCode :

3529504

Title :

On-line speaker adaptation on telephony speech data with adaptively trained acoustic models

Author :

Giuliani, Diego ; Gretter, Roberto ; Brugnara, Fabio

Author_Institution :

Human Language Technol. Res. Unit, FBK-irst - Fondazione Bruno Kessler, Povo

fYear :

2009

fDate :

19-24 April 2009

Firstpage :

4385

Lastpage :

4388

Abstract :

This paper addresses speaker adaptive acoustic modeling, based on feature space maximum likelihood linear regression, in the context of on-line telephony applications. An adaptive acoustic modeling method, that we previously proved effective in off-line applications, is used to train acoustic models to be used in text-dependent and text-independent on-line adaptation. Experiments on telephony speech data indicate that feature space maximum a posteriori linear regression (fMAPLR) greatly helps to cope with sparse adaptation data when performing instantaneous and incremental adaptation with both baseline models and speaker adaptively trained models. The use of speaker adaptively trained models in conjunction with fMAPLR leads to the best recognition results in both instantaneous and incremental adaptation. The proposed text-independent adaptation approach, exploiting speaker adaptively trained models, is also proven effective.

Keywords :

speaker recognition; telephony; adaptively trained acoustic model; feature space maximum a posteriori linear regression; feature space maximum likelihood linear regression; incremental adaptation; online speaker adaptation; online telephony application; sparse adaptation data; speaker adaptive acoustic modeling; telephony speech data; text-independent adaptation; text-independent online adaptation; Acoustic applications; Acoustic testing; Context modeling; Hidden Markov models; Loudspeakers; Maximum likelihood linear regression; Parameter estimation; Space technology; Speech recognition; Telephony; automatic speech recognition; on-line adaptation; speaker adaptation; speaker adaptive training; telephony application;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on

Conference_Location :

Taipei

ISSN :

1520-6149

Print_ISBN :

978-1-4244-2353-8

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2009.4960601

Filename :

4960601

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3529504