مرکز منطقه ای اطلاع رساني علوم و فناوري - Dealing with acoustic mismatch for training multilingual subspace Gaussian mixture models for speech recognition

DocumentCode :

3166082

Title :

Dealing with acoustic mismatch for training multilingual subspace Gaussian mixture models for speech recognition

Author :

Mohan, Aanchan ; Ghalehjegh, Sina Hamidi ; Rose, Richard C.

Author_Institution :

Dept. of Electr. & Comput. Eng., McGill Univ., Montreal, QC, Canada

fYear :

2012

fDate :

25-30 March 2012

Firstpage :

4893

Lastpage :

4896

Abstract :

The subspace Gaussian mixture model (SGMM) has been recently proposed as an acoustic modeling technique suitable for configuring multilingual speech recognition systems. It is attractive for this purpose since its parametrization allows its “shared” model parameters to be trained with data from multiple languages [1]. In this work, we report on the results of an experimental study carried out with the goal of improving native Spanish language speech recognition performance using an existing telephone speech corpus of English spoken by speakers of Spanish origin. Compensation for sources of acoustic variability between Spanish and English language data sets was found to be important in obtaining good multilingual ASR performance. We conclude with a discussion about the notion of acoustic similarity between the state dependent parameters of the SGMM, and its possible use in effectively modelling pronunciation variation.

Keywords :

Gaussian processes; speech recognition; English spoken; SGMM; acoustic mismatch; acoustic modeling technique; multilingual ASR performance; native Spanish language speech recognition performance; pronunciation variation; shared model parameters; speakers; speech recognition; state dependent parameters; telephone speech corpus; training multilingual subspace Gaussian mixture models; Acoustics; Adaptation models; Hidden Markov models; Speech; Speech recognition; Training; Vectors; Acoustic Modelling; Multilingual Speech Recognition; Subspace methods;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

Conference_Location :

Kyoto

ISSN :

1520-6149

Print_ISBN :

978-1-4673-0045-2

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2012.6289016

Filename :

6289016

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3166082