Title :
Dealing with acoustic mismatch for training multilingual subspace Gaussian mixture models for speech recognition
Author :
Mohan, Aanchan ; Ghalehjegh, Sina Hamidi ; Rose, Richard C.
Author_Institution :
Dept. of Electr. & Comput. Eng., McGill Univ., Montreal, QC, Canada
Abstract :
The subspace Gaussian mixture model (SGMM) has been recently proposed as an acoustic modeling technique suitable for configuring multilingual speech recognition systems. It is attractive for this purpose since its parametrization allows its “shared” model parameters to be trained with data from multiple languages [1]. In this work, we report on the results of an experimental study carried out with the goal of improving native Spanish language speech recognition performance using an existing telephone speech corpus of English spoken by speakers of Spanish origin. Compensation for sources of acoustic variability between Spanish and English language data sets was found to be important in obtaining good multilingual ASR performance. We conclude with a discussion about the notion of acoustic similarity between the state dependent parameters of the SGMM, and its possible use in effectively modelling pronunciation variation.
Keywords :
Gaussian processes; speech recognition; English spoken; SGMM; acoustic mismatch; acoustic modeling technique; multilingual ASR performance; native Spanish language speech recognition performance; pronunciation variation; shared model parameters; speakers; speech recognition; state dependent parameters; telephone speech corpus; training multilingual subspace Gaussian mixture models; Acoustics; Adaptation models; Hidden Markov models; Speech; Speech recognition; Training; Vectors; Acoustic Modelling; Multilingual Speech Recognition; Subspace methods;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6289016