DocumentCode :
3485699
Title :
Regularized subspace Gaussian mixture models for cross-lingual speech recognition
Author :
Lu, Liang ; Ghoshal, Arnab ; Renals, Steve
Author_Institution :
Centre for Speech Technol. Res., Univ. of Edinburgh, Edinburgh, UK
fYear :
2011
fDate :
11-15 Dec. 2011
Firstpage :
365
Lastpage :
370
Abstract :
We investigate cross-lingual acoustic modelling for low-resource languages using the subspace Gaussian mixture model (SGMM). We assume the presence of acoustic models trained on multiple source languages, and use the global subspace parameters from those models for improved modelling in a target language with limited amounts of transcribed speech. Experiments on the GlobalPhone corpus using Spanish, Portuguese, and Swedish as source languages and German as target language (with 1 hour and 5 hours of transcribed audio) show that multilingually trained SGMM shared parameters result in lower word error rates (WERs) than using those from a single source language. We also show that regularizing the estimation of the SGMM state vectors by penalizing their ℓ1-norm helps to overcome numerical instabilities and leads to lower WER.
Keywords :
Gaussian processes; acoustic signal processing; natural language processing; speech recognition; German; GlobalPhone corpus; Portuguese; Spanish; Swedish; cross-lingual acoustic modelling; cross-lingual speech recognition; global subspace parameter; low resource language; regularized subspace Gaussian mixture model; word error rates; Acoustics; Data models; Estimation; Hidden Markov models; Speech recognition; Training data; Vectors;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on
Conference_Location :
Waikoloa, HI
Print_ISBN :
978-1-4673-0365-1
Electronic_ISBN :
978-1-4673-0366-8
Type :
conf
DOI :
10.1109/ASRU.2011.6163959
Filename :
6163959