DocumentCode :
3165775
Title :
Unsupervised training of subspace gaussian mixture models for conversational telephone speech recognition
Author :
Ma, Zejun ; Wang, XiaoRui ; Xu, Bo
Author_Institution :
Digital Content Technol. Res. Center, Inst. of Autom., Beijing, China
fYear :
2012
fDate :
25-30 March 2012
Firstpage :
4829
Lastpage :
4832
Abstract :
This paper presents our preliminary works on exploring unsupervised training of subspace gaussian mixture models for under-resourced CTS recognition task. The subspace model yields better performance than conventional GMM model, particularly in small or middle-sized training set. As an effective way to save human efforts, unsupervised learning is often applied to automatically transcribe a large amount of speech archives. The additional auto-transcribed data may help to improve model accuracy. In this paper, experiments are carried out on two publicly available English conversational telephone speech corpora. Both GMM and SGMM model in combination with unsupervised learning are examined and compared in this paper.
Keywords :
Gaussian processes; learning (artificial intelligence); speech recognition; English conversational telephone speech corpora; SGMM model; autotranscribed data; conversational telephone speech recognition; middle-sized training set; speech archives; subspace Gaussian mixture models; under-resourced CTS recognition task; unsupervised learning; unsupervised training; Acoustics; Data models; Hidden Markov models; Speech; Speech recognition; Training; Unsupervised learning; Speech recognition with low resources; subspace acoustic model; unsupervised learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
ISSN :
1520-6149
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2012.6289000
Filename :
6289000
Link To Document :
بازگشت