مرکز منطقه ای اطلاع رساني علوم و فناوري - Unsupervised training of subspace gaussian mixture models for conversational telephone speech recognition

DocumentCode :

3165775

Title :

Unsupervised training of subspace gaussian mixture models for conversational telephone speech recognition

Author :

Ma, Zejun ; Wang, XiaoRui ; Xu, Bo

Author_Institution :

Digital Content Technol. Res. Center, Inst. of Autom., Beijing, China

fYear :

2012

fDate :

25-30 March 2012

Firstpage :

4829

Lastpage :

4832

Abstract :

This paper presents our preliminary works on exploring unsupervised training of subspace gaussian mixture models for under-resourced CTS recognition task. The subspace model yields better performance than conventional GMM model, particularly in small or middle-sized training set. As an effective way to save human efforts, unsupervised learning is often applied to automatically transcribe a large amount of speech archives. The additional auto-transcribed data may help to improve model accuracy. In this paper, experiments are carried out on two publicly available English conversational telephone speech corpora. Both GMM and SGMM model in combination with unsupervised learning are examined and compared in this paper.

Keywords :

Gaussian processes; learning (artificial intelligence); speech recognition; English conversational telephone speech corpora; SGMM model; autotranscribed data; conversational telephone speech recognition; middle-sized training set; speech archives; subspace Gaussian mixture models; under-resourced CTS recognition task; unsupervised learning; unsupervised training; Acoustics; Data models; Hidden Markov models; Speech; Speech recognition; Training; Unsupervised learning; Speech recognition with low resources; subspace acoustic model; unsupervised learning;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on

Conference_Location :

Kyoto

ISSN :

1520-6149

Print_ISBN :

978-1-4673-0045-2

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2012.6289000

Filename :

6289000

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3165775