مرکز منطقه ای اطلاع رساني علوم و فناوري - Acoustic modeling for under-resourced languages based on vectorial HMM-states representation using Subspace Gaussian Mixture Models

DocumentCode :

3132655

Title :

Acoustic modeling for under-resourced languages based on vectorial HMM-states representation using Subspace Gaussian Mixture Models

Author :

Bouallegue, Mohamed ; Ferreira, Eija ; matrouf, driss driss ; Linares, Georges ; Goudi, M. ; Nocera, P.

Author_Institution :

LIA, Univ. of Avignon, Avignon, France

fYear :

2012

fDate :

2-5 Dec. 2012

Firstpage :

330

Lastpage :

335

Abstract :

This paper explores a novel method for context-dependent models in automatic speech recognition (ASR), in the context of under-resourced languages. We present a simple way to realize a tying states approach, based on a new vectorial representation of the HMM states. This vectorial representation is considered as a vector of a low number of parameters obtained by the Subspace Gaussian Mixture Models paradigm (SGMM). The proposed method does not require phonetic knowledge or a large amount of data, which represent the major problems of acoustic modeling for under-resourced languages. This paper shows how this representation can be obtained and used for tying states. Our experiments, applied on Vietnamese, show that this approach achieves a stable gain compared to the classical approach which is based on decision trees. Furthermore, this method appears to be portable to other languages, as shown in the preliminary study conducted on Berber.

Keywords :

Gaussian processes; acoustic signal processing; decision trees; hidden Markov models; natural language processing; speech recognition; ASR; Berber; SGMM; Vietnamese; acoustic modeling; automatic speech recognition; context-dependent models; decision trees; subspace Gaussian mixture models; tying states approach; underresourced languages; vectorial HMM-states representation; Acoustics; Adaptation models; Context modeling; Data models; Decision trees; Hidden Markov models; Vectors; Acoustic Modelling; HMM-state vector representation; Subspace Gaussian Mixture Models; state-tying; under-resourced languages;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Spoken Language Technology Workshop (SLT), 2012 IEEE

Conference_Location :

Miami, FL

Print_ISBN :

978-1-4673-5125-6

Electronic_ISBN :

978-1-4673-5124-9

Type :

conf

DOI :

10.1109/SLT.2012.6424245

Filename :

6424245

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3132655