• DocumentCode
    3132655
  • Title

    Acoustic modeling for under-resourced languages based on vectorial HMM-states representation using Subspace Gaussian Mixture Models

  • Author

    Bouallegue, Mohamed ; Ferreira, Eija ; matrouf, driss driss ; Linares, Georges ; Goudi, M. ; Nocera, P.

  • Author_Institution
    LIA, Univ. of Avignon, Avignon, France
  • fYear
    2012
  • fDate
    2-5 Dec. 2012
  • Firstpage
    330
  • Lastpage
    335
  • Abstract
    This paper explores a novel method for context-dependent models in automatic speech recognition (ASR), in the context of under-resourced languages. We present a simple way to realize a tying states approach, based on a new vectorial representation of the HMM states. This vectorial representation is considered as a vector of a low number of parameters obtained by the Subspace Gaussian Mixture Models paradigm (SGMM). The proposed method does not require phonetic knowledge or a large amount of data, which represent the major problems of acoustic modeling for under-resourced languages. This paper shows how this representation can be obtained and used for tying states. Our experiments, applied on Vietnamese, show that this approach achieves a stable gain compared to the classical approach which is based on decision trees. Furthermore, this method appears to be portable to other languages, as shown in the preliminary study conducted on Berber.
  • Keywords
    Gaussian processes; acoustic signal processing; decision trees; hidden Markov models; natural language processing; speech recognition; ASR; Berber; SGMM; Vietnamese; acoustic modeling; automatic speech recognition; context-dependent models; decision trees; subspace Gaussian mixture models; tying states approach; underresourced languages; vectorial HMM-states representation; Acoustics; Adaptation models; Context modeling; Data models; Decision trees; Hidden Markov models; Vectors; Acoustic Modelling; HMM-state vector representation; Subspace Gaussian Mixture Models; state-tying; under-resourced languages;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Spoken Language Technology Workshop (SLT), 2012 IEEE
  • Conference_Location
    Miami, FL
  • Print_ISBN
    978-1-4673-5125-6
  • Electronic_ISBN
    978-1-4673-5124-9
  • Type

    conf

  • DOI
    10.1109/SLT.2012.6424245
  • Filename
    6424245