• DocumentCode
    164845
  • Title

    Investigating stranded GMM for improving automatic speech recognition

  • Author

    Gorin, Arseniy ; Jouvet, Denis ; Vincent, Emmanuel ; Dung Tran

  • Author_Institution
    Speech Group, Inria, Villers-lès-Nancy, France
  • fYear
    2014
  • fDate
    12-14 May 2014
  • Firstpage
    192
  • Lastpage
    196
  • Abstract
    This paper investigates recently proposed Stranded Gaussian Mixture acoustic Model (SGMM) for Automatic Speech Recognition (ASR). This model extends conventional hidden Markov model (HMM-GMM) by explicitly introducing dependencies between components of the observation Gaussian mixture densities. The main objective of the paper is to experimentally study, how useful SGMM can be for dealing with data, which contains different sources of acoustic variability. First studied sources of variability are age and gender in quiet environment (TIdigits task including child speech). Second, the SGMM modeling is applied on data produced by different speakers and corrupted by non-stationary noise (CHiME 2013 challenge data). Finally, SGMM is applied on the same noisy data, but after performing speech enhancement (i.e., the remaining variability mostly comes from residual noise and different speakers). Although SGMM was originally proposed for robust speech recognition of noisy data, in this work it was found, that the model is more efficient for handling speaker variability in quiet environment.
  • Keywords
    Gaussian processes; hidden Markov models; mixture models; speech enhancement; speech recognition; ASR; HMM-GMM; SGMM; acoustic variability; automatic speech recognition; hidden Markov model; noisy data; quiet environment; speaker variability; speech enhancement; stranded GMM; stranded Gaussian mixture acoustic model; Hidden Markov models; Noise; Noise measurement; Speech; Speech recognition; Training; Trajectory; dynamic Bayesian network; hidden Markov model; robust speech recognition; trajectory modeling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Hands-free Speech Communication and Microphone Arrays (HSCMA), 2014 4th Joint Workshop on
  • Conference_Location
    Villers-les-Nancy
  • Type

    conf

  • DOI
    10.1109/HSCMA.2014.6843278
  • Filename
    6843278