• DocumentCode
    701782
  • Title

    Investigation of different acoustic modeling techniques for low resource Indian language data

  • Author

    Sriranjani, R. ; Murali Karthick, B. ; Umesh, S.

  • Author_Institution
    Dept. of Electr. Eng., Indian Inst. of Technol. Madras, Chennai, India
  • fYear
    2015
  • fDate
    Feb. 27 2015-March 1 2015
  • Firstpage
    1
  • Lastpage
    5
  • Abstract
    In this paper, we investigate the performance of deep neural network (DNN) and Subspace Gaussian mixture model (SGMM) in low-resource condition. Even though DNN outperforms SGMM and continuous density hidden Markov models (CDHMM) for high-resource data, it degrades in performance while modeling low-resource data. Our experimental results show that SGMM outperforms DNN for limited transcribed data. To resolve this problem in DNN, we propose to train DNN containing bottleneck layer in two stages: First stage involves extraction of bottleneck features. In second stage, the extracted bottleneck features from first stage are used to train DNN having bottleneck layer. All our experiments are performed using two Indian languages (Tamil & Hindi) in Mandi database. Our proposed method shows improved performance when compared to baseline SGMM and DNN models for limited training data.
  • Keywords
    Gaussian processes; hidden Markov models; mixture models; natural language processing; neural nets; CDHMM; DNN; Mandi database; SGMM; continuous density hidden Markov models; deep neural network; different acoustic modeling techniques; low resource Indian language data; subspace Gaussian mixture model; Acoustics; Data models; Databases; Feature extraction; Hidden Markov models; Training; Training data; DNN; Hindi; Indian languages; SGMM; Tamil; bottleneck; low resource data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications (NCC), 2015 Twenty First National Conference on
  • Conference_Location
    Mumbai
  • Type

    conf

  • DOI
    10.1109/NCC.2015.7084860
  • Filename
    7084860