DocumentCode
701782
Title
Investigation of different acoustic modeling techniques for low resource Indian language data
Author
Sriranjani, R. ; Murali Karthick, B. ; Umesh, S.
Author_Institution
Dept. of Electr. Eng., Indian Inst. of Technol. Madras, Chennai, India
fYear
2015
fDate
Feb. 27 2015-March 1 2015
Firstpage
1
Lastpage
5
Abstract
In this paper, we investigate the performance of deep neural network (DNN) and Subspace Gaussian mixture model (SGMM) in low-resource condition. Even though DNN outperforms SGMM and continuous density hidden Markov models (CDHMM) for high-resource data, it degrades in performance while modeling low-resource data. Our experimental results show that SGMM outperforms DNN for limited transcribed data. To resolve this problem in DNN, we propose to train DNN containing bottleneck layer in two stages: First stage involves extraction of bottleneck features. In second stage, the extracted bottleneck features from first stage are used to train DNN having bottleneck layer. All our experiments are performed using two Indian languages (Tamil & Hindi) in Mandi database. Our proposed method shows improved performance when compared to baseline SGMM and DNN models for limited training data.
Keywords
Gaussian processes; hidden Markov models; mixture models; natural language processing; neural nets; CDHMM; DNN; Mandi database; SGMM; continuous density hidden Markov models; deep neural network; different acoustic modeling techniques; low resource Indian language data; subspace Gaussian mixture model; Acoustics; Data models; Databases; Feature extraction; Hidden Markov models; Training; Training data; DNN; Hindi; Indian languages; SGMM; Tamil; bottleneck; low resource data;
fLanguage
English
Publisher
ieee
Conference_Titel
Communications (NCC), 2015 Twenty First National Conference on
Conference_Location
Mumbai
Type
conf
DOI
10.1109/NCC.2015.7084860
Filename
7084860
Link To Document