Title :
Dirichlet Mixture Models of neural net posteriors for HMM-based speech recognition
Author :
Balakrishnan, Venkataramanan ; Sivaram, G.S.V.S. ; Khudanpur, Sanjeev
Author_Institution :
Dept. of Electr. & Comput. Eng., Johns Hopkins Univ., Baltimore, MD, USA
Abstract :
In this paper, we present a novel technique for modeling the posterior probability estimates obtained from a neural net work directly in the HMM framework using the Dirichlet Mixture Models (DMMs). Since posterior probability vectors lie on a probability simplex their distribution can be modeled using DMMs. Being in an exponential family, the parameters of DMMs can be estimated in an efficient manner. Conventional approaches like TANDEM attempt to gaussianize the posteriors by suitable transforms and model them using Gaussian Mixture Models (GMMs). This requires more number of parameters as it does not exploit the fact that the probability vectors lie on a simplex. We demonstrate through TIMIT phoneme recognition experiments that the proposed technique outperforms the conventional TANDEM approach.
Keywords :
Gaussian processes; hidden Markov models; speech recognition; statistical distributions; DMM; Dirichlet mixture models; GMM; Gaussian Mixture Models; HMM-based speech recognition; TANDEM approach; TIMIT phoneme recognition; neural net posteriors; posterior probability estimates; Computational modeling; Data models; Feature extraction; Hidden Markov models; Probability; Speech; Training; Dirichlet distribution; HMMs; neural network posteriors;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947486