Title :
A simplified Subspace Gaussian Mixture to compact acoustic models for speech recognition
Author :
Bouallegue, Mohamed ; Matrouf, Driss ; Linares, Georges
Author_Institution :
LIA, Univ. of Avignon, Avignon, France
Abstract :
Speech recognition applications are known to require a significant amount of resources (memory, computing power). However, embedded speech recognition systems, such as in mobile phones, only authorizes few KB of memory and few MIPS. In the context of HMM-based speech recognizers, each HMM-state distribution is modeled independently from to the other and has a large amount of parameters. In spite of using state-tying techniques, the size of the acoustic models stays large and certain redundancy remains between states. In this paper, we investigate the capacity of the Subspace Gaussian Mixture approach to reduce the acoustic models size while keeping good performances. We introduce a simplification concerning state specific Gaussians weights estimation, which is a very complex and time consuming procedure in the original approach. With this approach, we show that the acoustic model size can be reduced by 92% with almost the same performance as the standard acoustic modeling.
Keywords :
Gaussian processes; hidden Markov models; speech recognition; Gaussian weight estimation; HMM-based speech recognizer; HMM-state distribution model; MIPS; acoustic modeling; embedded speech recognition system; memory KB; mobile phone; subspace Gaussian mixture approach; Acoustics; Hidden Markov models; Mathematical model; Speech; Speech processing; Speech recognition; Training; Compact Acoustic Models; Embedded speech recognition; Gaussian Mixture Models; Hidden Markov Models; Subspace Gaussian Mixture;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947453