Title :
Tied-state based discriminative training of context-expanded region-dependent feature transforms for LVCSR
Author :
Zhi-Jie Yan ; Qiang Huo ; Jian Xu ; Yu Zhang
Author_Institution :
Microsoft Res. Asia, Beijing, China
Abstract :
We present a new discriminative feature transform approach to large vocabulary continuous speech recognition (LVCSR) using Gaussian mixture density hidden Markov models (GMM-HMMs) for acoustic modeling. The feature transform is formulated with a set of context-expanded region-dependent linear transforms (RDLTs) utilizing both long-span features and contextual weight expansion. The RDLTs are estimated by lattice-free, tied-state based discriminative training using maximum mutual information (MMI) criterion, while the GMM-HMMs are trained by conventional lattice-based, boosted MMI training. Compared with two baseline systems, which use RDLTs with either long-span features or weight expansion only and are trained using the conventional lattice-based discriminative training for both RDLTs and HMMs, the proposed approach achieves a relative word error rate reduction of 10% and 6% respectively on Switchboard-1 conversational telephone speech transcription task.
Keywords :
Gaussian processes; hidden Markov models; speech recognition; vocabulary; GMM-HMM; Gaussian mixture density hidden Markov models; LVCSR; Switchboard-1 conversational telephone speech transcription task; acoustic modeling; context-expanded region-dependent feature transforms; discriminative feature transform; large vocabulary continuous speech recognition; maximum mutual information; region-dependent linear transforms; tied-state based discriminative training; Acoustics; Hidden Markov models; Speech; Speech recognition; Training; Transforms; Vectors; HMM; discriminative training; maximum mutual information; region-dependent linear transform; tied-state;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
Conference_Location :
Vancouver, BC
DOI :
10.1109/ICASSP.2013.6639007