Title :
A study of an irrelevant variability normalization based discriminative training approach for LVCSR
Author :
Zhang, Yu ; Xu, Jian ; Yan, Zhi-Jie ; Huo, Qiang
Abstract :
This paper presents a discriminative training (DT) approach to irrelevant variability normalization (IVN) based training of feature transforms and hidden Markov models for large vocabulary continuous speech recognition. A speaker-clustering based method is used for acoustic sniffing and maximum mutual information (MMI) is used as a training criterion. Combined with unsupervised adaptation of feature transforms, the IVN-based DT approach achieves a 14.5% relative word error rate reduction over an MMI-trained baseline system on a Switchboard-1 conversational telephone speech transcription task.
Keywords :
hidden Markov models; speech recognition; DT approach; IVN; LVCSR; MMI; acoustic sniffing; discriminative training approach; feature transform training; hidden Markov model; irrelevant variability normalization; large vocabulary continuous speech recognition; maximum mutual information; speaker-clustering based method; switchboard-1 conversational telephone speech transcription task; word error rate reduction; Acoustics; Feature extraction; Hidden Markov models; Speech; Speech recognition; Training; Transforms; LVCSR; acoustic modeling; discriminative training; irrelevant variability normalization; unsupervised adaptation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947556