Title :
Multi-style training of HMMS with stereo data for reverberation-robust speech recognition
Author :
Sehr, Armin ; Hofmann, Christian ; Maas, Roland ; Kellermann, Walter
Author_Institution :
Multimedia Commun. & Signal Process., Univ. of Erlangen-Nuremberg, Erlangen, Germany
fDate :
May 30 2011-June 1 2011
Abstract :
A novel training algorithm using data pairs of clean and reverberant feature vectors for estimating robust Hidden Markov Models (HMMs), introduced in for matched training, is employed in this paper for multi-style training. The multi-style HMMs are derived from well-trained clean-speech HMMs by aligning the clean data to the clean-speech HMM and using the resulting state-frame alignment to estimate the Gaussian mixture densities from the reverberant data of several different rooms. Thus, the temporal alignment is fixed for all reverberation conditions contained in the multi-style training set so that the model mismatch between the different rooms is reduced. Therefore, this training approach is particularly suitable for multi-style training. Multi-style HMMs trained by the proposed approach and adapted to the current room condition using maximum likelihood linear regression significantly outperform the corresponding adapted multi-style HMMs trained by the conventional Baum-Welch algorithm. In strongly reverberant rooms, the proposed adapted multi-style HMMs even outper-form Baum-Welch HMMs trained on matched data.
Keywords :
hidden Markov models; maximum likelihood estimation; speech recognition; Baum-Welch algorithm; Gaussian mixture densities; HMMS; maximum likelihood linear regression; multistyle training; reverberation-robust speech recognition; robust hidden Markov models; state-frame alignment; stereo data; Adaptation models; Hidden Markov models; Reverberation; Speech; Speech recognition; Training; Training data; Multi-style HMMtraining; distant-talking speech recognition; reverberation; robust ASR; stereo data;
Conference_Titel :
Hands-free Speech Communication and Microphone Arrays (HSCMA), 2011 Joint Workshop on
Conference_Location :
Edinburgh
Print_ISBN :
978-1-4577-0997-5
DOI :
10.1109/HSCMA.2011.5942396