Title :
Maximum likelihood adaptation of histogram equalization with constraint for robust speech recognition
Author :
Xiao, Xiong ; Li, Jinyu ; Chng, Eng Siong ; Li, Haizhou
Author_Institution :
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
Abstract :
In this paper, we propose a novel feature space adaptation technique to improve the robustness of speech recognition in noisy environments. Histogram equalization (HEQ) is an effective technique for improving robustness by reducing the difference between clean and noisy features. A weakness of HEQ is that it does not take into account acoustic model, resulting in possible mismatch between HEQ processed features and the acoustic model. In this paper, we propose to adapt HEQ to maximize the likelihood of HEQ-processed features on the acoustic model, with a constraint on the parameters of HEQ. In addition, we use a Gaussian mixture model (GMM) to represent the clean feature space rather than using the acoustic model itself, and this results in both simpler implementation and better results. Experimental results show that HEQ with adaptation reduces word error rate by 7.5% and 5.7% respectively on Aurora-2 and Aurora-4 tasks over the HEQ baseline without adaptation.
Keywords :
Gaussian processes; speech recognition; GMM; Gaussian mixture model; HEQ; acoustic model; feature space adaptation technique; histogram equalization; maximum likelihood adaptation; noisy environment; robust speech recognition; Acoustics; Adaptation models; Hidden Markov models; Histograms; Robustness; Speech; Speech recognition; feature adaptation; feature normalization; histogram equalization; maximum likelihood; robust speech recognition;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2011.5947599