Title :
Adapting noisy speech models — Extended uncertainty decoding
Author :
Lu, Jianhua ; Ming, Ji ; Woods, Roger
Author_Institution :
Inst. of Electron., Commun. & Inf. Technol., Queen´´s Univ. Belfast, Belfast, UK
Abstract :
Most conventional techniques for noise adaptation assume a clean initial speech model which is adapted to a specific noise condition using adaptation data accumulated from the condition. In this paper, a different problem is considered, i.e. adapting a noisy speech model to a specific noise condition. For example, the initial noisy model may be a multi-condition model which is used to provide more accurate transcripts for the adaptation data than could be provided by a clean model, thereby obtaining a more accurate adaptation. We develop the formulation for this new problem by combining and extending maximum likelihood linear regression (MLLR), constrained MLLR (CMLLR) and uncertainty decoding techniques. We also present an implementation which has been tested on the Aurora 4 database, assuming an initial multi-condition model trained using white noise corrupted data. Significant word error rate (WER) reductions are achieved in comparison with other approaches.
Keywords :
decoding; maximum likelihood estimation; regression analysis; speech coding; Aurora 4 database; constrained maximum likelihood linear regression; extended uncertainty decoding; multicondition model; noise condition; noisy speech models; white noise corrupted data; word error rate reductions; Acoustic noise; Gaussian noise; Maximum likelihood decoding; Maximum likelihood linear regression; Noise robustness; Speech enhancement; Speech recognition; Transforms; Uncertainty; Working environment noise; Aurora 4; noise adaptation; noise compensation; noise robustness; speech recognition;
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2010.5495660