Title :
Exemplar-based noise robust automatic speech recognition using modulation spectrogram features
Author :
Baby, Deepak ; Virtanen, Tuomas ; Gemmeke, Jort F. ; Barker, Tom ; Van hamme, Hugo
Author_Institution :
Dept. ESAT, KU Leuven, Leuven, Belgium
Abstract :
We propose a novel exemplar-based feature enhancement method for automatic speech recognition which uses coupled dictionaries: an input dictionary containing atoms sampled in the modulation (envelope) spectrogram domain and an output dictionary with atoms in the Mel or full-resolution frequency domain. The input modulation representation is chosen for its separation properties of speech and noise and for its relation with human auditory processing. The output representation is one which can be processed by the ASR back-end. The proposed method was investigated on the AURORA-2 and AURORA-4 databases and improved word error rates (WER) were obtained when compared to the system which uses Mel features in the input exemplars. The paper also proposes a hybrid system which combines the baseline and the proposed algorithm on the AURORA-2 database which in turn also yielded improvement over both the algorithms.
Keywords :
error statistics; speech enhancement; speech recognition; ASR back-end; AURORA-2; AURORA-4 database; Mel features; coupled dictionaries; exemplar-based feature enhancement; exemplar-based noise robust automatic speech recognition; full-resolution frequency domain; human auditory processing; modulation spectrogram features; spectrogram domain; word error rates; Databases; Dictionaries; Discrete Fourier transforms; Modulation; Noise; Spectrogram; Speech; automatic speech recognition; coupled dictionaries; modulation envelope; non-negative matrix factorisation;
Conference_Titel :
Spoken Language Technology Workshop (SLT), 2014 IEEE
DOI :
10.1109/SLT.2014.7078628