Title :
Modulation spectrum exponential weighting for robust speech recognition
Author :
Hao-teng Fan ; Yi-cheng Lian ; Jeih-weih Hung
Author_Institution :
Dept. of Electr. Eng., Nat. Chi Nan Univ., Nantou, Taiwan
Abstract :
In this paper, we present a novel scheme to improve the noise robustness of features in speech recognition for vehicle noise environments. In the algorithm termed modulation spectrum exponential weighting (MSEW), the magnitude spectra of feature streams are updated by integrating a reference magnitude spectrum and the original magnitude spectrum with varying exponential weights based on the signal-to-noise ratio (SNR) of the operating environment. Specifically, we present three modes of MSEW, which can be viewed as a generalization of the two algorithms, modulation spectrum replacement/filtering (MSR/MSF). In experiments conducted on the AURORA-2 noisy digit database, the presented MSEW algorithms can achieve better recognition accuracy rates relative to the original MSR and MSF in various vehicle-noise environments.
Keywords :
filtering theory; speech recognition; AURORA-2 noisy digit database; MSEW algorithms; MSR-MSF; SNR; feature stream magnitude spectra; modulation spectrum exponential weighting; modulation spectrum replacement-filtering; reference magnitude spectrum; robust speech recognition; signal-to-noise ratio; vehicle noise environments; vehicle-noise environments; Accuracy; Modulation; Robustness; Signal to noise ratio; Speech; Speech recognition; modulation spectrum; noise-robust feature; speech recognition; vehicle-noise environment;
Conference_Titel :
ITS Telecommunications (ITST), 2012 12th International Conference on
Conference_Location :
Taipei
Print_ISBN :
978-1-4673-3071-8
Electronic_ISBN :
978-1-4673-3069-5
DOI :
10.1109/ITST.2012.6425295