Title :
Enhancing the sub-band modulation spectra of speech features via nonnegative matrix factorization for robust speech recognition
Author :
Fan, Hao-Teng ; Tsai, Yi-chang ; Hung, Jeih-weih
Author_Institution :
Dept. of Electr. Eng., Nat. Chi Nan Univ., Nantou, Taiwan
fDate :
June 30 2012-July 2 2012
Abstract :
In this paper, we propose to enhance the modulation spectrum of speech features in noise robustness via the technique of nonnegative matrix factorization (NMF). With NMF, a set of nonnegative basis spectra vectors is derived from the clean speech to represent the important components for speech recognition. However, compared with the original NMF-based scheme that employs iterative search to update the full-band modulation spectra, we propose to apply the orthogonal projection to update the low sub-band modulation spectra. In contrast to the original scheme, the presented new process significantly reduces the computation complexity without the cost of degraded recognition performance. In the experiments conducted on the Aurora-2 database, we show that the presented new NMF-based approach can provide an average error reduction rate of over 65% relative as compared with the baseline MFCC system.
Keywords :
computational complexity; iterative methods; matrix decomposition; search problems; signal denoising; speech recognition; Aurora-2 database; NMF-based scheme; computation complexity reduction; fullband modulation spectra; iterative search; noise robustness; nonnegative basis spectra vectors; nonnegative matrix factorization; orthogonal projection; robust speech recognition; speech features; subband modulation spectra enhancement; Accuracy; Modulation; Noise; Noise robustness; Speech; Speech recognition; Vectors; modulation spectrum; noise robustness; nonnegative matrix factorization; speech recognition;
Conference_Titel :
System Science and Engineering (ICSSE), 2012 International Conference on
Conference_Location :
Dalian, Liaoning
Print_ISBN :
978-1-4673-0944-8
Electronic_ISBN :
978-1-4673-0943-1
DOI :
10.1109/ICSSE.2012.6257172