DocumentCode
2831679
Title
Enhancing the sub-band modulation spectra of speech features via nonnegative matrix factorization for robust speech recognition
Author
Fan, Hao-Teng ; Tsai, Yi-chang ; Hung, Jeih-weih
Author_Institution
Dept. of Electr. Eng., Nat. Chi Nan Univ., Nantou, Taiwan
fYear
2012
fDate
June 30 2012-July 2 2012
Firstpage
179
Lastpage
182
Abstract
In this paper, we propose to enhance the modulation spectrum of speech features in noise robustness via the technique of nonnegative matrix factorization (NMF). With NMF, a set of nonnegative basis spectra vectors is derived from the clean speech to represent the important components for speech recognition. However, compared with the original NMF-based scheme that employs iterative search to update the full-band modulation spectra, we propose to apply the orthogonal projection to update the low sub-band modulation spectra. In contrast to the original scheme, the presented new process significantly reduces the computation complexity without the cost of degraded recognition performance. In the experiments conducted on the Aurora-2 database, we show that the presented new NMF-based approach can provide an average error reduction rate of over 65% relative as compared with the baseline MFCC system.
Keywords
computational complexity; iterative methods; matrix decomposition; search problems; signal denoising; speech recognition; Aurora-2 database; NMF-based scheme; computation complexity reduction; fullband modulation spectra; iterative search; noise robustness; nonnegative basis spectra vectors; nonnegative matrix factorization; orthogonal projection; robust speech recognition; speech features; subband modulation spectra enhancement; Accuracy; Modulation; Noise; Noise robustness; Speech; Speech recognition; Vectors; modulation spectrum; noise robustness; nonnegative matrix factorization; speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
System Science and Engineering (ICSSE), 2012 International Conference on
Conference_Location
Dalian, Liaoning
Print_ISBN
978-1-4673-0944-8
Electronic_ISBN
978-1-4673-0943-1
Type
conf
DOI
10.1109/ICSSE.2012.6257172
Filename
6257172
Link To Document