DocumentCode
1653125
Title
A new MFCC improvement method for robust ASR
Author
Yeganeh, Hojatollah ; Ahadi, Seyed Mohammad ; Ziaei, Ali
Author_Institution
Amirkabir Univ. of Technol., Tehran
fYear
2008
Firstpage
643
Lastpage
646
Abstract
The Mel-frequency cepstral coefficients (MFCC) are widely used for speech recognition. However, MFCC-based speech recognition performance degrades in presence of additive noise. In this paper, we propose a set of noise-robust features based on conventional MFCC feature extraction method. Our proposed method consists of two steps. In the first step, Mel sub-band spectral subtraction is carried out. The second step consists of estimating SNR in each sub-band and defining a weight parameter based on this estimation. The weighting has been carried out in a way that gives more important roles in cepstrum parameter formation to sub-bands that are less affected by noise. Experimental results indicate that this method achieves improved performance for ASR in noisy environments. Furthermore, due to the simplicity of the implementation of our method, its computational overhead relative to MFCC is quite small.
Keywords
cepstral analysis; feature extraction; speech recognition; MFCC improvement method; Mel-frequency cepstral coefficients; additive noise; automatic speech recognition; cepstrum parameter formation; computational overhead; feature extraction; noise-robust features; robust ASR; weight parameter; Additive noise; Automatic speech recognition; Cepstral analysis; Degradation; Feature extraction; Mel frequency cepstral coefficient; Noise robustness; Signal to noise ratio; Speech recognition; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing, 2008. ICSP 2008. 9th International Conference on
Conference_Location
Beijing
Print_ISBN
978-1-4244-2178-7
Electronic_ISBN
978-1-4244-2179-4
Type
conf
DOI
10.1109/ICOSP.2008.4697213
Filename
4697213
Link To Document