DocumentCode :
2874984
Title :
Reconstructing spectral vectors with uncertain spectrographic masks for robust speech recognition
Author :
Raj, Bhiksha ; Singh, Rita
Author_Institution :
Mitsubishi Electr. Res. Labs, Cambridge, MA
fYear :
2005
fDate :
27 Nov. 2005
Firstpage :
65
Lastpage :
70
Abstract :
Missing-feature methods improve automatic recognition of noisy speech by removing unreliable, noise-corrupted spectrographic components from the signal. Recognition is then performed either by modifying the recognizer to work from incomplete spectra or by estimating the missing components to reconstruct complete spectra. While the former approach performs optimal classification with incomplete spectrograms, the latter permits recognition with cepstral features derived from the reconstructed spectra. Traditionally, spectral components are considered unequivocally reliable or unreliable. Research has shown that soft masks, which instead assign each spectral component a probability of being reliable, can improve the performance of missing-feature methods that modify the recognizer. However, soft masks have not been employed by methods that reconstruct the spectrogram. In this paper we present a new MMSE algorithm for spectrogram reconstruction. Experiments show that the use of soft masks results in significantly improved performance compared to reconstruction methods that use binary masks.
Keywords :
least mean squares methods; spectrometers; speech recognition; speech synthesis; MMSE algorithm; missing-feature methods; noisy speech; reliability; robust speech recognition; soft masks; spectrogram reconstruction; uncertain spectrographic masks; Automatic speech recognition; Cepstral analysis; Noise level; Noise robustness; Reconstruction algorithms; Signal to noise ratio; Spectrogram; Speech enhancement; Speech recognition; US Department of Transportation
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Automatic Speech Recognition and Understanding, 2005 IEEE Workshop on
Conference_Location :
San Juan
Print_ISBN :
0-7803-9478-X
Electronic_ISBN :
0-7803-9479-8
Type :
conf
DOI :
10.1109/ASRU.2005.1566472
Filename :
1566472