DocumentCode :
2703724
Title :
Exploiting Uncertainties for Binaural Speech Recognition
Author :
SRINIVASAN, SUDARSHAN ; Roman, N. ; DeLiang Wang
Author_Institution :
Dept. of Biomedical Eng., Ohio State Univ., Columbus, OH, USA
Volume :
4
fYear :
2007
fDate :
15-20 April 2007
Abstract :
Recently several algorithms have been proposed to enhance noisy speech by estimating the signal-to-noise ratio (SNR) within a local time-frequency region based on binaural cues of interaural time and intensity differences (ITD and IID). However, the accuracy of the estimated SNR often varies widely across time and frequency, causing uncertainties in the enhanced speech features. We estimate this uncertainty based on statistics of ITD and IID and show that it can be effectively exploited to improve robust speech recognition. Systematic evaluations using the estimated uncertainty show significant improvement in recognition performance compared to the baseline performance.
Keywords :
feature extraction; speech enhancement; speech recognition; statistics; SNR; binaural speech recognition; intensity differences; interaural time; signal-to-noise ratio; speech features enhancement; uncertainty estimation; Acoustic noise; Automatic speech recognition; Decoding; Robustness; Signal to noise ratio; Speech coding; Speech enhancement; Speech recognition; Time frequency analysis; Uncertainty; binaural processing; computational auditory scene analysis; missing-data recognition; robust speech recognition; uncertainty decoding;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location :
Honolulu, HI
ISSN :
1520-6149
Print_ISBN :
1-4244-0727-3
Type :
conf
DOI :
10.1109/ICASSP.2007.367031
Filename :
4218219
Link To Document :
بازگشت