DocumentCode :
2880548
Title :
Causal analysis of Speech Recognition failure in adverse environments
Author :
Zhou, Guojun ; Deisher, Michael E. ; Sharma, Sangita
Author_Institution :
Intel Corporation, M/S JF2-86, 2111 NE 25th Ave, Hillsboro, OR 97124, USA
Volume :
4
fYear :
2002
fDate :
13-17 May 2002
Abstract :
A common approach to measuring the impact of noise and the effectiveness of noise mitigation (NM) algorithms for Automatic Speech Recognition (ASR) systems is to compare the word error rates (WERs). However, the WER measure does not give much insight into how an NM algorithm affects phoneme-level acoustic characteristics. Such insight can help in tuning the NM parameters and may also lead to reduced research time because the impact of an NM algorithm on ASR can first be investigated on smaller corpora. In this paper, two measures, phoneme error rate (PER) and phoneme confidence score (PCS), are investigated to assess the impact of NM algorithms on the ASR performance. Experimental results using the TIMIT corpus show that both PER and PCS can help identify where the degradation from noise occurs as well as give a useful indication of how an NM algorithm may impact ASR performance. A diagnostic method based on these two measures is also proposed to assess the NM impact on ASR and help improve the NM algorithm performance.
Keywords :
Character recognition; Noise measurement;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on
Conference_Location :
Orlando, FL, USA
ISSN :
1520-6149
Print_ISBN :
0-7803-7402-9
Type :
conf
DOI :
10.1109/ICASSP.2002.5745488
Filename :
5745488
Link To Document :
بازگشت