DocumentCode :
2066228
Title :
Local Mismatch Phone for Confidence Measure in Standard and Accented Chinese Speech Recognition
Author :
Cao Wenxiao ; Liu Yi ; Zheng, Thomas Fang
Author_Institution :
Div. of Tech. Innovation & Dev., Tsinghua Nat. Lab. for Inf. Sci. & Technol., Beijing, China
fYear :
2008
fDate :
16-19 Dec. 2008
Firstpage :
1
Lastpage :
4
Abstract :
High error rate in speech recognition is largely due to effects of phone local mismatch caused by unclear speaking or noises. In this paper, we propose an approach of using local mismatch phone to improve the reliability of confidence measure. The features of local mismatch phone can be extracted from the recognition phone sequence by computing occurrence frequency of each phone and comparing with a preset threshold. Occurrence frequency is defined as occurrence time of recognition phone in its frame best phone sequence divided by interval. Frame best phone is the symbol of HMM state at the end of maximum likelihood token at certain frame. The effectiveness of this feature is evaluated on standard and accented Mandarin speech databases. It gives significant Equal Error Rate reduction of 19.7% and 8.4%, respectively. In addition to fast computation, this feature is independent of acoustic model, and is convenient for combination with other features.
Keywords :
hidden Markov models; maximum likelihood estimation; natural language processing; reliability; speech recognition; HMM state; accented Chinese speech recognition; confidence measure; local mismatch phone; maximum likelihood token; recognition phone sequence; reliability; Acoustic measurements; Automatic speech recognition; Error analysis; Frequency; Hidden Markov models; Measurement standards; Noise measurement; Probability; Speech recognition; Standards development;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Chinese Spoken Language Processing, 2008. ISCSLP '08. 6th International Symposium on
Conference_Location :
Kunming
Print_ISBN :
978-1-4244-2942-4
Electronic_ISBN :
978-1-4244-2943-1
Type :
conf
DOI :
10.1109/CHINSL.2008.ECP.64
Filename :
4730318
Link To Document :
بازگشت