DocumentCode :
2239119
Title :
Correcting posteriors by using a feedback synthesis loop in robust ASR
Author :
Glotin, Herve
Author_Institution :
ERSS, Toulouse, France
fYear :
2002
fDate :
3-6 Sept. 2002
Firstpage :
1
Lastpage :
4
Abstract :
Current Automatic Speech Recognition (ASR) systems are not efficient in noisy speech conditions. We propose a new strategy to reinforce ASR robustness, based on a feedback loop from recognition of posteriors to signal synthesis. The key idea is to use phonemes´ posteriors generated by recognition to calculate an acoustic image (AI) at each frame and to calculate its correlation with the input signal. AI is the weighted sum phonemes clean speech spectrum, where weights are directly taken as the corresponding phonemes´ posteriors. Correlation between AI and the input spectrum gives a Recognition Index (RI). We then show how a simple correction function of posteriors´ distribution using RI improves the Word Error Rate in a continuous speech recognition task compared to a state of the art ASR system (Jrasta).
Keywords :
feedback; image recognition; maximum likelihood estimation; signal synthesis; speech recognition; speech synthesis; AI recognition; ASR; RI; acoustic image recognition; automatic speech recognition index; feedback synthesis loop; posterior correction; signal synthesis; weighted sum phonemes; word error rate; Abstracts; Estimation; Hidden Markov models; Image segmentation; Noise; Noise measurement; Robustness;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing Conference, 2002 11th European
Conference_Location :
Toulouse
ISSN :
2219-5491
Type :
conf
Filename :
7072223
Link To Document :
بازگشت