DocumentCode
2239119
Title
Correcting posteriors by using a feedback synthesis loop in robust ASR
Author
Glotin, Herve
Author_Institution
ERSS, Toulouse, France
fYear
2002
fDate
3-6 Sept. 2002
Firstpage
1
Lastpage
4
Abstract
Current Automatic Speech Recognition (ASR) systems perform poorly on noisy speech. We propose a new strategy to reinforce ASR robustness, based on a feedback loop from the recognition posteriors to signal synthesis. The key idea is to use the phoneme posteriors generated by recognition to compute an acoustic image (AI) at each frame and then to compute its correlation with the input signal. The AI is the weighted sum of the phonemes' clean-speech spectra, where the weights are taken directly as the corresponding phoneme posteriors. The correlation between the AI and the input spectrum gives a Recognition Index (RI). We then show how a simple correction function applied to the posterior distribution using the RI improves the Word Error Rate in a continuous speech recognition task compared to a state-of-the-art ASR system (Jrasta).
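The two quantities defined in the abstract can be illustrated with a minimal sketch for a single frame. Everything specific here is an assumption made for illustration, not taken from the paper: the function names `acoustic_image` and `recognition_index`, the toy four-bin spectra, the two-phoneme inventory, and the choice of Pearson correlation as the correlation measure. The correction function applied to the posteriors is not specified in the abstract and is therefore not sketched.

```python
import math


def acoustic_image(posteriors, clean_spectra):
    """Weighted sum of per-phoneme clean-speech spectra for one frame.

    posteriors:    dict mapping phoneme -> posterior probability (the weights)
    clean_spectra: dict mapping phoneme -> list of spectral values (hypothetical templates)
    """
    n_bins = len(next(iter(clean_spectra.values())))
    image = [0.0] * n_bins
    for phoneme, weight in posteriors.items():
        for k, value in enumerate(clean_spectra[phoneme]):
            image[k] += weight * value
    return image


def recognition_index(image, observed):
    """Correlation between the acoustic image and the observed spectrum.

    Pearson correlation is an assumption; the abstract only says "correlation".
    Returns 0.0 when either vector is constant (correlation undefined).
    """
    n = len(image)
    mean_i = sum(image) / n
    mean_o = sum(observed) / n
    cov = sum((a - mean_i) * (b - mean_o) for a, b in zip(image, observed))
    var_i = sum((a - mean_i) ** 2 for a in image)
    var_o = sum((b - mean_o) ** 2 for b in observed)
    if var_i == 0.0 or var_o == 0.0:
        return 0.0
    return cov / math.sqrt(var_i * var_o)


# Toy data (hypothetical): two phonemes, four spectral bins.
clean_spectra = {"a": [1.0, 2.0, 3.0, 4.0], "s": [4.0, 3.0, 2.0, 1.0]}
posteriors = {"a": 0.75, "s": 0.25}

image = acoustic_image(posteriors, clean_spectra)
ri_match = recognition_index(image, [1.0, 2.0, 3.0, 4.0])     # input resembles "a"
ri_mismatch = recognition_index(image, [4.0, 3.0, 2.0, 1.0])  # input resembles "s"
```

In this toy case the posteriors favour "a", so an input spectrum shaped like the "a" template yields a high index and one shaped like "s" yields a low index. A low RI would signal that the recognizer's posteriors disagree with the observed signal, which is the situation the correction step described in the abstract is meant to address.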
Keywords
feedback; image recognition; maximum likelihood estimation; signal synthesis; speech recognition; speech synthesis; AI recognition; ASR; RI; acoustic image recognition; automatic speech recognition index; feedback synthesis loop; posterior correction; weighted sum phonemes; word error rate; Abstracts; Estimation; Hidden Markov models; Image segmentation; Noise; Noise measurement; Robustness
fLanguage
English
Publisher
IEEE
Conference_Titel
Signal Processing Conference, 2002 11th European
Conference_Location
Toulouse
ISSN
2219-5491
Type
conf
Filename
7072223