• DocumentCode
    2239119
  • Title

    Correcting posteriors by using a feedback synthesis loop in robust ASR

  • Author

    Glotin, Herve

  • Author_Institution
    ERSS, Toulouse, France
  • fYear
    2002
  • fDate
    3-6 Sept. 2002
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    Current Automatic Speech Recognition (ASR) systems are not efficient in noisy speech conditions. We propose a new strategy to reinforce ASR robustness, based on a feedback loop from recognition of posteriors to signal synthesis. The key idea is to use phonemes´ posteriors generated by recognition to calculate an acoustic image (AI) at each frame and to calculate its correlation with the input signal. AI is the weighted sum phonemes clean speech spectrum, where weights are directly taken as the corresponding phonemes´ posteriors. Correlation between AI and the input spectrum gives a Recognition Index (RI). We then show how a simple correction function of posteriors´ distribution using RI improves the Word Error Rate in a continuous speech recognition task compared to a state of the art ASR system (Jrasta).
  • Keywords
    feedback; image recognition; maximum likelihood estimation; signal synthesis; speech recognition; speech synthesis; AI recognition; ASR; RI; acoustic image recognition; automatic speech recognition index; feedback synthesis loop; posterior correction; signal synthesis; weighted sum phonemes; word error rate; Abstracts; Estimation; Hidden Markov models; Image segmentation; Noise; Noise measurement; Robustness;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2002 11th European
  • Conference_Location
    Toulouse
  • ISSN
    2219-5491
  • Type

    conf

  • Filename
    7072223