• DocumentCode
    37970
  • Title

    Maximizing Phoneme Recognition Accuracy for Enhanced Speech Intelligibility in Noise

  • Author

    Petkov, Petko N. ; Henter, G.E. ; Kleijn, W. Bastiaan

  • Author_Institution
    Sch. of Electr. Eng., KTH-R. Inst. of Technol., Stockholm, Sweden
  • Volume
    21
  • Issue
    5
  • fYear
    2013
  • fDate
    May-13
  • Firstpage
    1035
  • Lastpage
    1045
  • Abstract
    An effective measure of speech intelligibility is the probability of correct recognition of the transmitted message. We propose a speech pre-enhancement method based on matching the recognized text to the text of the original message. The selected criterion is accurately approximated by the probability of the correct transcription given an estimate of the noisy speech features. In the presence of environment noise, and with a decrease in the signal-to-noise ratio, speech intelligibility declines. We implement a speech pre-enhancement system that optimizes the proposed criterion for the parameters of two distinct speech modification strategies under an energy-preservation constraint. The proposed method requires prior knowledge in the form of a transcription of the transmitted message and acoustic speech models from an automatic speech recognition system. Performance results from an open-set subjective intelligibility test indicate a significant improvement over natural speech and a reference system that optimizes a perceptual-distortion-based objective intelligibility measure. The computational complexity of the approach permits use in on-line applications.
  • Keywords
    computational complexity; speech enhancement; speech recognition; acoustic speech models; automatic speech recognition system; computational complexity; correct recognition probability; energy-preservation constraint; environment noise; natural speech; perceptual-distortion-based objective intelligibility; phoneme recognition accuracy; reference system; speech intelligibility; speech intelligibility declines; speech preenhancement method; speech preenhancement system; transmitted message; Distortion measurement; Noise; Noise measurement; Production; Speech; Speech enhancement; Speech recognition; Environment adaptation; intelligibility enhancement; speech pre-enhancement;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2013.2244089
  • Filename
    6425441