• DocumentCode
    61734
  • Title

    Optimizing Speech Intelligibility in a Noisy Environment: A unified view

  • Author

    Kleijn, W.B. ; Crespo, Joao Bernardo ; Hendriks, Richard Christian ; Petkov, Petko ; Sauert, Bastian ; Vary, Peter

  • Author_Institution
    Eng. & Comput. Sci., Victoria Univ. of Wellington, Wellington, New Zealand
  • Volume
    32
  • Issue
    2
  • fYear
    2015
  • fDate
    Mar-15
  • Firstpage
    43
  • Lastpage
    54
  • Abstract
    Modern communication technology facilitates communication from anywhere to anywhere. As a result, low speech intelligibility has become a common problem, which is exacerbated by the lack of feedback to the talker about the rendering environment. In recent years, a range of algorithms has been developed to enhance the intelligibility of speech rendered in a noisy environment. We describe methods for intelligibility enhancement from a unified vantage point. Before one defines a measure of intelligibility, the level of abstraction of the representation must be selected. For example, intelligibility can be measured on the message, the sequence of words spoken, the sequence of sounds, or a sequence of states of the auditory system. Natural measures of intelligibility defined at the message level are mutual information and the hit-or-miss criterion. The direct evaluation of high-level measures requires quantitative knowledge of human cognitive processing. Lower-level measures can be derived from higher-level measures by making restrictive assumptions. We discuss the implementation and performance of some specific enhancement systems in detail, including speech intelligibility index (SII)-based systems and systems aimed at enhancing the sound-field where it is perceived by the listener. We conclude with a discussion of the current state of the field and open problems.
  • Keywords
    speech enhancement; communication technology; hit-or-miss criterion; human cognitive processing; mutual information; noisy environment; representation abstraction level; sound-field enhancement; speech intelligibility enhancement; speech intelligibility index-based systems; speech intelligibility optimization; words sequence; Acoustic measurements; Acoustic noise; Acoustic signal processing; Assitive devices; Distortion measurement; Mutual information; Noise measurement; Psychoacoustic models; Speech processing;
  • fLanguage
    English
  • Journal_Title
    Signal Processing Magazine, IEEE
  • Publisher
    ieee
  • ISSN
    1053-5888
  • Type

    jour

  • DOI
    10.1109/MSP.2014.2365594
  • Filename
    7038248