• DocumentCode
    698062
  • Title

    A simple correlation-based model of intelligibility for nonlinear speech enhancement and separation

  • Author

    Boldt, Jesper B. ; Ellis, Daniel P. W.

  • Author_Institution
    Oticon A/S, Smørum, Denmark
  • fYear
    2009
  • fDate
    24-28 Aug. 2009
  • Firstpage
    1849
  • Lastpage
    1853
  • Abstract
    Applying a binary mask to a pure noise signal can result in speech that is highly intelligible, despite the absence of any of the target speech signal. Therefore, to estimate the intelligibility benefit of highly nonlinear speech enhancement techniques, we contend that SNR is not useful; instead we propose a measure based on the similarity between the time-varying spectral envelopes of target speech and system output, as measured by correlation. As with previous correlation-based intelligibility measures, our system can broadly match subjective intelligibility for a range of enhanced signals. Our system, however, is notably simpler and we explain the practical motivation behind each stage. This measure, freely available as a small Matlab implementation, can provide a more meaningful evaluation measure for nonlinear speech enhancement systems, as well as providing a transparent objective function for the optimization of such systems.
  • Keywords
    correlation methods; mathematics computing; source separation; speech enhancement; speech intelligibility; Matlab implementation; SNR; binary mask; correlation-based intelligibility measures; correlation-based model; nonlinear speech enhancement; speech separation; subjective intelligibility; time-varying spectral envelopes; Correlation; Signal to noise ratio; Speech; Speech enhancement; Time-frequency analysis; Tin;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Signal Processing Conference, 2009 17th European
  • Conference_Location
    Glasgow
  • Print_ISBN
    978-161-7388-76-7
  • Type

    conf

  • Filename
    7077636