• DocumentCode
    730343
  • Title

    The segregation of spatialised speech in interference by optimal mapping of diverse cues

  • Author

    Jingbo Gao ; Tew, Anthony I.

  • Author_Institution
    Dept. of Electron., Univ. of York, York, UK
  • fYear
    2015
  • fDate
    19-24 April 2015
  • Firstpage
    2095
  • Lastpage
    2099
  • Abstract
    We describe optimal cue mapping (OCM), a potentially eal-time binaural signal processing method for segregating sound source in the presence of multiple interfering 3D ound sources. Spatial cues are extracted from a multisource inaural mixture and used to train artificial neural etworks (ANNs) to estimate the spectral energy fraction of wanted speech source in the mixture. Once trained, the NN outputs form a spectral ratio mask which is applied rame-by-frame to the mixture to approximate the agnitude spectrum of the wanted speech. The speech ntelligibility performance of the OCM algorithm for nechoic sound sources is evaluated on previously unseen peech mixtures using the STOI automated measures, and ompared with an established reference method. The ptimized integration of multiple cues offers clear erformance benefits and the ability to quantify the relative mportance of each cue will facilitate computationally fficient implementations.
  • Keywords
    interference (signal); neural nets; speech processing; ANN; STOI; artificial neural etworks; diverse cues optimal mapping; eal-time binaural signal processing method; interference; nechoic sound sources; optimal cue mapping; sound source segregation; spatialised speech segregation; Acoustics; Artificial neural networks; Coherence; Estimation; Neurons; Speech; Time-frequency analysis; Speech segregation; neural networks; ratio mask;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
  • Conference_Location
    South Brisbane, QLD
  • Type

    conf

  • DOI
    10.1109/ICASSP.2015.7178340
  • Filename
    7178340