Title :
The segregation of spatialised speech in interference by optimal mapping of diverse cues
Author :
Jingbo Gao ; Tew, Anthony I.
Author_Institution :
Dept. of Electron., Univ. of York, York, UK
Abstract :
We describe optimal cue mapping (OCM), a potentially real-time binaural signal processing method for segregating a sound source in the presence of multiple interfering 3D sound sources. Spatial cues are extracted from a multisource binaural mixture and used to train artificial neural networks (ANNs) to estimate the spectral energy fraction of a wanted speech source in the mixture. Once trained, the ANN outputs form a spectral ratio mask which is applied frame-by-frame to the mixture to approximate the magnitude spectrum of the wanted speech. The speech intelligibility performance of the OCM algorithm for anechoic sound sources is evaluated on previously unseen speech mixtures using the STOI automated measures, and compared with an established reference method. The optimized integration of multiple cues offers clear performance benefits and the ability to quantify the relative importance of each cue will facilitate computationally efficient implementations.
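The ratio-mask step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: in OCM the mask is estimated by trained ANNs from spatial cues, whereas here an oracle energy fraction computed from known source spectra stands in for the network output. The function names, array shapes, the `eps` constant, and the assumption that source energies add in the mixture are all hypothetical choices made for illustration.

```python
import numpy as np

def energy_fraction_mask(target_mag, interferer_mag, eps=1e-12):
    """Oracle stand-in for the ANN output (assumption): the fraction of
    energy in each time-frequency bin that belongs to the wanted source."""
    target_energy = target_mag ** 2
    interferer_energy = interferer_mag ** 2
    return target_energy / (target_energy + interferer_energy + eps)

def apply_ratio_mask(mixture_mag, mask):
    """Scale the mixture magnitude spectrum bin by bin, one row per frame."""
    return np.clip(mask, 0.0, 1.0) * mixture_mag

# Synthetic magnitude spectra: 5 frames x 8 frequency bins (illustrative sizes).
rng = np.random.default_rng(0)
target_mag = rng.random((5, 8))
interferer_mag = rng.random((5, 8))

# Assumes uncorrelated sources, so that energies add in the mixture.
mixture_mag = np.sqrt(target_mag ** 2 + interferer_mag ** 2)

mask = energy_fraction_mask(target_mag, interferer_mag)
estimate = apply_ratio_mask(mixture_mag, mask)
```

Because every mask value lies between 0 and 1, the estimate never exceeds the mixture magnitude in any bin. With no interference the mask is close to 1 and the mixture passes through essentially unchanged.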
Keywords :
interference (signal); neural nets; speech processing; ANN; STOI; artificial neural networks; diverse cues optimal mapping; real-time binaural signal processing method; interference; anechoic sound sources; optimal cue mapping; sound source segregation; spatialised speech segregation; Acoustics; Artificial neural networks; Coherence; Estimation; Neurons; Speech; Time-frequency analysis; Speech segregation; neural networks; ratio mask;
Conference_Title :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7178340