DocumentCode
730343
Title
The segregation of spatialised speech in interference by optimal mapping of diverse cues
Author
Jingbo Gao ; Tew, Anthony I.
Author_Institution
Dept. of Electron., Univ. of York, York, UK
fYear
2015
fDate
19-24 April 2015
Firstpage
2095
Lastpage
2099
Abstract
We describe optimal cue mapping (OCM), a potentially real-time binaural signal processing method for segregating a sound source in the presence of multiple interfering 3D sound sources. Spatial cues are extracted from a multisource binaural mixture and used to train artificial neural networks (ANNs) to estimate the spectral energy fraction of a wanted speech source in the mixture. Once trained, the ANN outputs form a spectral ratio mask which is applied frame-by-frame to the mixture to approximate the magnitude spectrum of the wanted speech. The speech intelligibility performance of the OCM algorithm for anechoic sound sources is evaluated on previously unseen speech mixtures using the STOI automated measures, and compared with an established reference method. The optimized integration of multiple cues offers clear performance benefits, and the ability to quantify the relative importance of each cue will facilitate computationally efficient implementations.
Keywords
interference (signal); neural nets; speech processing; ANN; STOI; artificial neural networks; diverse cues optimal mapping; real-time binaural signal processing method; interference; anechoic sound sources; optimal cue mapping; sound source segregation; spatialised speech segregation; Acoustics; Artificial neural networks; Coherence; Estimation; Neurons; Speech; Time-frequency analysis; Speech segregation; neural networks; ratio mask;
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location
South Brisbane, QLD
Type
conf
DOI
10.1109/ICASSP.2015.7178340
Filename
7178340