Title :
Ideal Binary Mask Ratio: A Novel Metric for Assessing Binary-Mask-Based Sound Source Separation Algorithms
Author :
Hummersone, Christopher ; Mason, Russell ; Brookes, Tim
Author_Institution :
Inst. of Sound Recording, Univ. of Surrey, Guildford, UK
Abstract :
A number of metrics have been proposed in the literature to assess sound source separation algorithms. The addition of convolutional distortion raises further questions about the assessment of source separation algorithms in reverberant conditions, as reverberation is shown to undermine the optimality of the ideal binary mask (IBM) in terms of signal-to-noise ratio (SNR). Furthermore, with a range of mixture parameters common across numerous acoustic conditions, SNR-based metrics demonstrate an inconsistency that can only be attributed to the convolutional distortion. This suggests the need for an alternative metric in the presence of convolutional distortion, such as reverberation. Consequently, a novel metric, dubbed the IBM ratio (IBMR), is proposed for assessing source separation algorithms that aim to calculate the IBM. The metric is robust to many of the effects of convolutional distortion on the output of the system and may provide a more representative insight into the performance of a given algorithm.
Keywords :
acoustic signal processing; distortion; source separation; SNR; convolutional distortion; ideal binary mask ratio; signal-to-noise ratio; sound source separation algorithms; Interference; Measurement; Reverberation; Signal to noise ratio; Source separation; Speech; Objective evaluation; reverberation; time–frequency masking; underdetermined source separation
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
DOI :
10.1109/TASL.2011.2109380