Title :
Low-latency sound-source-separation using non-negative matrix factorisation with coupled analysis and synthesis dictionaries
Author :
Barker, Tom ; Virtanen, Tuomas ; Pontoppidan, Niels Henrik
Author_Institution :
Dept. of Signal Process., Tampere Univ. of Technol., Tampere, Finland
Abstract :
For real-time or close to real-time applications, sound source separation can be performed on-line, where new frames of incoming data for a mixture signal are processed as they arrive, at very low delay. We propose an approach which generates the separation filters for short synthesis frames to achieve low latency source separation, based on a compositional model mixture of the audio to be separated. Filter parameters are derived from a longer temporal context than the current processing frame through use of a longer analysis frame. A pair of dictionaries are used, one for analysis and one for reconstruction. With this approach we are able to increase separation performance at low latencies whilst retaining the low-latency provided by the use of short synthesis frames. The proposed data handling scheme and parameters can be adjusted to achieve real-time performance, given sufficient computational power. Low-latency output allows a human listener to use the results of such a separation scheme directly, without a perceptible delay. With the proposed method, separated source-to-distortion ratios (SDRs) can be improved by over 1 dB for latencies below 20 ms, without any affect on latency.
Keywords :
data handling; filtering theory; matrix decomposition; mixture models; signal reconstruction; source separation; SDR; compositional model mixture; coupled analysis; data handling scheme; low latency sound source separation; mixture signal processing; non-negative matrix factorisation; separation filters; signal reconstruction; source to distortion ratio; synthesis dictionaries; Analytical models; Computational modeling; Dictionaries; Discrete Fourier transforms; Mixture models; Tin; Welding; NMF; Non-negative matrix factorisation; low-latency; real-time; source separation;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on
Conference_Location :
South Brisbane, QLD
DOI :
10.1109/ICASSP.2015.7177968