DocumentCode
177634
Title
Accounting for phase cancellations in non-negative matrix factorization using weighted distances
Author
Ewert, Sebastian ; Plumbley, Mark D. ; Sandler, Mark
Author_Institution
Queen Mary Univ. of London, London, UK
fYear
2014
fDate
4-9 May 2014
Firstpage
649
Lastpage
653
Abstract
Techniques based on non-negative matrix factorization (NMF) have been successfully used to decompose a spectrogram of a music recording into a dictionary of templates and activations. While advanced NMF variants often yield robust signal models, there are usually some inaccuracies in the factorization since the underlying methods are not prepared for phase cancellations that occur when sounds with similar frequency are mixed. In this paper, we present a novel method that takes phase cancellations into account to refine dictionaries learned by NMF-based methods. Our approach exploits the fact that advanced NMF methods are often robust enough to provide information about how sound sources interact in a spectrogram, where they overlap, and thus where phase cancellations could occur. Using this information, the distances used in NMF are weighted entry-wise to attenuate the influence of regions with phase cancellations. Experiments on full-length, polyphonic piano recordings indicate that our method can be successfully used to refine NMF-based dictionaries.
Keywords
audio recording; matrix decomposition; music; NMF-based dictionaries; NMF-based methods; factorization; music recording; nonnegative matrix factorization; phase cancellations; polyphonic piano recordings; robust signal models; spectrogram; weighted distances; Acoustics; Dictionaries; Robustness; Spectrogram; Speech; Speech processing; NMF; Phase Cancellation; Source Separation; Weighted Distances;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6853676
Filename
6853676
Link To Document