DocumentCode :
3517026
Title :
A tempering approach for Itakura-Saito non-negative matrix factorization. With application to music transcription
Author :
Bertin, Nancy ; Fevotte, Cédric ; Badeau, Roland
Author_Institution :
CNRS LTCI, TELECOM ParisTech, Paris
fYear :
2009
fDate :
19-24 April 2009
Firstpage :
1545
Lastpage :
1548
Abstract :
In this paper we are interested in non-negative matrix factorization (NMF) with the Itakura-Saito (IS) divergence. Previous work has demonstrated the relevance of this cost function for the decomposition of audio power spectrograms. This is due in particular to its scale invariance, which makes it more robust to the wide dynamic range of audio, a property not shared by other popular costs such as the Euclidean distance or the generalized Kullback-Leibler (KL) divergence. However, while the latter two cost functions are convex, the IS divergence is not, which makes it more prone to converging to irrelevant local minima, as observed empirically. The aim of this paper is therefore to propose a tempering scheme that favors convergence of IS-NMF to global minima. Our algorithm is based on NMF with the beta-divergence, where the shape parameter beta acts as a temperature parameter. Results on both synthetic and music data (in a transcription context) show the relevance of our approach.
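The ideas in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the usual definition of the beta-divergence (beta = 2 gives half the squared Euclidean distance, beta = 1 the generalized KL divergence, beta = 0 the IS divergence) and the commonly used heuristic multiplicative updates. The function names, the linear schedule from beta = 2 down to beta = 0, and the iteration counts are hypothetical choices made for illustration only; the abstract does not specify the schedule.

```python
import numpy as np

def beta_divergence(x, y, beta):
    """Summed element-wise beta-divergence d_beta(x | y) for positive arrays."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    if beta == 0:  # Itakura-Saito divergence
        r = x / y
        return float(np.sum(r - np.log(r) - 1.0))
    if beta == 1:  # generalized Kullback-Leibler divergence
        return float(np.sum(x * np.log(x / y) - x + y))
    num = x**beta + (beta - 1) * y**beta - beta * x * y**(beta - 1)
    return float(np.sum(num / (beta * (beta - 1))))

def tempered_is_nmf(V, rank, betas, iters_per_beta=50, seed=0, eps=1e-12):
    """Approximate V by W @ H while sweeping beta over `betas`.

    Assumption: `betas` is a user-chosen decreasing schedule ending at 0,
    so the last stage minimizes the IS divergence. Each stage is
    warm-started from the factors found at the previous beta.
    """
    rng = np.random.default_rng(seed)
    n_rows, n_cols = V.shape
    W = rng.random((n_rows, rank)) + eps
    H = rng.random((rank, n_cols)) + eps
    for beta in betas:
        for _ in range(iters_per_beta):
            # Heuristic multiplicative updates for the beta-divergence.
            WH = W @ H + eps
            H *= (W.T @ (WH**(beta - 2) * V)) / (W.T @ WH**(beta - 1) + eps)
            WH = W @ H + eps
            W *= ((WH**(beta - 2) * V) @ H.T) / (WH**(beta - 1) @ H.T + eps)
    return W, H

# Usage on small synthetic data (values are arbitrary).
rng = np.random.default_rng(1)
V = rng.random((6, 8)) + 0.1          # strictly positive "spectrogram"
schedule = np.linspace(2.0, 0.0, 5)   # hypothetical schedule: 2, 1.5, 1, 0.5, 0
W, H = tempered_is_nmf(V, rank=3, betas=schedule, iters_per_beta=20)
```

The scale invariance mentioned in the abstract can be checked directly: multiplying both arguments of `beta_divergence` by the same constant leaves the beta = 0 (IS) value unchanged, whereas the beta = 2 (Euclidean) value grows with the square of that constant.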
Keywords :
acoustic signal processing; matrix decomposition; Itakura-Saito nonnegative matrix factorization; audio power spectrograms; generalized Kullback-Leibler divergence; music transcription; tempering approach; Contracts; Convergence; Cost function; Matrix decomposition; Maximum likelihood estimation; Multiple signal classification; Robustness; Signal processing algorithms; Spectrogram; Telecommunications; Itakura-Saito (IS) divergence; Non-negative matrix factorization (NMF); beta divergence
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
Conference_Location :
Taipei
ISSN :
1520-6149
Print_ISBN :
978-1-4244-2353-8
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2009.4959891
Filename :
4959891