DocumentCode
705273
Title
Sparse multi-label linear embedding nonnegative tensor factorization for automatic music tagging
Author
Panagakis, Yannis ; Kotropoulos, Constantine ; Arce, Gonzalo R.
Author_Institution
Dept. of Inf., Aristotle Univ. of Thessaloniki, Thessaloniki, Greece
fYear
2010
fDate
23-27 Aug. 2010
Firstpage
492
Lastpage
496
Abstract
In this paper, a robust framework for automatic music tagging is proposed. First, each music recording is represented by its auditory temporal modulations. Then, a multilinear subspace learning algorithm based on sparse label coding is proposed to effectively harness the multi-label information for dimensionality reduction. The proposed algorithm is referred to as Sparse Multi-label Linear Embedding Nonnegative Tensor Factorization. Finally, a recently proposed sparse representation-based method for multi-label data is employed to propagate the multiple labels of the training auditory temporal modulations to annotate the auditory temporal modulations extracted from a test music recording with the sparse ℓ1 reconstruction coefficients. The proposed framework outperforms both humans and state-of-the-art computer audition systems in the music tagging task, when applied to the CAL500 dataset.
Keywords
digital signal processing chips; music; social networking (online); tensors; CAL500 dataset; auditory temporal modulation training; auditory temporal modulations annotation; automatic music tagging; dimensionality reduction; multilabel data; multilabel information; multilinear subspace learning algorithm; music recording; sparse label coding; sparse multilabel linear embedding nonnegative tensor factorization; sparse representation based method; Feature extraction; Maximum likelihood estimation; Modulation; Semantics; Tagging; Tensile stress; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2010 18th European
Conference_Location
Aalborg
ISSN
2219-5491
Type
conf
Filename
7096546
Link To Document