Title :
A regressive boosting approach to automatic audio tagging based on soft annotator fusion
Author :
Foucard, Rémi ; Essid, Slim ; Lagrange, Mathieu ; Richard, Gaël
Author_Institution :
LTCI, TELECOM ParisTech., Paris, France
Abstract :
Automatic tagging of music has mostly been treated as a classification problem. In this framework, the association of a tag to a song is characterized in a “hard” fashion: the tag is either relevant or not. Yet, the relevance of a tag to a song is not always evident. Indeed, during the ground-truth annotation process, several annotators may express doubts, or disagree with each other. In this paper, we propose to fuse annotators´ decisions in a way to keep information about this uncertainty. This fusion provides us continuous scores, that are used for training a regressive boosting algorithm. Our experiments show that regression with this soft ground truth leads to a more accurate learning, and better predictions, compared to traditionally used binary classification.
Keywords :
audio signal processing; information retrieval; learning (artificial intelligence); music; regression analysis; signal classification; automatic audio tagging; binary classification; ground-truth annotation process; machine learning; music automatic tagging; music information retrieval; regressive boosting approach; soft annotator fusion; Boosting; Databases; Mel frequency cepstral coefficient; Prediction algorithms; Tagging; Training; Uncertainty; Autotagging; Boosting; Machine learning; Music information retrieval; Regression analysis;
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
Conference_Location :
Kyoto
Print_ISBN :
978-1-4673-0045-2
Electronic_ISBN :
1520-6149
DOI :
10.1109/ICASSP.2012.6287820