DocumentCode :
2163683
Title :
Cost-sensitive stacking for audio tag annotation and retrieval
Author :
Lo, Hung-Yi ; Wang, Ju-Chiang ; Wang, Hsin-Min ; Lin, Shou-De
Author_Institution :
Inst. of Inf. Sci., Acad. Sinica, Taipei, Taiwan
fYear :
2011
fDate :
22-27 May 2011
Firstpage :
2308
Lastpage :
2311
Abstract :
Audio tags correspond to keywords that people use to de scribe different aspects of a music clip, such as the genre, mood, and instrumentation. Since social tags are usually as signed by people with different levels of musical knowledge, they inevitably contain noisy information. By treating the tag counts as costs, we can model the audio tagging problem as a cost-sensitive classification problem. In addition, tag correlation is another useful information for automatic audio tagging since some tags often co-occur. By considering the co-occurrences of tags, we can model the audio tagging problem as a multi-label classification problem. To exploit the tag count and correlation information jointly, we formulate the audio tagging task as a novel cost-sensitive multi-label (CSML) learning problem. The results of audio tag annotation and retrieval experiments demonstrate that the new approach outperforms our MIREX 2009 winning method.
Keywords :
audio signal processing; information retrieval; learning (artificial intelligence); music; signal classification; MIREX 2009 winning method; audio retrieval; audio tag annotation; cost-sensitive classification problem; cost-sensitive multilabel learning problem; cost-sensitive stacking; multilabel classification problem; music clip; tag correlation; Correlation; Feature extraction; Mood; Stacking; Support vector machines; Tagging; Training; Audio tag annotation; audio tag retrieval; cost-sensitive learning; multi-label; tag count;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on
Conference_Location :
Prague
ISSN :
1520-6149
Print_ISBN :
978-1-4577-0538-0
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2011.5946944
Filename :
5946944
Link To Document :
بازگشت