DocumentCode
729706
Title
Probabilistic learning from mislabelled data for multimedia content recognition
Author
Kakar, Pravin ; Chia, Alex Yong-Sang
Author_Institution
Inst. for Infocomm Res., Singapore, Singapore
fYear
2015
fDate
June 29 2015-July 3 2015
Firstpage
1
Lastpage
6
Abstract
There have been considerable advances in multimedia recognition recently as powerful computing capabilities and large, representative datasets become ubiquitous. A fundamental assumption of traditional recognition techniques is that the data available for training are accurately labelled. Given the scale and diversity of web data, it takes considerable annotation effort to reduce label noise to acceptable levels. In this work, we propose a novel method to work around this issue by utilizing approximate apriori estimates of the mislabelling probabilities to design a noise-aware learning framework. We demonstrate the proposed framework´s effectiveness on several datasets of various modalities and show that it is able to achieve high levels of accuracy even when faced with significant mislabelling in the data.
Keywords
Internet; image denoising; image recognition; learning (artificial intelligence); multimedia computing; Web data; computing capabilities; label noise reduction; mislabelled data; multimedia content recognition; noise-aware learning framework; probabilistic learning; representative datasets; Accuracy; Neural networks; Noise; Noise level; Noise measurement; Testing; Training; mislabelled data; multimedia content recognition; probabilistic learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia and Expo (ICME), 2015 IEEE International Conference on
Conference_Location
Turin
Type
conf
DOI
10.1109/ICME.2015.7177393
Filename
7177393
Link To Document