DocumentCode :
3141113
Title :
Batch Mode Active Learning for Multimedia Pattern Recognition
Author :
Chakraborty, Shiladri ; Balasubramanian, Vineeth ; Panchanathan, Sethuraman
Author_Institution :
Center for Cognitive Ubiquitous Comput. (CUbiC), Arizona State Univ., Tempe, AZ, USA
fYear :
2012
fDate :
10-12 Dec. 2012
Firstpage :
489
Lastpage :
490
Abstract :
Multimedia applications like face recognition and facial expression recognition inherently rely on the availability of a large amount of labeled data to train a robust recognition system. In order to induce a reliable classification model for a multimedia pattern recognition application, the data is typically labeled by human experts based on some domain knowledge. However, manual annotation of a large number of images is an expensive process in terms of time, labor and human expertise. This has led to the development of active learning algorithms, which automatically identify the salient instances from a given set of unlabeled data and are effective in reducing the human annotation effort to train a classification model. Further, to address the possible presence of multiple labeling oracles, there have been efforts towards a batch form of active learning, where a set of unlabeled images are selected simultaneously for labeling instead of a single image at a time. Existing algorithms on batch mode active learning concentrate only on the development of a batch selection criterion and assume that the batch size (number of samples to be queried from an unlabeled set) to be specified in advance. However, in multimedia applications like face/facial expression recognition, it is difficult to decide on a batch size in advance because of the dynamic nature of video streams. Further, multimedia applications like facial expression recognition involve a fuzzy label space because of the imprecision and the vagueness in the class label boundaries. This necessitates a BMAL framework, for fuzzy label problems. To address these fundamental challenges, we propose two novel BMAL techniques in this work: (i) a framework for dynamic batch mode active learning, which adaptively selects the batch size and the specific instances to be queried based on the complexity of the data stream being analyzed and (ii) a BMAL algorithm for fuzzy label classification problems. To the best of our kno- ledge, this is the first attempt to develop such techniques in the active learning literature.
Keywords :
computational complexity; face recognition; fuzzy set theory; image classification; image retrieval; learning (artificial intelligence); multimedia computing; video streaming; BMAL framework; active learning algorithms; automatic salient instance identification; batch selection criterion development; batch size selection; class label boundaries; data stream complexity; dynamic batch mode active learning framework; face recognition; facial expression recognition; fuzzy label classification problems; human annotation effort reduction; image annotation; labeled data availability; labeling oracles; multimedia pattern recognition application; reliable classification model; robust recognition system training; unlabeled image dataset; video streams; Complexity theory; Entropy; Face recognition; Manuals; Multimedia communication; Streaming media;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Multimedia (ISM), 2012 IEEE International Symposium on
Conference_Location :
Irvine, CA
Print_ISBN :
978-1-4673-4370-1
Type :
conf
DOI :
10.1109/ISM.2012.101
Filename :
6424714
Link To Document :
بازگشت