مرکز منطقه ای اطلاع رساني علوم و فناوري - Batch Mode Active Learning for Multimedia Pattern Recognition

DocumentCode :

3141113

Title :

Batch Mode Active Learning for Multimedia Pattern Recognition

Author :

Chakraborty, Shiladri ; Balasubramanian, Vineeth ; Panchanathan, Sethuraman

Author_Institution :

Center for Cognitive Ubiquitous Comput. (CUbiC), Arizona State Univ., Tempe, AZ, USA

fYear :

2012

fDate :

10-12 Dec. 2012

Firstpage :

489

Lastpage :

490

Abstract :

Multimedia applications like face recognition and facial expression recognition inherently rely on the availability of a large amount of labeled data to train a robust recognition system. In order to induce a reliable classification model for a multimedia pattern recognition application, the data is typically labeled by human experts based on some domain knowledge. However, manual annotation of a large number of images is an expensive process in terms of time, labor and human expertise. This has led to the development of active learning algorithms, which automatically identify the salient instances from a given set of unlabeled data and are effective in reducing the human annotation effort to train a classification model. Further, to address the possible presence of multiple labeling oracles, there have been efforts towards a batch form of active learning, where a set of unlabeled images are selected simultaneously for labeling instead of a single image at a time. Existing algorithms on batch mode active learning concentrate only on the development of a batch selection criterion and assume that the batch size (number of samples to be queried from an unlabeled set) to be specified in advance. However, in multimedia applications like face/facial expression recognition, it is difficult to decide on a batch size in advance because of the dynamic nature of video streams. Further, multimedia applications like facial expression recognition involve a fuzzy label space because of the imprecision and the vagueness in the class label boundaries. This necessitates a BMAL framework, for fuzzy label problems. To address these fundamental challenges, we propose two novel BMAL techniques in this work: (i) a framework for dynamic batch mode active learning, which adaptively selects the batch size and the specific instances to be queried based on the complexity of the data stream being analyzed and (ii) a BMAL algorithm for fuzzy label classification problems. To the best of our kno- ledge, this is the first attempt to develop such techniques in the active learning literature.

Keywords :

computational complexity; face recognition; fuzzy set theory; image classification; image retrieval; learning (artificial intelligence); multimedia computing; video streaming; BMAL framework; active learning algorithms; automatic salient instance identification; batch selection criterion development; batch size selection; class label boundaries; data stream complexity; dynamic batch mode active learning framework; face recognition; facial expression recognition; fuzzy label classification problems; human annotation effort reduction; image annotation; labeled data availability; labeling oracles; multimedia pattern recognition application; reliable classification model; robust recognition system training; unlabeled image dataset; video streams; Complexity theory; Entropy; Face recognition; Manuals; Multimedia communication; Streaming media;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Multimedia (ISM), 2012 IEEE International Symposium on

Conference_Location :

Irvine, CA

Print_ISBN :

978-1-4673-4370-1

Type :

conf

DOI :

10.1109/ISM.2012.101

Filename :

6424714

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=3141113