Title :
Partition sampling: an active learning selection strategy for large database annotation
Author :
Souvannavong, F. ; Merialdo, B. ; Huet, B.
Author_Institution :
Dept. Commun. Multimedias, Inst. Eurecom, Sophia-Antipolis, France
fDate :
6/3/2005 12:00:00 AM
Abstract :
Annotating a video database requires an intensive, time consuming and error prone human effort. However, this is a mandatory task to efficiently describe multimedia contents and train models for automatic content detection. A new selection strategy for active learning methods to minimise human effort in labelling a large database of video sequences is proposed. Formally, active learning is a process where new unlabelled samples are selected iteratively, then presented to users for annotation, and finally added to the training set. The major problem is to then find the best selection function to quickly reach high classification accuracy. It is shown that existing active learning approaches using selective sampling do not maintain their performances when the number of selected samples per iteration increases. The presented selection strategy attempts to provide a solution to this problem. In practice, selecting many samples offers many advantages when dealing with a large amount of data; among them the possibility to share the annotation effort between several users. Finally an attempt to tackle the more realistic and challenging task of multiple label annotation is made. This would reduce to greater extend the human effort for labelling.
Keywords :
classification; learning systems; multimedia databases; sampling methods; video databases; active learning selection strategy; automatic multimedia content detection; classification accuracy; detection model training; k-nearest-neighbour classifier; large database annotation; multiple label annotation; partition sampling; sample selection; video database annotation; video sequence labelling;
Journal_Title :
Vision, Image and Signal Processing, IEE Proceedings -
DOI :
10.1049/ip-vis:20045079