Title :
A generalized multiple instance learning algorithm for large scale modeling of multimedia semantics
Author :
Naphade, Milind R. ; Smith, John R.
Author_Institution :
IBM Thomas J. Watson Res. Center, Hawthorne, NY, USA
Abstract :
Statistical learning techniques provide a robust framework for learning representations of semantic concepts from multimedia features. The bottleneck is the number of training samples needed to construct robust models. This is particularly expensive when the annotation needs to happen at finer granularity. We present a novel approach where the annotations may be entered at coarser spatial granularity while the concept may still be learnt at finer granularity. This can speed up annotation significantly. Using the multiple instance learning paradigm, we show that it is possible to learn representations of concepts occurring at the regional level by using annotations for several images. We present a generalized multiple instance learning algorithm that can scale to a large number of training samples as well as a large number of instances per bag. The algorithm also provides the ability to plug in different density modeling or regression techniques. Using the TREC 2001 Corpus we demonstrate the superior performance of the proposed algorithm over the existing diverse density algorithm.
Keywords :
feature extraction; learning (artificial intelligence); multimedia databases; pattern classification; sampling methods; TREC 2001 Corpus; annotation; coarse spatial granularity; density modeling; generalized multiple instance learning algorithm; instances per bag; large scale modeling; multimedia features; multimedia semantics; performance; regression techniques; semantic concept representation; statistical learning; training samples; Content management; Feedback; Government; Indexing; Large-scale systems; Machine learning; Phase detection; Plugs; Robustness; Statistical learning;
Conference_Titel :
Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on
Print_ISBN :
0-7803-8874-7
DOI :
10.1109/ICASSP.2005.1416310