Title :
Multi-granular detection of regional semantic concepts [video annotation]
Author :
Naphade, Milind R. ; Natsev, Apostol ; Lin, Ching-Yung ; Smith, John R.
Author_Institution :
IBM Thomas J. Watson Res. Center, Hawthorne, NY, USA
Abstract :
A large number of interesting visual semantic concepts occur at a sub-frame granularity in images and occupy one or more regions at the sub-frame level. Detecting these concepts is a challenge due to segmentation imperfections. We propose multi-granular detection of visual concepts that have regional support. We build a single set of support vector machine based binary concept models from the training set with manually marked up regions. In this paper, we show that detection can be significantly improved by scoring these models over multiple granularities in the test set images, where the regions are automatically detected as a preprocessing step in detection. Using 27 regional semantic concepts from the NIST TRECVID 2003 common annotation lexicon and the corpus, we demonstrate that multi-granular detection leads to improvement in detection.
Keywords :
image classification; learning (artificial intelligence); semantic networks; support vector machines; vocabulary; binary classifiers; binary concept models; image sub-frame granularity; machine learning; manually marked up training set; multimedia content semantic analysis; regional semantic concepts; segmentation imperfections; semantic concept multigranular detection; support vector machines; video annotation; visual semantic concepts; vocabulary visual semantic concept detection; Automatic testing; Broadcasting; Content based retrieval; Gunshot detection systems; Hidden Markov models; Image segmentation; Multimedia communication; NIST; Phase detection; Vocabulary;
Conference_Titel :
Multimedia and Expo, 2004. ICME '04. 2004 IEEE International Conference on
Print_ISBN :
0-7803-8603-5
DOI :
10.1109/ICME.2004.1394137