DocumentCode :
996719
Title :
Supervised Learning of Semantic Classes for Image Annotation and Retrieval
Author :
Carneiro, Gustavo ; Chan, Antoni B. ; Moreno, Pedro J. ; Vasconcelos, Nuno
Author_Institution :
Dept. of Integrated Data Syst., Siemens Corp. Res. Inc., Princeton, NJ
Volume :
29
Issue :
3
fYear :
2007
fDate :
3/1/2007 12:00:00 AM
Firstpage :
394
Lastpage :
410
Abstract :
A probabilistic formulation for semantic image annotation and retrieval is proposed. Annotation and retrieval are posed as classification problems where each class is defined as the group of database images labeled with a common semantic label. It is shown that, by establishing this one-to-one correspondence between semantic labels and semantic classes, a minimum probability of error annotation and retrieval are feasible with algorithms that are 1) conceptually simple, 2) computationally efficient, and 3) do not require prior semantic segmentation of training images. In particular, images are represented as bags of localized feature vectors, a mixture density estimated for each image, and the mixtures associated with all images annotated with a common semantic label pooled into a density estimate for the corresponding semantic class. This pooling is justified by a multiple instance learning argument and performed efficiently with a hierarchical extension of expectation-maximization. The benefits of the supervised formulation over the more complex, and currently popular, joint modeling of semantic label and visual feature distributions are illustrated through theoretical arguments and extensive experiments. The supervised formulation is shown to achieve higher accuracy than various previously published methods at a fraction of their computational cost. Finally, the proposed method is shown to be fairly robust to parameter tuning
Keywords :
expectation-maximisation algorithm; image classification; image retrieval; learning (artificial intelligence); expectation-maximization algorithm; image annotation; image classification; image retrieval; semantic classes; supervised learning; Computational efficiency; Image databases; Image retrieval; Image segmentation; Information retrieval; Labeling; Robustness; Supervised learning; Unsupervised learning; Visual databases; Content-based image retrieval; Gaussian mixtures; expectation-maximization; image segmentation; multiple instance learning; object recognition.; semantic image annotation and retrieval; weakly supervised learning; Algorithms; Artificial Intelligence; Database Management Systems; Databases, Factual; Documentation; Image Enhancement; Image Interpretation, Computer-Assisted; Information Storage and Retrieval; Natural Language Processing; Pattern Recognition, Automated; Semantics; Sensitivity and Specificity;
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
ieee
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/TPAMI.2007.61
Filename :
4069257
Link To Document :
بازگشت