Title :
Empirical Comparison of Automatic Image Annotation Systems
Author :
Maher Ben Ismail, M. ; Frigui, Hichem ; Caudill, Joshua
Author_Institution :
CECS Dept., Univ. of Louisville, Louisville, KY
Abstract :
The performance of content-based image retrieval systems has proved to be inherently constrained by the low-level features they use, and these systems cannot give satisfactory results when the user's high-level concepts cannot be expressed by low-level features. In an attempt to bridge this semantic gap, recent approaches have started integrating both low-level visual features and high-level textual keywords. Unfortunately, manual image annotation is a tedious process and may not be feasible for large image databases. To overcome this limitation, several approaches that can annotate images in a semi-supervised or unsupervised way have emerged. In this paper, we outline and compare four different algorithms. The first one is simple and assumes that image annotation can be viewed as the task of translating from a vocabulary of fixed image regions to a vocabulary of words. The second approach uses a set of annotated images as a training set and learns the joint distribution of regions and words. The third and fourth approaches are based on segmenting the images into homogeneous regions. Both of these approaches rely on a clustering algorithm to learn the association between visual features and keywords. The clustering task is not trivial, as it involves clustering a very high-dimensional and sparse feature space. To address this, the third approach uses semi-supervised constrained clustering, while the fourth approach relies on an algorithm that performs simultaneous clustering and feature discrimination. These four algorithms were implemented and tested on a data set of 6000 images using four-fold cross-validation.
Keywords :
content-based retrieval; image retrieval; image segmentation; learning (artificial intelligence); visual databases; automatic image annotation systems; clustering algorithm; content-based image retrieval; feature discrimination; high-level textual keywords; image segmentation; low-level visual features; semisupervised constrained clustering; Bridges; Clustering algorithms; Content based retrieval; Image databases; Image processing; Image retrieval; Image segmentation; Information retrieval; Labeling; Vocabulary; clustering; constrained clustering; content-based image retrieval; image annotation
Conference_Title :
Image Processing Theory, Tools and Applications, 2008. IPTA 2008. First Workshops on
Conference_Location :
Sousse
Print_ISBN :
978-1-4244-3321-6
Electronic_ISBN :
978-1-4244-3322-3
DOI :
10.1109/IPTA.2008.4743754