Building a Multi-Modal Thesaurus from Annotated Images

Author

Frigui, Hichem ; Caudill, Joshua

Author_Institution

Dept. of CECS, Louisville Univ.

Volume

4

fYear

0

fDate

0-0 0

Firstpage

198

Lastpage

201

Abstract

We propose an unsupervised approach to learn associations between low-level visual features and keywords. We assume that a collection of images is available and that each image is globally annotated. The objective is to extract representative visual profiles that correspond to frequent homogeneous regions, and to associate them with keywords. These labeled profiles would be used to build a multi-modal thesaurus that could serve as a foundation for hybrid navigation and search algorithms. Our approach has two main steps. First, each image is coarsely segmented into regions, and visual features are extracted from each region. Second, the regions are categorized using a novel algorithm that performs clustering and feature weighting simultaneously. As a result, we obtain clusters of regions that share subsets of relevant features. Representatives from each cluster and their relevant visual and textual features would be used to build a thesaurus. The proposed approach is validated using a collection of 1169 images.
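
The second step (clustering regions while weighting features at the same time) can be illustrated with a minimal sketch. This is not the authors' algorithm, which the abstract does not specify. It is a simple k-means variant, swapped in for illustration, in which each cluster keeps its own feature weights, set inversely proportional to the within-cluster dispersion of each feature. The function names, the `eps` smoothing term, and the evenly spaced initialization are all assumptions made for this example.

```python
def weighted_sq_dist(x, center, weights):
    """Squared Euclidean distance with one weight per feature."""
    return sum(w * (a - c) ** 2 for a, c, w in zip(x, center, weights))


def cluster_with_feature_weights(points, k, n_iter=20, eps=1e-6):
    """Illustrative k-means variant with per-cluster feature weights.

    Assumption: weights are inverse within-cluster dispersions,
    normalized to sum to 1 for each cluster. This is a stand-in,
    not the method of the paper.
    """
    dim = len(points[0])
    # Deterministic initialization: evenly spaced points as centers.
    step = max(1, len(points) // k)
    centers = [list(points[i * step]) for i in range(k)]
    weights = [[1.0 / dim] * dim for _ in range(k)]
    labels = [0] * len(points)

    for _ in range(n_iter):
        # Assign each region to the closest center under that
        # cluster's own feature weights.
        labels = [
            min(range(k),
                key=lambda j: weighted_sq_dist(p, centers[j], weights[j]))
            for p in points
        ]
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if not members:
                continue  # keep the previous center and weights
            # Update the center as the mean of its members.
            centers[j] = [sum(col) / len(members) for col in zip(*members)]
            # Per-feature dispersion (variance) inside the cluster.
            disp = [
                sum((p[d] - centers[j][d]) ** 2 for p in members) / len(members)
                for d in range(dim)
            ]
            # Low dispersion means the feature is relevant to this cluster.
            inv = [1.0 / (v + eps) for v in disp]
            total = sum(inv)
            weights[j] = [v / total for v in inv]
    return labels, centers, weights
```

Given region feature vectors in which one feature is compact within each group and another is scattered, the returned weights concentrate on the compact feature for each cluster, which mirrors the idea of clusters that "share subsets of relevant features".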

Keywords

image retrieval; thesauri; unsupervised learning; annotated images; image collection; multimodal thesaurus; navigation algorithm; representative visual profile extraction; search algorithm; Clustering algorithms; Feature extraction; Image retrieval; Image segmentation; Information retrieval; Navigation; Organizing; Shape; Software libraries; Thesauri

fLanguage

English

Publisher

IEEE

Conference_Titel

Pattern Recognition, 2006. ICPR 2006. 18th International Conference on

Conference_Location

Hong Kong

ISSN

1051-4651

Print_ISBN

0-7695-2521-0

Type

conf

DOI

10.1109/ICPR.2006.344

Filename

1699815