DocumentCode
2608170
Title
Building a Multi-Modal Thesaurus from Annotated Images
Author
Frigui, Hichem ; Caudill, Joshua
Author_Institution
Dept. of CECS, Louisville Univ.
Volume
4
fYear
0
fDate
0-0 0
Firstpage
198
Lastpage
201
Abstract
We propose an unsupervised approach to learn associations between low-level visual features and keywords. We assume that a collection of images is available and that each image is globally annotated. The objective is to extract representative visual profiles that correspond to frequent homogeneous regions, and to associate them with keywords. These labeled profiles would be used to build a multi-modal thesaurus that could serve as a foundation for hybrid navigation and search algorithms. Our approach has two main steps. First, each image is coarsely segmented into regions, and visual features are extracted from each region. Second, the regions are categorized using a novel algorithm that performs clustering and feature weighting simultaneously. As a result, we obtain clusters of regions that share subsets of relevant features. Representatives from each cluster and their relevant visual and textual features would be used to build a thesaurus. The proposed approach is validated using a collection of 1169 images
Keywords
image retrieval; thesauri; unsupervised learning; annotated images; image collection; multimodal thesaurus; navigation algorithm; representative visual profile extraction; search algorithm; unsupervised learning; Clustering algorithms; Feature extraction; Image retrieval; Image segmentation; Information retrieval; Navigation; Organizing; Shape; Software libraries; Thesauri;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2006. ICPR 2006. 18th International Conference on
Conference_Location
Hong Kong
ISSN
1051-4651
Print_ISBN
0-7695-2521-0
Type
conf
DOI
10.1109/ICPR.2006.344
Filename
1699815
Link To Document