Application of Diffusion Kernel in Multimodal Image Retrieval

Author

Agrawal, Rajeev ; Grosky, William ; Fotouhi, Farshad ; Wu, Changhua

fYear

2007

fDate

10-12 Dec. 2007

Firstpage

271

Lastpage

276

Abstract

In this paper, we propose an approach to negotiate the gap between low-level image features and the human interpretation of the image. Taking the cue from text-based retrieval techniques, we construct "visual keywords" using vector quantization of small- sized image tiles. Both visual and textual keywords are combined and used to represent an image as a single multimodal vector. We use a diffusion kernel-based non-linear approach to fuse the visual and textual keywords. By comparing the performance of this approach with a low-level features-based approach, we demonstrate that visual keywords, when combined with textual keywords, improve the image retrieval results significantly.

Keywords

Conferences; Fuses; Humans; Image retrieval; Information retrieval; Kernel; Pixel; Tiles; USA Councils; Vector quantization;

fLanguage

English

Publisher

ieee

Conference_Titel

Multimedia Workshops, 2007. ISMW '07. Ninth IEEE International Symposium on

Conference_Location

Taichung, Taiwan

Print_ISBN

9780-7695-3084-0

Type

conf

DOI

10.1109/ISM.Workshops.2007.53

Filename

4475982

Link To Document

https://search.isc.ac/dl/search/defaultta.aspx?DTC=49&DC=3184430