DocumentCode
3184430
Title
Application of Diffusion Kernel in Multimodal Image Retrieval
Author
Agrawal, Rajeev ; Grosky, William ; Fotouhi, Farshad ; Wu, Changhua
fYear
2007
fDate
10-12 Dec. 2007
Firstpage
271
Lastpage
276
Abstract
In this paper, we propose an approach to negotiate the gap between low-level image features and the human interpretation of the image. Taking the cue from text-based retrieval techniques, we construct "visual keywords" using vector quantization of small- sized image tiles. Both visual and textual keywords are combined and used to represent an image as a single multimodal vector. We use a diffusion kernel-based non-linear approach to fuse the visual and textual keywords. By comparing the performance of this approach with a low-level features-based approach, we demonstrate that visual keywords, when combined with textual keywords, improve the image retrieval results significantly.
Keywords
Conferences; Fuses; Humans; Image retrieval; Information retrieval; Kernel; Pixel; Tiles; USA Councils; Vector quantization;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia Workshops, 2007. ISMW '07. Ninth IEEE International Symposium on
Conference_Location
Taichung, Taiwan
Print_ISBN
9780-7695-3084-0
Type
conf
DOI
10.1109/ISM.Workshops.2007.53
Filename
4475982
Link To Document