• DocumentCode
    3184430
  • Title

    Application of Diffusion Kernel in Multimodal Image Retrieval

  • Author

    Agrawal, Rajeev ; Grosky, William ; Fotouhi, Farshad ; Wu, Changhua

  • fYear
    2007
  • fDate
    10-12 Dec. 2007
  • Firstpage
    271
  • Lastpage
    276
  • Abstract
    In this paper, we propose an approach to negotiate the gap between low-level image features and the human interpretation of the image. Taking the cue from text-based retrieval techniques, we construct "visual keywords" using vector quantization of small-sized image tiles. Both visual and textual keywords are combined and used to represent an image as a single multimodal vector. We use a diffusion kernel-based non-linear approach to fuse the visual and textual keywords. By comparing the performance of this approach with a low-level features-based approach, we demonstrate that visual keywords, when combined with textual keywords, improve the image retrieval results significantly.
  • Keywords
    Conferences; Fuses; Humans; Image retrieval; Information retrieval; Kernel; Pixel; Tiles; USA Councils; Vector quantization;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Multimedia Workshops, 2007. ISMW '07. Ninth IEEE International Symposium on
  • Conference_Location
    Taichung, Taiwan
  • Print_ISBN
    978-0-7695-3084-0
  • Type

    conf

  • DOI
    10.1109/ISM.Workshops.2007.53
  • Filename
    4475982