• DocumentCode
    3441193
  • Title
    Searching an appropriate template size for multimodal image clustering
  • Author
    Agrawal, Rajeev ; Grosky, William I. ; Fotouhi, Farshad
  • Author_Institution
    Dept. of Comput. Sci., Wayne State Univ., Detroit, MI, USA
  • fYear
    2009
  • fDate
    2-4 April 2009
  • Firstpage
    560
  • Lastpage
    564
  • Abstract
    Researchers have shown that a multimodal approach can help identify better clusters in an image collection. The multimodal image features include low-level image features and available text annotations. This approach helps identify inherent relationships among the different types of features associated with an image. In our approach, we divide images into small tiles and create visual keywords using a high-dimensional clustering algorithm. These visual keywords play the same role as text keywords. One challenge of this approach is identifying an appropriate size for the visual keywords. In this paper, we report our results on finding a suitable template size for creating the tiles from which visual keywords are built. These visual keywords are combined with text keywords to create a multimodal image representation before clustering is applied.
  • Keywords
    feature extraction; image representation; pattern clustering; text analysis; multimodal image clustering; multimodal image representation; multimodality approach; text annotation; visual keyword; Clustering algorithms; Computer science; Frequency; Image analysis; Image classification; Image databases; Image representation; Kernel; Shape; Tiles; diffusion kernel; image clustering; multimodal
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Multimedia Computing and Systems, 2009. ICMCS '09. International Conference on
  • Conference_Location
    Ouarzazate
  • Print_ISBN
    978-1-4244-3756-6
  • Electronic_ISBN
    978-1-4244-3757-3
  • Type
    conf
  • DOI
    10.1109/MMCS.2009.5256634
  • Filename
    5256634