Searching an appropriate template size for multimodal image clustering

Author

Agrawal, Rajeev ; Grosky, William I. ; Fotouhi, Farshad

Author_Institution

Dept. of Comput. Sci., Wayne State Univ., Detroit, MI, USA

fYear

2009

fDate

2-4 April 2009

Firstpage

560

Lastpage

564

Abstract

Prior work has shown that a multimodal approach can identify better clusters in an image collection. The multimodal image features include low-level image features and any available text annotations. This approach helps identify inherent relationships among the different types of features associated with an image. In our approach, we divide images into small tiles and create visual keywords using a high-dimensional clustering algorithm. These visual keywords play the same role as text keywords. One challenge of this approach is choosing an appropriate size for the visual keywords. In this paper, we report our results in finding a suitable template size for creating the tiles from which visual keywords are built. The visual keywords are combined with text keywords to create a multimodal image representation before clustering is applied.
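The pipeline the abstract describes (tile the image, cluster the tiles into visual keywords, merge them with text keywords) can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the clustering algorithm, so plain k-means stands in for it here, and the function names, the `vk`/`txt:` prefixes, the raw-pixel tile features, and the toy 4x4 image are all assumptions made for illustration. The `size` argument plays the role of the template size under study.

```python
import random
from collections import Counter

def tile_image(image, size):
    """Split a 2D grayscale image (list of rows) into non-overlapping
    size x size tiles, each flattened to a vector. Partial edge tiles are dropped."""
    h, w = len(image), len(image[0])
    return [
        [image[r + i][c + j] for i in range(size) for j in range(size)]
        for r in range(0, h - size + 1, size)
        for c in range(0, w - size + 1, size)
    ]

def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def nearest(vec, centroids):
    """Index of the centroid closest to vec (ties go to the lowest index)."""
    return min(range(len(centroids)), key=lambda k: sq_dist(vec, centroids[k]))

def kmeans(vectors, k, iters=10, seed=0):
    """Plain k-means, used here only as a stand-in for the unspecified
    high-dimensional clustering algorithm (assumption)."""
    rng = random.Random(seed)
    centroids = [list(v) for v in rng.sample(vectors, k)]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for v in vectors:
            groups[nearest(v, centroids)].append(v)
        for idx, group in enumerate(groups):
            if group:  # an empty cluster keeps its previous centroid
                centroids[idx] = [sum(col) / len(group) for col in zip(*group)]
    return centroids

def multimodal_bag(image, text_keywords, centroids, size):
    """Count visual keywords (nearest centroid per tile) together with text keywords."""
    bag = Counter(f"vk{nearest(t, centroids)}" for t in tile_image(image, size))
    bag.update(f"txt:{word}" for word in text_keywords)
    return bag

# Toy 4x4 image with two dark and two bright 2x2 regions (hypothetical data).
size = 2  # candidate template size
image = [
    [0, 0, 9, 9],
    [0, 0, 9, 9],
    [9, 9, 0, 0],
    [9, 9, 0, 0],
]
tiles = tile_image(image, size)
centroids = kmeans(tiles, k=2)
bag = multimodal_bag(image, ["beach"], centroids, size)
print(bag)
```

Re-running the last four lines with different values of `size` gives a different tile set and therefore a different visual vocabulary, which is the parameter the paper sets out to tune. How the resulting clusterings are scored against each other is not stated in the abstract, so it is left out of this sketch.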

Keywords

feature extraction; image representation; pattern clustering; text analysis; multimodal image clustering; multimodal image representation; multimodality approach; text annotation; visual keyword; Clustering algorithms; Computer science; Frequency; Image analysis; Image classification; Image databases; Kernel; Shape; Tiles; diffusion kernel; image clustering; multimodal

fLanguage

English

Publisher

IEEE

Conference_Titel

Multimedia Computing and Systems, 2009. ICMCS '09. International Conference on

Conference_Location

Ouarzazate

Print_ISBN

978-1-4244-3756-6

Electronic_ISBN

978-1-4244-3757-3

Type

conf

DOI

10.1109/MMCS.2009.5256634

Filename

5256634