Title :
Searching an appropriate template size for multimodal image clustering
Author :
Agrawal, Rajeev ; Grosky, William I. ; Fotouhi, Farshad
Author_Institution :
Dept. of Comput. Sci., Wayne State Univ., Detroit, MI, USA
Abstract :
Researchers have shown that a multimodal approach can help identify better clusters in an image collection. Multimodal image features include low-level image features and any available text annotations. Combining them helps reveal inherent relationships among the different types of features associated with an image. In our approach, we divide images into small tiles and create visual keywords using a high-dimensional clustering algorithm. These visual keywords play the same role as text keywords. One challenge of this approach is identifying an appropriate size for the visual keywords. In this paper, we report our results in finding a suitable template size for creating the tiles from which visual keywords are built. The visual keywords are then combined with text keywords to create a multimodal image representation before clustering is applied.
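The pipeline outlined in the abstract (tile the images, cluster the tiles into visual keywords, then join visual and text keywords into one vector) might be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: raw grayscale pixel values stand in for the low-level features, a plain k-means loop stands in for the unspecified high-dimensional clustering algorithm, and the names `tile_image`, `kmeans`, `visual_keyword_histogram`, and `multimodal_vector` are hypothetical.

```python
import numpy as np

def tile_image(image, tile_size):
    """Split a 2-D grayscale image into non-overlapping square tiles.

    Each tile is flattened to a vector; edge pixels that do not fill
    a whole tile are dropped.
    """
    h, w = image.shape
    rows, cols = h // tile_size, w // tile_size
    cropped = image[:rows * tile_size, :cols * tile_size]
    tiles = cropped.reshape(rows, tile_size, cols, tile_size).swapaxes(1, 2)
    return tiles.reshape(rows * cols, tile_size * tile_size).astype(float)

def kmeans(points, k, iters=20, seed=0):
    """Plain k-means, an assumed stand-in for the paper's clustering step."""
    rng = np.random.default_rng(seed)
    centers = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(iters):
        dists = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            members = points[labels == j]
            if len(members):  # keep the old center if a cluster is empty
                centers[j] = members.mean(axis=0)
    return centers

def visual_keyword_histogram(tiles, centers):
    """Count how many tiles of one image fall on each visual keyword."""
    dists = ((tiles[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    return np.bincount(dists.argmin(axis=1), minlength=len(centers))

def multimodal_vector(visual_hist, text_counts):
    """Concatenate visual-keyword and text-keyword counts."""
    return np.concatenate([visual_hist, text_counts])

# Usage with synthetic data; tile_size is the template size under study.
rng = np.random.default_rng(0)
images = [rng.integers(0, 256, size=(32, 32)) for _ in range(3)]
tile_size = 8
all_tiles = np.vstack([tile_image(img, tile_size) for img in images])
centers = kmeans(all_tiles, k=4)
text_counts = np.array([1, 0, 2])  # hypothetical annotation term counts
hist = visual_keyword_histogram(tile_image(images[0], tile_size), centers)
vec = multimodal_vector(hist, text_counts)
```

Repeating the run with different values of `tile_size` and comparing the resulting clusterings is one way the template-size search could be explored. How cluster quality is scored is left open here, since the abstract does not specify it.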
Keywords :
feature extraction; image representation; pattern clustering; text analysis; multimodal image clustering; multimodal image representation; multimodality approach; text annotation; visual keyword; Clustering algorithms; Computer science; Frequency; Image analysis; Image classification; Image databases; Image representation; Kernel; Shape; Tiles; diffusion kernel; image clustering; multimodal; visual keyword;
Conference_Title :
Multimedia Computing and Systems, 2009. ICMCS '09. International Conference on
Conference_Location :
Ouarzazate
Print_ISBN :
978-1-4244-3756-6
Electronic_ISBN :
978-1-4244-3757-3
DOI :
10.1109/MMCS.2009.5256634