DocumentCode
3441193
Title
Searching an appropriate template size for multimodal image clustering
Author
Agrawal, Rajeev ; Grosky, William I. ; Fotouhi, Farshad
Author_Institution
Dept. of Comput. Sci., Wayne State Univ., Detroit, MI, USA
fYear
2009
fDate
2-4 April 2009
Firstpage
560
Lastpage
564
Abstract
It has been shown by researchers that using a multimodality approach can help in identifying better clusters in an image collection. The multimodal image features include low-level image features and available text annotations. This approach helps in identifying inherent relationships among different types of features associated with an image. In our approach, we divide images into small tiles and create visual keywords using a high-dimensional clustering algorithm. These visual keywords act the same as text keywords. One of the challenges of this approach is to identify an appropriate size for visual keywords. In this paper, we report our results in finding a suitable template size that can be used to create tiles for visual keywords. These visual keywords are combined with text keywords to create a multimodal image representation before applying clustering.
Keywords
feature extraction; image representation; pattern clustering; text analysis; multimodal image clustering; multimodal image representation; multimodality approach; text annotation; visual keyword; Clustering algorithms; Computer science; Frequency; Image analysis; Image classification; Image databases; Image representation; Kernel; Shape; Tiles; diffusion kernel; image clustering; multimodal; visual keyword;
fLanguage
English
Publisher
ieee
Conference_Titel
Multimedia Computing and Systems, 2009. ICMCS '09. International Conference on
Conference_Location
Ouarzazate
Print_ISBN
978-1-4244-3756-6
Electronic_ISBN
978-1-4244-3757-3
Type
conf
DOI
10.1109/MMCS.2009.5256634
Filename
5256634
Link To Document