DocumentCode
3017175
Title
Discovery of Collocation Patterns: from Visual Words to Visual Phrases
Author
Yuan, Junsong ; Wu, Ying ; Yang, Ming
Author_Institution
Northwestern Univ., Evanston
fYear
2007
fDate
17-22 June 2007
Firstpage
1
Lastpage
8
Abstract
A visual word lexicon can be constructed by clustering primitive visual features, and a visual object can be described by a set of visual words. Such a "bag-of-words" representation has led to many significant results in various vision tasks including object recognition and categorization. However, in practice, the clustering of primitive visual features tends to result in synonymous visual words that over-represent visual patterns, as well as polysemous visual words that bring large uncertainties and ambiguities in the representation. This paper aims at generating a higher-level lexicon, i.e. visual phrase lexicon, where a visual phrase is a meaningful spatially co-occurrent pattern of visual words. This higher-level lexicon is much less ambiguous than the lower-level one. The contributions of this paper include: (1) a fast and principled solution to the discovery of significant spatial co-occurrent patterns using frequent itemset mining; (2) a pattern summarization method that deals with the compositional uncertainties in visual phrases; and (3) a top-down refinement scheme of the visual word lexicon by feeding back discovered phrases to tune the similarity measure through metric learning.
Keywords
feature extraction; image representation; pattern clustering; bag-of-words representation; collocation patterns discovery; higher-level lexicon; object recognition; pattern summarization method; polysemous visual words; primitive visual features clustering; synonymous visual words; top-down refinement scheme; visual phrase lexicon; visual word lexicon; Computer vision; Data mining; Impedance; Information retrieval; Itemsets; Object recognition; Spatial resolution; Text recognition; Uncertainty; Vector quantization;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Vision and Pattern Recognition, 2007. CVPR '07. IEEE Conference on
Conference_Location
Minneapolis, MN
ISSN
1063-6919
Print_ISBN
1-4244-1179-3
Electronic_ISBN
1063-6919
Type
conf
DOI
10.1109/CVPR.2007.383222
Filename
4270247
Link To Document