• DocumentCode
    3017175
  • Title

    Discovery of Collocation Patterns: from Visual Words to Visual Phrases

  • Author

    Yuan, Junsong ; Wu, Ying ; Yang, Ming

  • Author_Institution
    Northwestern Univ., Evanston
  • fYear
    2007
  • fDate
    17-22 June 2007
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    A visual word lexicon can be constructed by clustering primitive visual features, and a visual object can be described by a set of visual words. Such a "bag-of-words" representation has led to many significant results in various vision tasks including object recognition and categorization. However, in practice, the clustering of primitive visual features tends to result in synonymous visual words that over-represent visual patterns, as well as polysemous visual words that bring large uncertainties and ambiguities in the representation. This paper aims at generating a higher-level lexicon, i.e. visual phrase lexicon, where a visual phrase is a meaningful spatially co-occurrent pattern of visual words. This higher-level lexicon is much less ambiguous than the lower-level one. The contributions of this paper include: (1) a fast and principled solution to the discovery of significant spatial co-occurrent patterns using frequent itemset mining; (2) a pattern summarization method that deals with the compositional uncertainties in visual phrases; and (3) a top-down refinement scheme of the visual word lexicon by feeding back discovered phrases to tune the similarity measure through metric learning.
  • Keywords
    feature extraction; image representation; pattern clustering; bag-of-words representation; collocation patterns discovery; higher-level lexicon; object recognition; pattern summarization method; polysemous visual words; primitive visual features clustering; synonymous visual words; top-down refinement scheme; visual phrase lexicon; visual word lexicon; Computer vision; Data mining; Impedance; Information retrieval; Itemsets; Object recognition; Spatial resolution; Text recognition; Uncertainty; Vector quantization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Vision and Pattern Recognition, 2007. CVPR '07. IEEE Conference on
  • Conference_Location
    Minneapolis, MN
  • ISSN
    1063-6919
  • Print_ISBN
    1-4244-1179-3
  • Electronic_ISBN
    1063-6919
  • Type

    conf

  • DOI
    10.1109/CVPR.2007.383222
  • Filename
    4270247