• DocumentCode
    88190
  • Title

    A Hierarchical Word-Merging Algorithm with Class Separability Measure

  • Author

    Lei Wang ; Luping Zhou ; Chunhua Shen ; Lingqiao Liu ; Huan Liu

  • Author_Institution
    Sch. of Comput. Sci. & Software Eng., Univ. of Wollongong, Wollongong, NSW, Australia
  • Volume
    36
  • Issue
    3
  • fYear
    2014
  • fDate
    Mar-14
  • Firstpage
    417
  • Lastpage
    435
  • Abstract
    In image recognition with the bag-of-features model, a small-sized visual codebook is usually preferred to obtain a low-dimensional histogram representation and high computational efficiency. Such a visual codebook has to be discriminative enough to achieve excellent recognition performance. To create a compact and discriminative codebook, in this paper we propose to merge the visual words in a large-sized initial codebook by maximally preserving class separability. We first show that this results in a difficult optimization problem. To deal with this situation, we devise a suboptimal but very efficient hierarchical word-merging algorithm, which optimally merges two words at each level of the hierarchy. By exploiting the characteristics of the class separability measure and designing a novel indexing structure, the proposed algorithm can hierarchically merge 10,000 visual words down to two words in merely 90 seconds. Also, to show the properties of the proposed algorithm and reveal its advantages, we conduct detailed theoretical analysis to compare it with another hierarchical word-merging algorithm that maximally preserves mutual information, obtaining interesting findings. Experimental studies are conducted to verify the effectiveness of the proposed algorithm on multiple benchmark data sets. As shown, it can efficiently produce more compact and discriminative codebooks than the state-of-the-art hierarchical word-merging algorithms, especially when the size of the codebook is significantly reduced.
  • Keywords
    feature extraction; image coding; image recognition; image representation; indexing; optimisation; statistical analysis; bag-of-features model; class separability measure; hierarchical word-merging algorithm; image recognition; indexing structure; low-dimensional histogram representation; mutual information; optimization problem; recognition performance; small-sized visual codebook; visual words; Algorithm design and analysis; Computational modeling; Histograms; Merging; Tin; Training; Visualization; Hierarchical word merge; bag-of-features model; class separability; compact codebook; object recognition;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2013.160
  • Filename
    6731380