• DocumentCode
    46239
  • Title

    Informative visual words construction to improve bag of words image representation

  • Author

    Farhangi, Mohammad Mehdi ; Soryani, Mohsen ; Fathy, Mahmood

  • Author_Institution
    Sch. of Comput. Eng., Iran Univ. of Sci. & Technol., Tehran, Iran
  • Volume
    8
  • Issue
    5
  • fYear
    2014
  • fDate
    May-14
  • Firstpage
    310
  • Lastpage
    318
  • Abstract
    Bag of visual words model has recently attracted much attention from computer vision society because of its notable success in analysing images and exploring their content. This study improves this model by utilising the adjacency information between words. To explore this information, a binary tree structure is constructed from the visual words in order to model the is - a relationships in the vocabulary. Informative nodes of this tree are extracted by using the χ2 criterion and are used to capture the adjacency information of visual words. This approach is a simple and computationally effective way for modelling the spatial relations of visual words, which improves the image classification performance. The authors evaluated our method for visual classification of three known datasets: 15 natural scenes, Caltech-101 and Graz-01.
  • Keywords
    computer vision; image classification; image representation; adjacency information; binary tree structure; computer vision; image classification; image representation; informative visual words construction;
  • fLanguage
    English
  • Journal_Title
    Image Processing, IET
  • Publisher
    iet
  • ISSN
    1751-9659
  • Type

    jour

  • DOI
    10.1049/iet-ipr.2013.0449
  • Filename
    6829932