• DocumentCode
    3515295
  • Title

    A hierarchical grid feature representation framework for automatic image annotation

  • Author

    Kim, Ilseo ; Lee, Chin-Hui

  • Author_Institution
    Sch. of Electr. & Comput. Eng., Georgia Inst. of Technol., Atlanta, GA
  • fYear
    2009
  • fDate
    19-24 April 2009
  • Firstpage
    1125
  • Lastpage
    1128
  • Abstract
    We propose a hierarchical-grid (HG) feature analysis framework for representing images in automatic image annotation (AIA). We explore the properties of codebooks constructed with different-sized grids in image sub-blocks, and co-occurrence relationship between VQ codewords constructed from different grid systems. The proposed HG approach is evaluated on the TRECVID 2005 data set using classifiers obtained with maximal figure-of-merit discriminative training. With multi-level and cross-level grid systems incorporating bigram information within and between higher and lower grid levels, we show that the AIA performance can be significantly improved. For 20 selected concepts from the 39-concept LSCOM-Lite annotation set, we achieve a best F1 in almost all the concepts. The overall performance improvement with the combined multi-level and cross-level grid systems over the best single-size grid system in micro F1 is about 12.1%.
  • Keywords
    feature extraction; grid computing; image representation; automatic image annotation; bigram information; codebooks; codewords; cross-level grid system; figure-of-merit discriminative training; grid feature representation framework; hierarchical grid feature analysis framework; image representation; multi-level grid system; single-size grid system; Computational efficiency; Feature extraction; Gaussian processes; Grid computing; Image analysis; Image representation; Image retrieval; Indexing; Mercury (metals); Text categorization; Automatic image annotation; hierarchical-grid; high-level feature extraction; video indexing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on
  • Conference_Location
    Taipei
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-2353-8
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2009.4959786
  • Filename
    4959786