• DocumentCode
    1756425
  • Title

    Objective-Guided Image Annotation

  • Author

    Qi Mao ; Tsang, Ivor Wai-Hung ; Shenghua Gao

  • Author_Institution
    Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • Volume
    22
  • Issue
    4
  • fYear
    2013
  • fDate
    41365
  • Firstpage
    1585
  • Lastpage
    1597
  • Abstract
    Automatic image annotation, which is usually formulated as a multi-label classification problem, is one of the major tools used to enhance the semantic understanding of web images. Many multimedia applications (e.g., tag-based image retrieval) can greatly benefit from image annotation. However, the insufficient performance of image annotation methods prevents these applications from being practical. On the other hand, specific measures are usually designed to evaluate how well one annotation method performs for a specific objective or application, but most image annotation methods do not consider optimization of these measures, so that they are inevitably trapped into suboptimal performance of these objective-specific measures. To address this issue, we first summarize a variety of objective-guided performance measures under a unified representation. Our analysis reveals that macro-averaging measures are very sensitive to infrequent keywords, and hamming measure is easily affected by skewed distributions. We then propose a unified multi-label learning framework, which directly optimizes a variety of objective-specific measures of multi-label learning tasks. Specifically, we first present a multilayer hierarchical structure of learning hypotheses for multi-label problems based on which a variety of loss functions with respect to objective-guided measures are defined. And then, we formulate these loss functions as relaxed surrogate functions and optimize them by structural SVMs. According to the analysis of various measures and the high time complexity of optimizing micro-averaging measures, in this paper, we focus on example-based measures that are tailor-made for image annotation tasks but are seldom explored in the literature. Experiments show consistency with the formal analysis on two widely used multi-label datasets, and demonstrate the superior performance of our proposed method over state-of-the-art baseline methods in terms of example-based measures on four - mage annotation datasets.
  • Keywords
    image classification; automatic image annotation; hamming measure; image annotation methods; macro-averaging measures; multilabel classification problem; multilabel datasets; multimedia applications; objective-guided image annotation; objective-guided performance; objective-specific measures; relaxed surrogate functions; semantic understanding; skewed distributions; structural SVM; suboptimal performance; tag-based image retrieval; unified multilabel learning framework; Frequency measurement; Image retrieval; Joints; Loss measurement; Random variables; Support vector machines; Vectors; Image annotation; multi-label learning; performance measures; structural support vector machine (SVM);
  • fLanguage
    English
  • Journal_Title
    Image Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1057-7149
  • Type

    jour

  • DOI
    10.1109/TIP.2012.2233490
  • Filename
    6378455