• DocumentCode
    66604
  • Title

    Parameter Estimation and Energy Minimization for Region-Based Semantic Segmentation

  • Author

    Kumar, M. Pawan ; Turki, Haithem ; Preston, Dan ; Koller, Daphne

  • Author_Institution
    Center for Visual Comput., Ecole Centrale Paris, Chatenay-Malabry, France
  • Volume
    37
  • Issue
    7
  • fYear
    2015
  • fDate
    July 1 2015
  • Firstpage
    1373
  • Lastpage
    1386
  • Abstract
    We consider the problem of parameter estimation and energy minimization for a region-based semantic segmentation model. The model divides the pixels of an image into non-overlapping connected regions, each of which is to a semantic class. In the context of energy minimization, the main problem we face is the large number of putative pixel-to-region assignments. We address this problem by designing an accurate linear programming based approach for selecting the best set of regions from a large dictionary. The dictionary is constructed by merging and intersecting segments obtained from multiple bottom-up over-segmentations. The linear program is solved efficiently using dual decomposition. In the context of parameter estimation, the main problem we face is the lack of fully supervised data. We address this issue by developing a principled framework for parameter estimation using diverse data. More precisely, we propose a latent structural support vector machine formulation, where the latent variables model any missing information in the human annotation. Of particular interest to us are three types of annotations: (i) images segmented using generic foreground or background classes; (ii) images with bounding boxes specified for objects; and (iii) images labeled to indicate the presence of a class. Using large, publicly available datasets we show that our methods are able to significantly improve the accuracy of the region-based model.
  • Keywords
    image segmentation; linear programming; minimisation; parameter estimation; support vector machines; background classes; bounding boxes; dictionary; dual decomposition; energy minimization; generic foreground classes; human annotation; image pixels; latent structural support vector machine formulation; latent variables; linear programming; multiple bottom-up over-segmentations; nonoverlapping connected regions; parameter estimation; putative pixel-to-region assignments; region-based semantic segmentation model; semantic class; Biological system modeling; Dictionaries; Feature extraction; Image segmentation; Linear programming; Minimization; Semantics; LP relaxation; Semantic segmentation; energy minimization; weakly supervised learning;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2014.2372766
  • Filename
    6971190