• DocumentCode
    178191
  • Title

    Image Segmentation and Labeling Using Free-Form Semantic Annotation

  • Author

    Tegen, A. ; Weegar, R. ; Hammarlund, L. ; Oskarsson, M. ; Fangyuan Jiang ; Medved, D. ; Nugues, P. ; Astrom, K.

  • Author_Institution
    Centre for Math. Sci., Lund Univ., Lund, Sweden
  • fYear
    2014
  • fDate
    24-28 Aug. 2014
  • Firstpage
    2281
  • Lastpage
    2286
  • Abstract
    In this paper we investigate the problem of segmenting images using the information in text annotations. In contrast to the general image understanding problem, this type of annotation guided segmentation is less ill-posed in the sense that for the output there is higher consensus among human annotations. In the paper we present a system based on a combined visual and semantic pipeline. In the visual pipeline, a list of tentative figure-ground segmentations is first proposed. Each such segmentation is classified into a set of visual categories. In the natural language processing pipeline, the text is parsed and chunked into objects. Each chunk is then compared with the visual categories and the relative distance is computed using the word-net structure. The final choice of segments and their correspondence to the chunked objects are then obtained using combinatorial optimization. The output is compared to manually annotated ground-truth images. The results are promising and there are several interesting avenues for continued research.
  • Keywords
    combinatorial mathematics; image segmentation; natural language processing; optimisation; text analysis; annotation guided segmentation; combinatorial optimization; free-form semantic annotation; human annotations; image labeling; image segmentation; image understanding problem; manually annotated ground-truth images; natural language processing pipeline; semantic pipeline; tentative figure-ground segmentations; text annotations; visual categories; visual pipeline; word-net structure; Context; Databases; Detectors; Image segmentation; Pipelines; Semantics; Visualization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2014 22nd International Conference on
  • Conference_Location
    Stockholm
  • ISSN
    1051-4651
  • Type

    conf

  • DOI
    10.1109/ICPR.2014.396
  • Filename
    6977108