• DocumentCode
    73001
  • Title

    Scale and Orientation Invariant Text Segmentation for Born-Digital Compound Images

  • Author

    Huan Yang ; Shiqian Wu ; Chenwei Deng ; Weisi Lin

  • Author_Institution
    Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • Volume
    45
  • Issue
    3
  • fYear
    2015
  • fDate
    Mar-15
  • Firstpage
    533
  • Lastpage
    547
  • Abstract
    Many recent applications require text segmentation for born-digital compound images. To this end, we propose a coarse-to-fine framework for segmenting texts of arbitrary scales and orientations in born-digital compound images. In the coarse stage, the local image activity measure is designed based upon the variation distribution of characters, to highlight the difference between textual and pictorial regions. This stage outputs a coarse textual layer including textual regions as well as a few pictorial regions with high activity. In the fine stage, a textual connected component (TCC) based refinement is proposed to eliminate the survived pictorial regions. In particular, a scale and orientation invariant grouping algorithm is proposed to adaptively generate TCCs with uniform statistical features. The minimum average distance and morphological operations are employed to assist the formation of candidate TCCs. Then, three string-level features (i.e., shapeness, color similarity, and mean activity level) are designed to distinguish the true TCCs from the false positive ones that are formed by connecting the high activity pictorial components. Extensive experiments show that the proposed framework can segment textual regions precisely from born-digital compound images, while preserving the integrity of texts with varied scales and orientations, and avoiding over-connection of textual regions.
  • Keywords
    feature extraction; image colour analysis; image segmentation; image texture; TCC based refinement; born-digital compound images; character variation distribution; coarse-to-fine framework; color similarity feature; image activity measure; mean activity level feature; minimum average distance; morphological operation; orientation invariant text segmentation; pictorial region; scale invariant text segmentation; scale-and-orientation invariant grouping algorithm; shapeness feature; string-level feature; textual connected component; textual region; uniform statistical features; Algorithm design and analysis; Compounds; Cybernetics; Feature extraction; Image color analysis; Image segmentation; Joining processes; Born-digital compound image; connected component; image activity measure; text segmentation;
  • fLanguage
    English
  • Journal_Title
    Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2168-2267
  • Type

    jour

  • DOI
    10.1109/TCYB.2014.2330657
  • Filename
    6845343