Title :
Scale and Orientation Invariant Text Segmentation for Born-Digital Compound Images
Author :
Huan Yang ; Shiqian Wu ; Chenwei Deng ; Weisi Lin
Author_Institution :
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
Abstract :
Many recent applications require text segmentation for born-digital compound images. To this end, we propose a coarse-to-fine framework for segmenting texts of arbitrary scales and orientations in born-digital compound images. In the coarse stage, the local image activity measure is designed based upon the variation distribution of characters, to highlight the difference between textual and pictorial regions. This stage outputs a coarse textual layer including textual regions as well as a few pictorial regions with high activity. In the fine stage, a textual connected component (TCC) based refinement is proposed to eliminate the survived pictorial regions. In particular, a scale and orientation invariant grouping algorithm is proposed to adaptively generate TCCs with uniform statistical features. The minimum average distance and morphological operations are employed to assist the formation of candidate TCCs. Then, three string-level features (i.e., shapeness, color similarity, and mean activity level) are designed to distinguish the true TCCs from the false positive ones that are formed by connecting the high activity pictorial components. Extensive experiments show that the proposed framework can segment textual regions precisely from born-digital compound images, while preserving the integrity of texts with varied scales and orientations, and avoiding over-connection of textual regions.
Keywords :
feature extraction; image colour analysis; image segmentation; image texture; TCC based refinement; born-digital compound images; character variation distribution; coarse-to-fine framework; color similarity feature; image activity measure; mean activity level feature; minimum average distance; morphological operation; orientation invariant text segmentation; pictorial region; scale invariant text segmentation; scale-and-orientation invariant grouping algorithm; shapeness feature; string-level feature; textual connected component; textual region; uniform statistical features; Algorithm design and analysis; Compounds; Cybernetics; Feature extraction; Image color analysis; Image segmentation; Joining processes; Born-digital compound image; connected component; image activity measure; text segmentation;
Journal_Title :
Cybernetics, IEEE Transactions on
DOI :
10.1109/TCYB.2014.2330657