DocumentCode
73001
Title
Scale and Orientation Invariant Text Segmentation for Born-Digital Compound Images
Author
Huan Yang ; Shiqian Wu ; Chenwei Deng ; Weisi Lin
Author_Institution
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
Volume
45
Issue
3
fYear
2015
fDate
Mar-15
Firstpage
533
Lastpage
547
Abstract
Many recent applications require text segmentation for born-digital compound images. To this end, we propose a coarse-to-fine framework for segmenting texts of arbitrary scales and orientations in born-digital compound images. In the coarse stage, the local image activity measure is designed based upon the variation distribution of characters, to highlight the difference between textual and pictorial regions. This stage outputs a coarse textual layer including textual regions as well as a few pictorial regions with high activity. In the fine stage, a textual connected component (TCC) based refinement is proposed to eliminate the survived pictorial regions. In particular, a scale and orientation invariant grouping algorithm is proposed to adaptively generate TCCs with uniform statistical features. The minimum average distance and morphological operations are employed to assist the formation of candidate TCCs. Then, three string-level features (i.e., shapeness, color similarity, and mean activity level) are designed to distinguish the true TCCs from the false positive ones that are formed by connecting the high activity pictorial components. Extensive experiments show that the proposed framework can segment textual regions precisely from born-digital compound images, while preserving the integrity of texts with varied scales and orientations, and avoiding over-connection of textual regions.
Keywords
feature extraction; image colour analysis; image segmentation; image texture; TCC based refinement; born-digital compound images; character variation distribution; coarse-to-fine framework; color similarity feature; image activity measure; mean activity level feature; minimum average distance; morphological operation; orientation invariant text segmentation; pictorial region; scale invariant text segmentation; scale-and-orientation invariant grouping algorithm; shapeness feature; string-level feature; textual connected component; textual region; uniform statistical features; Algorithm design and analysis; Compounds; Cybernetics; Feature extraction; Image color analysis; Image segmentation; Joining processes; Born-digital compound image; connected component; image activity measure; text segmentation;
fLanguage
English
Journal_Title
Cybernetics, IEEE Transactions on
Publisher
ieee
ISSN
2168-2267
Type
jour
DOI
10.1109/TCYB.2014.2330657
Filename
6845343
Link To Document