DocumentCode
514940
Title
Run-Based Approach to Labeling Connected Components in Document Images
Author
Tu, Xiao ; Lu, Yue
Author_Institution
Dept. of Comput. Sci. & Technol., East China Normal Univ., Shanghai, China
Volume
2
fYear
2010
fDate
6-7 March 2010
Firstpage
206
Lastpage
209
Abstract
A fast algorithm is proposed in this paper to label connected components in binary document images. Runs are extracted from the image row by row. The positional relations among the runs of current rows and the runs of their preceding rows are represented utilizing trees, where each tree corresponds to a connected component. Only one-pass scan is required for the proposed approach to obtain the characteristics of the connected components, such as bounding rectangle, area, number of pixels. It is thus a fast and effective algorithm. Experimental results have shown that the efficiency of the present algorithm is superior to that of the conventional algorithms in terms of computational speed.
Keywords
document image processing; image recognition; binary document images; document image recognition systems; labeling connected components; run-based approach; Computer science; Computer science education; Data mining; Educational technology; Flowcharts; Image analysis; Image storage; Labeling; Pixel; Text analysis; connected component; document image analysis; run-based; tree;
fLanguage
English
Publisher
ieee
Conference_Titel
Education Technology and Computer Science (ETCS), 2010 Second International Workshop on
Conference_Location
Wuhan
Print_ISBN
978-1-4244-6388-6
Electronic_ISBN
978-1-4244-6389-3
Type
conf
DOI
10.1109/ETCS.2010.424
Filename
5459935
Link To Document