• DocumentCode
    514940
  • Title

    Run-Based Approach to Labeling Connected Components in Document Images

  • Author

    Tu, Xiao ; Lu, Yue

  • Author_Institution
    Dept. of Comput. Sci. & Technol., East China Normal Univ., Shanghai, China
  • Volume
    2
  • fYear
    2010
  • fDate
    6-7 March 2010
  • Firstpage
    206
  • Lastpage
    209
  • Abstract
    A fast algorithm is proposed in this paper to label connected components in binary document images. Runs are extracted from the image row by row. The positional relations among the runs of current rows and the runs of their preceding rows are represented utilizing trees, where each tree corresponds to a connected component. Only one-pass scan is required for the proposed approach to obtain the characteristics of the connected components, such as bounding rectangle, area, number of pixels. It is thus a fast and effective algorithm. Experimental results have shown that the efficiency of the present algorithm is superior to that of the conventional algorithms in terms of computational speed.
  • Keywords
    document image processing; image recognition; binary document images; document image recognition systems; labeling connected components; run-based approach; Computer science; Computer science education; Data mining; Educational technology; Flowcharts; Image analysis; Image storage; Labeling; Pixel; Text analysis; connected component; document image analysis; run-based; tree;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Education Technology and Computer Science (ETCS), 2010 Second International Workshop on
  • Conference_Location
    Wuhan
  • Print_ISBN
    978-1-4244-6388-6
  • Electronic_ISBN
    978-1-4244-6389-3
  • Type

    conf

  • DOI
    10.1109/ETCS.2010.424
  • Filename
    5459935