• DocumentCode
    3022987
  • Title

    Multi-scale techniques for document page segmentation

  • Author

    Shi, Zhixin ; Govindaraju, Venu

  • Author_Institution
    Center of Excellence for Document Anal. & Recognition, State Univ. of New York at Buffalo, Amherst, NY, USA
  • fYear
    2005
  • fDate
    29 Aug.-1 Sept. 2005
  • Firstpage
    1020
  • Abstract
    Page segmentation algorithms found in published literatures often rely on some predetermined parameters such as general font sizes, distances between text lines and document scan resolutions. Variations of these parameters in real document images greatly affect the performance of the algorithms. In this paper, we present a novel approach for document page segmentation using a multi-scale technique. An efficient implementation of a local connectivity algorithm transforms a document image into a parameter domain in which a parameter value at a pixel location represents a connectivity property for its neighboring foreground pixels in the original document image. Then a top-down approach with a linear search reveals the document regions at each scale levels as text block, text lines and graphics. We consider our algorithm a transform based multi-scale method. Our ongoing research shows that the algorithm is robust for variations of document parameters.
  • Keywords
    document image processing; image segmentation; document images; document page segmentation; linear search; local connectivity algorithm; multiscale technique; Algorithm design and analysis; Geometry; Graphics; Image segmentation; Layout; Pixel; Robustness; Text analysis; Text recognition; Venus;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on
  • ISSN
    1520-5263
  • Print_ISBN
    0-7695-2420-6
  • Type

    conf

  • DOI
    10.1109/ICDAR.2005.165
  • Filename
    1575698