• DocumentCode
    1633858
  • Title

    Voronoi++: A Dynamic Page Segmentation Approach Based on Voronoi and Docstrum Features

  • Author

    Agrawal, Mudit ; Doermann, David

  • Author_Institution
    Inst. of Adv. Comput. Studies, Univ. of Maryland, College Park, MD, USA
  • fYear
    2009
  • Firstpage
    1011
  • Lastpage
    1015
  • Abstract
    This paper presents a dynamic approach to document page segmentation. Current page segmentation algorithms lack the ability to dynamically adapt local variations in the size, orientation and distance of components within a page. Our approach builds upon one of the best algorithms, Kise et. al. work based on Area Voronoi Diagrams, which adapts globally to page content to determine algorithm parameters. In our approach, local thresholds are determined dynamically based on parabolic relations between components, and Docstrum based angular and neighborhood features are integrated to improve accuracy. Zone-based evaluation was performed on four sets of printed and handwritten documents in English and Arabic scripts and an increase of 33% in accuracy is reported.
  • Keywords
    computational geometry; document handling; image segmentation; Docstrum features; Voronoi++; document page segmentation; handwritten documents; neighborhood features; page content; printed documents; zone-based evaluation; Algorithm design and analysis; Carbon capture and storage; Communications technology; Educational institutions; Image analysis; Image segmentation; Magnetic separation; Performance evaluation; Text analysis; White spaces; adaptive; docstrum; dynamic; page segmentation; voronoi;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4244-4500-4
  • Electronic_ISBN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2009.270
  • Filename
    5277532