• DocumentCode
    2629256
  • Title

    A hybrid page segmentation method

  • Author

    Okamoto, Masayuki ; Takahashi, Makoto

  • Author_Institution
    Dept. of Inf. Eng., Shinshu Univ., Nagano, Japan
  • fYear
    1993
  • fDate
    20-22 Oct 1993
  • Firstpage
    743
  • Lastpage
    746
  • Abstract
    A method of page segmentation using field separators and white streams is described and applied to the layout analysis of various types of printed pages which may have horizontal and vertical textlines. In complex page layouts, text columns which are printed closely together are often separated by thin black lines (field separators) or long white spaces (white streams). These separators are first extracted by horizontal and vertical scanning of a page, and then a global partitioning of the page into blocks is performed. Next in each block, black connected components are merged into textlines along the directions of separators horizontally or vertically. In experimental trials on various types of page layouts, such techniques produced robust and fast results
  • Keywords
    document image processing; image segmentation; black connected components; field separators; global partitioning; horizontal scanning; horizontal textlines; hybrid page segmentation method; layout analysis; page layouts; printed pages; text columns; vertical scanning; vertical textlines; white streams; Image segmentation; Information analysis; National electric code; Particle separators; Robustness; White spaces;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 1993., Proceedings of the Second International Conference on
  • Conference_Location
    Tsukuba Science City
  • Print_ISBN
    0-8186-4960-7
  • Type

    conf

  • DOI
    10.1109/ICDAR.1993.395630
  • Filename
    395630