• DocumentCode
    3775922
  • Title

    A hybrid method for table detection from document image

  • Author

    Tran Tuan Anh;Na In-Seop;Kim Soo-Hyung

  • Author_Institution
    Department of Computer Science, Chonnam National University, 77 Yongbong-ro, 500-757 South Korea
  • fYear
    2015
  • Firstpage
    131
  • Lastpage
    135
  • Abstract
    In this paper, we present a hybrid method consisting of three main stages for detecting tables in document images. Based on table structure, our system separates table into two main categories, ruling line table and non-ruling line table. In the first stage, the text and non-text elements in document are classified by a heuristic filter. Then, the white space analysis is used to group the text elements into text lines, while ruling line table candidates are identified from non-text elements. In the second stage, based on the text lines, text and non-text elements, a hybrid method which consist of the alternative bottom-up and top-down approaches is implemented to find the table region candidates. In the final stage, these candidates are examined to get the table regions by analyzing text lines and spare lines. Experimental results with the document database from the ICDAR2013 table competition show that the proposed method works better than the previous ones.
  • Keywords
    "Image color analysis","Portable document format","Text analysis","Feature extraction","White spaces","Image segmentation","Optical character recognition software"
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ACPR), 2015 3rd IAPR Asian Conference on
  • Electronic_ISBN
    2327-0985
  • Type

    conf

  • DOI
    10.1109/ACPR.2015.7486480
  • Filename
    7486480