Title :
Robust table-form structure analysis based on box-driven reasoning
Author :
Hori, Osamu ; Doermann, David S.
Author_Institution :
Center for Automation Research, University of Maryland, College Park, MD, USA
Abstract :
Table-form document structure analysis is an important problem in the document processing domain. The paper presents a method called Box-Driven Reasoning (BDR) to robustly analyze the structure of table-form documents that include touching characters and broken lines. Most previous methods employ a line-oriented approach. Real documents, however, are copied repeatedly and overlaid with printed data, resulting in characters that touch cells and lines that are broken. In contrast with these previous methods, BDR deals with regions directly. Experimental tests show that BDR reliably recognizes cells and strings in document images with touching characters and broken lines.
Keywords :
character recognition; data structures; document image processing; inference mechanisms; BDR; box driven reasoning; broken lines; document images; document processing domain; robust table form structure analysis; table form document structure analysis; touching characters; Automatic testing; Character recognition; Data mining; Data models; Educational institutions; Electronic equipment testing; Image databases; Office automation; Robustness; Spatial databases;
Conference_Title :
Proceedings of the Third International Conference on Document Analysis and Recognition, 1995
Conference_Location :
Montreal, Que.
Print_ISBN :
0-8186-7128-9
DOI :
10.1109/ICDAR.1995.598980