DocumentCode :
2630198
Title :
Toward a practical document understanding of table-form documents: its framework and knowledge representation
Author :
Watanabe, Toyohide ; Luo, Qin ; Sugie, Noboru
Author_Institution :
Dept. of Inf. Eng., Nagoya Univ., Japan
fYear :
1993
fDate :
20-22 Oct 1993
Firstpage :
510
Lastpage :
515
Abstract :
A framework of four-layer recognition processes is proposed for understanding documents, and a knowledge representation method suited to table-form documents is described. Although Y. Nakano et al. (1986) regarded the recognition of multiple kinds of table-form documents as an important practical problem, they could not report a successful approach because their knowledge was based only on physical coordinate data. The approach presented here solves this recognition problem by using both a classification tree based on physical characteristics and a structure description tree based on logical characteristics. Classifying table-form documents into appropriate document classes is not especially difficult, since such documents are well designed around vertical and horizontal line segments. For other kinds of documents, however, classification is harder because their geometric and spatial characteristics are not well specified. Applying the technique to such documents still needs to be investigated from the viewpoint of knowledge representation.
Keywords :
document handling; image recognition; knowledge representation; trees (mathematics); classification tree; document classes; four-layer recognition processes; horizontal line segments; knowledge representation; logical characteristics; physical characteristics; practical document understanding; spatial characteristics; structure description tree; table-form documents; Application specific integrated circuits; Character recognition; Data mining; Image processing; Knowledge engineering; Knowledge representation; Libraries; Pattern recognition;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Document Analysis and Recognition, 1993., Proceedings of the Second International Conference on
Conference_Location :
Tsukuba Science City
Print_ISBN :
0-8186-4960-7
Type :
conf
DOI :
10.1109/ICDAR.1993.395684
Filename :
395684