Title :
A knowledge-based approach to Chinese archive document understanding
Author :
You, Shih-Shien ; Chang, Gan-How ; Chang, Pao-Chung ; Chien, Bing-Shan
Author_Institution :
Basic Res. Lab., Telecommun. Labs., Taoyuan, Taiwan
Abstract :
The Chinese archive document possesses special geometrical and logical properties due to its construction based upon rectangular field which contain either title strings or data strings related to some other titles. In this paper, we propose a knowledge-based approach to analyze the logical relationship among the fields. After extracting the lines and fields of an archive document image, this procedure can identify fields as the title fields, the sub-title fields (if there exist such tree-structure logical relationship), and the corresponding data fields. This proposed approach enables us to achieve a better performance in information manipulation of archive documents
Keywords :
document handling; document image processing; knowledge based systems; Chinese archive document; archive document image; archive documents; data strings; document understanding; knowledge-based approach; sub-title fields; title fields; title strings; Character recognition; Data mining; Detectors; Graphics; Image segmentation; Intelligent structures; Intelligent systems; Laboratories; Seals; US Government;
Conference_Titel :
Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on
Conference_Location :
Montreal, Que.
Print_ISBN :
0-8186-7128-9
DOI :
10.1109/ICDAR.1995.601957