DocumentCode :
2502558
Title :
Form analysis and understanding based on knowledge
Author :
He, Xiuling ; Yang, Yang ; Chen, Zengzhao ; Yu, Ying ; Dong, Cailin
Author_Institution :
Sch. of Math. & Stat., Central China Normal Univ., Wuhan
fYear :
2008
fDate :
25-27 June 2008
Firstpage :
9286
Lastpage :
9291
Abstract :
Forms differ from ordinary documents and are more difficult to process; traditional document analysis techniques do not handle them well. This work investigates the key technologies of form analysis and understanding, targeting the forms used by banks and revenue agencies in China, which come in many types and large quantities, are noisy, and are printed on paper of varying quality. Based on an analysis of form knowledge, a form model built on an object-oriented sorting tree knowledge base and a triple node representation is put forward. For form feature extraction, a hierarchical regulated hit or miss transform (HRHMT) is proposed, and a feature extraction algorithm is given. The algorithm is proved to be theoretically feasible. Theoretical complexity analysis and experiments show the efficiency and superiority of the approach.
Keywords :
business forms; document image processing; feature extraction; knowledge based systems; object-oriented methods; sorting; transforms; document analyzing technology; form analysis; form feature extraction; hierarchical regulated hit or miss transform; object-orient sorting tree knowledge base; triple node representation; Automation; Computer science; Feature extraction; Intelligent control; Mathematics; Optical character recognition software; Paper technology; Sociotechnical systems; Sorting; Statistical analysis; Hierarchical Regulated Hit or Miss Transform; form; presentation model;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Intelligent Control and Automation, 2008. WCICA 2008. 7th World Congress on
Conference_Location :
Chongqing
Print_ISBN :
978-1-4244-2113-8
Electronic_ISBN :
978-1-4244-2114-5
Type :
conf
DOI :
10.1109/WCICA.2008.4594401
Filename :
4594401