DocumentCode :
2057505
Title :
Modeling documents for structure recognition using generalized N-grams
Author :
Brugger, R. ; Zramdini, A. ; Ingold, R.
Author_Institution :
Inst. de Inf., Fribourg Univ., Switzerland
Volume :
1
fYear :
1997
fDate :
18-20 Aug 1997
Firstpage :
56
Abstract :
We present and discuss a novel approach to modeling logical structures of documents, based on a statistical representation of patterns in a document class. An efficient and error tolerant recognition heuristics adapted to the model is proposed. The statistical approach permits easily automated and incremental learning of the model. The approach has been partially evaluated on a prototype. A discussion of the results achieved by the prototype is finally made
Keywords :
document image processing; image recognition; software fault tolerance; statistical analysis; trees (mathematics); document class; document modelling; error tolerant recognition heuristics; generalized N-grams; incremental learning; logical structures; statistical approach; statistical pattern representation; structure recognition; Application software; Decision trees; Error correction; Humans; Knowledge based systems; Optical character recognition software; Prototypes; Software prototyping; Text analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 1997., Proceedings of the Fourth International Conference on
Conference_Location :
Ulm
Print_ISBN :
0-8186-7898-4
Type :
conf
DOI :
10.1109/ICDAR.1997.619813
Filename :
619813
Link To Document :
بازگشت