Title :
Modeling documents for structure recognition using generalized N-grams
Author :
Brugger, R. ; Zramdini, A. ; Ingold, R.
Author_Institution :
Inst. de Inf., Fribourg Univ., Switzerland
Abstract :
We present and discuss a novel approach to modeling logical structures of documents, based on a statistical representation of patterns in a document class. An efficient and error tolerant recognition heuristics adapted to the model is proposed. The statistical approach permits easily automated and incremental learning of the model. The approach has been partially evaluated on a prototype. A discussion of the results achieved by the prototype is finally made
Keywords :
document image processing; image recognition; software fault tolerance; statistical analysis; trees (mathematics); document class; document modelling; error tolerant recognition heuristics; generalized N-grams; incremental learning; logical structures; statistical approach; statistical pattern representation; structure recognition; Application software; Decision trees; Error correction; Humans; Knowledge based systems; Optical character recognition software; Prototypes; Software prototyping; Text analysis;
Conference_Titel :
Document Analysis and Recognition, 1997., Proceedings of the Fourth International Conference on
Conference_Location :
Ulm
Print_ISBN :
0-8186-7898-4
DOI :
10.1109/ICDAR.1997.619813