Title :
Model matching in intelligent document understanding
Author :
Farrow, Gary S D ; Xydeas, Costas S. ; Oakley, John P.
Author_Institution :
Div. of Electr. Eng., Manchester Univ., UK
Abstract :
Intelligent Document Understanding (IDU) is the process of converting scanned document pages into an electronic, processable form. We have previously presented a IDU system architecture suitable for this task which uses a hybrid bottom-up/top-down control strategy. In this paper we focus on a specific subproblem that arises within the chosen framework, concerned with selecting an appropriate page layout structure. A detailed analysis of the problem using an error propagation model, allows computationally simple search strategies to be developed. A multistage layout formation algorithm is proposed and its performance is critically assessed when implemented using two different Layout Object selection criterion. The first selection criterion is based on a maximal column area coverage; the second is based on a probabilistic Layout Object selection. Both techniques have been incorporated into the hybrid IDU system and the results presented indicate its superiority over previously reported systems
Keywords :
document image processing; optical character recognition; search problems; appropriate page layout structure; computationally simple search strategies; error propagation model; hybrid bottom-up/top-down control strategy; intelligent document understanding; maximal column area coverage; model matching; probabilistic layout object selection; Computational modeling; Computer architecture; Computer vision; Control systems; Degradation; Image databases; Intelligent structures; Object detection; Performance analysis; Process control;
Conference_Titel :
Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on
Conference_Location :
Montreal, Que.
Print_ISBN :
0-8186-7128-9
DOI :
10.1109/ICDAR.1995.598997