Title :
Logical Entity Recognition in Multi-Style Document Page Images
Author :
Mao, Song ; Xu, Zheng ; Tjahjadi, Tardi ; Thoma, George R.
Author_Institution :
U.S. Nat. Libr. of Medicine, Bethesda, MD
Abstract :
Logical entity recognition in document page images is the essential part of a document image analysis system. A heterogeneous collection of document pages usually has many layout styles. Features extracted from same logical entities in different styles may have very different values and vice versa. Therefore, logical entity classifiers learned from a training set of multi-style document pages may not be reliable due to possible feature overlap of different logical entities in different styles. In this paper, we propose a novel method in which style information is used in both logical entity classifier training and recognition phases. In the training phase, training data are first classified into distinct styles, and a dedicated support vector machine (SVM) is then learned for each style. In the recognition phase, the style of a new document page image is first identified and its logical entities are then recognized using corresponding SVM. We show in our experiments that the use of the style information significantly improves the accuracy of logical entity recognition in multi-style document page images
Keywords :
document image processing; feature extraction; image classification; learning (artificial intelligence); support vector machines; document image analysis system; features extraction; logical entity classifier training; logical entity recognition; multistyle document page images; support vector machine; Biomedical imaging; Feature extraction; Image analysis; Image recognition; Information retrieval; Libraries; Support vector machine classification; Support vector machines; Text analysis; Training data;
Conference_Titel :
Pattern Recognition, 2006. ICPR 2006. 18th International Conference on
Conference_Location :
Hong Kong
Print_ISBN :
0-7695-2521-0
DOI :
10.1109/ICPR.2006.775