DocumentCode
384149
Title
Page classification through logical labelling
Author
Liang, Jian ; Doermann, David ; Ma, Matthew ; Guo, Jinhong K.
Author_Institution
Language & Media Process. Lab., Maryland Univ., College Park, MD, USA
Volume
3
fYear
2002
fDate
2002
Firstpage
477
Abstract
We propose an integrated approach to page classification and logical labelling. Layout is represented by a fully connected attributed relational graph that is matched to the graph of an unknown document, achieving classification and labelling simultaneously. By incorporating global constraints in an integrated fashion, ambiguity at the zone level can be reduced, providing robustness to noise and variation. Models are automatically trained from sample documents. Experimental results show promise for the classification and labelling of technical article title pages, and supports the idea of a hierarchical model base.
Keywords
document image processing; graph theory; image classification; optical character recognition; OCR; attributed relational graph; document images; experimental results; global constraints; hierarchical model base; labelling; logical labelling; noise; page classification; technical article title pages; unknown document; Companies; Educational institutions; Hardware; Image databases; Labeling; Laboratories; Noise level; Noise reduction; Noise robustness; Optical character recognition software;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2002. Proceedings. 16th International Conference on
ISSN
1051-4651
Print_ISBN
0-7695-1695-X
Type
conf
DOI
10.1109/ICPR.2002.1047980
Filename
1047980
Link To Document