مرکز منطقه ای اطلاع رساني علوم و فناوري - Learning document structure for retrieval and classification

DocumentCode :

595011

Title :

Learning document structure for retrieval and classification

Author :

Kumar, Jayant ; Peng Ye ; Doermann, David

Author_Institution :

Univ. of Maryland, College Park, MD, USA

fYear :

2012

fDate :

11-15 Nov. 2012

Firstpage :

1558

Lastpage :

1561

Abstract :

In this paper, we present a method for the retrieval of document images with chosen layout characteristics. The proposed method is based on statistics of patch-codewords over different regions of image. We begin with a set of wanted and a random set of unwanted images representative of a large heterogeneous collection. We then use raw-image patches extracted from the unlabeled images to learn a codebook. To model the spatial relationships between patches, the image is recursively partitioned horizontally and vertically, and a histogram of patch-codewords is computed in each partition. The resulting set of features give a high precision and recall for the retrieval of hand-drawn and machine-print table-documents, and unconstrained mixed form-type documents, when trained using a random forest classifier. We compare our method to the spatial-pyramid method, and show that the proposed approach for learning layout characteristics is competitive for document images.

Keywords :

document image processing; image classification; image retrieval; learning (artificial intelligence); statistics; codebook learning; document structure learning; hand-drawn-documents; heterogeneous collection; image classification; image retrieval; layout characteristics learning; machine-print table-documents; patch-codeword histogram; patch-codeword statistics; random forest classifier; raw-image patch extraction; spatial-pyramid method; unconstrained mixed form-type documents; unlabeled images; unwanted images; Accuracy; Computer vision; Feature extraction; Histograms; Layout; Radio frequency; Support vector machines;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Pattern Recognition (ICPR), 2012 21st International Conference on

Conference_Location :

Tsukuba

ISSN :

1051-4651

Print_ISBN :

978-1-4673-2216-4

Type :

conf

Filename :

6460441

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=595011