DocumentCode
178477
Title
Convolutional Neural Networks for Document Image Classification
Author
Le Kang ; Kumar, Jayant ; Peng Ye ; Yi Li ; Doermann, David
Author_Institution
Univ. of Maryland, College Park, MD, USA
fYear
2014
fDate
24-28 Aug. 2014
Firstpage
3168
Lastpage
3172
Abstract
This paper presents a Convolutional Neural Network (CNN) for document image classification. In particular, document image classes are defined by the structural similarity. Previous approaches rely on hand-crafted features for capturing structural information. In contrast, we propose to learn features from raw image pixels using CNN. The use of CNN is motivated by the the hierarchical nature of document layout. Equipped with rectified linear units and trained with dropout, our CNN performs well even when document layouts present large inner-class variations. Experiments on public challenging datasets demonstrate the effectiveness of the proposed approach.
Keywords
document image processing; feature extraction; image classification; neural nets; CNN; convolutional neural networks; document image classification; document layout; hand-crafted features; inner-class variations; public challenging datasets; raw image pixels; rectified linear units; structural similarity; Accuracy; Computer vision; Kernel; Layout; NIST; Pattern recognition; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition (ICPR), 2014 22nd International Conference on
Conference_Location
Stockholm
ISSN
1051-4651
Type
conf
DOI
10.1109/ICPR.2014.546
Filename
6977258
Link To Document