• DocumentCode
    178477
  • Title

    Convolutional Neural Networks for Document Image Classification

  • Author

    Le Kang ; Kumar, Jayant ; Peng Ye ; Yi Li ; Doermann, David

  • Author_Institution
    Univ. of Maryland, College Park, MD, USA
  • fYear
    2014
  • fDate
    24-28 Aug. 2014
  • Firstpage
    3168
  • Lastpage
    3172
  • Abstract
    This paper presents a Convolutional Neural Network (CNN) for document image classification. In particular, document image classes are defined by the structural similarity. Previous approaches rely on hand-crafted features for capturing structural information. In contrast, we propose to learn features from raw image pixels using CNN. The use of CNN is motivated by the the hierarchical nature of document layout. Equipped with rectified linear units and trained with dropout, our CNN performs well even when document layouts present large inner-class variations. Experiments on public challenging datasets demonstrate the effectiveness of the proposed approach.
  • Keywords
    document image processing; feature extraction; image classification; neural nets; CNN; convolutional neural networks; document image classification; document layout; hand-crafted features; inner-class variations; public challenging datasets; raw image pixels; rectified linear units; structural similarity; Accuracy; Computer vision; Kernel; Layout; NIST; Pattern recognition; Training;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition (ICPR), 2014 22nd International Conference on
  • Conference_Location
    Stockholm
  • ISSN
    1051-4651
  • Type

    conf

  • DOI
    10.1109/ICPR.2014.546
  • Filename
    6977258