• DocumentCode
    3540067
  • Title

    Document zone content classification for technical document images using Artificial Neural Networks and Support Vector Machines

  • Author

    Ibrahim, Zaidah ; Isa, Dino ; Rajkumar, Rajprasad ; Kendall, Graham

  • Author_Institution
    Fac. of Comp. & Math. Sci., Univ. Technol. MARA, Shah Alam, Malaysia
  • fYear
    2009
  • fDate
    4-6 Aug. 2009
  • Firstpage
    345
  • Lastpage
    350
  • Abstract
    Artificial Neural Networks (ANN) are a classic pattern classifier and widely applicable to various problems and are relatively easy to use. Three of the most popular ANNs are Multilayer Perceptron (MLP) with Backpropagation learning algorithm, Self Organizing Map (SOM) and Recurrent Neural Network (RNN). Support Vector Machines (SVM) have gained great interest in the last few years in pattern recognition. Thus, this research compares the recognition performance of text and non-text images (text, table, figure and graph) from technical document images based on the pixel intensity of various zones between BPNN, SOM, RNN and SVM. Symmetrical and non-symmetrical zoning algorithms were compared as input. 400 different datasets have been tested and the experiments indicate that SVM classification is superior to the other three classifiers. The experiments also indicate that the combination of symmetrical and non-symmetrical zoning design is better than non-symmetrical or symmetrical zoning only.
  • Keywords
    document image processing; image classification; multilayer perceptrons; support vector machines; artificial neural network; document zone content classification; multilayer perceptron; support vector machine; technical document image; Artificial neural networks; Backpropagation algorithms; Image recognition; Multilayer perceptrons; Organizing; Pattern recognition; Recurrent neural networks; Support vector machine classification; Support vector machines; Text recognition; Backpropagation Neural Network; Non-text Classification; Recurrent Neural Network; Self Organizing Map; Support Vector Machine;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Applications of Digital Information and Web Technologies, 2009. ICADIWT '09. Second International Conference on the
  • Conference_Location
    London
  • Print_ISBN
    978-1-4244-4456-4
  • Electronic_ISBN
    978-1-4244-4457-1
  • Type

    conf

  • DOI
    10.1109/ICADIWT.2009.5273957
  • Filename
    5273957