• DocumentCode
    1164742
  • Title

    Artificial neural networks for document analysis and recognition

  • Author

    Marinai, Simone ; Gori, Marco ; Soda, Giovanni

  • Author_Institution
    Dipt. di Sistemi e Informatica, Firenze Univ., Italy
  • Volume
    27
  • Issue
    1
  • fYear
    2005
  • Firstpage
    23
  • Lastpage
    35
  • Abstract
    Artificial neural networks have been extensively applied to document analysis and recognition. Most efforts have been devoted to the recognition of isolated handwritten and printed characters, with widely recognized successful results. However, many other document processing tasks, such as preprocessing, layout analysis, character segmentation, word recognition, and signature verification, have also been addressed effectively, with very promising results. This paper surveys the most significant problems in the area of offline document image processing where connectionist-based approaches have been applied. Similarities and differences between approaches belonging to different categories are discussed. Particular emphasis is given to the crucial role of prior knowledge in the conception of both appropriate architectures and learning algorithms. Finally, the paper provides a critical analysis of the reviewed approaches and depicts the most promising research guidelines in the field. In particular, a second generation of connectionist-based models is foreseen, based on appropriate graphical representations of the learning environment.
  • Keywords
    document image processing; handwriting recognition; handwritten character recognition; image segmentation; learning (artificial intelligence); recurrent neural nets; artificial neural networks; character recognition; character segmentation; connectionist based approach; document image analysis; document image recognition; document preprocessing; graphical representations; handwritten recognition; layout analysis; learning algorithms; offline document image processing; recurrent neural nets; signature verification; word recognition; Artificial neural networks; Character recognition; Face recognition; Handwriting recognition; Image analysis; Image recognition; Image segmentation; Neural networks; Optical character recognition software; Text analysis; Character segmentation; document image analysis and recognition; layout analysis; neural networks; preprocessing; recursive neural networks; word recognition; Algorithms; Artificial Intelligence; Automatic Data Processing; Computer Graphics; Documentation; Handwriting; Image Enhancement; Image Interpretation, Computer-Assisted; Information Storage and Retrieval; Neural Networks (Computer); Numerical Analysis, Computer-Assisted; Pattern Recognition, Automated; Reading; Reproducibility of Results; Sensitivity and Specificity; Signal Processing, Computer-Assisted; User-Computer Interface
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/TPAMI.2005.4
  • Filename
    1359749