• DocumentCode
    2483968
  • Title

    Recognition of books by verification and retraining

  • Author

    Neeba, N.V. ; Jawahar, C.V.

  • Author_Institution
    Centre for Visual Inf. Technol., Int. Inst. of Inf. Technol., Hyderabad
  • fYear
    2008
  • fDate
    8-11 Dec. 2008
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    The problem of character recognition in a book should be formulated significantly different from that of a single page or word. An ideal approach to design such a recognizer is to adapt the classifier to the font and style of the collection. In this paper, we propose an adaptation framework to recognize characters in a book with a learning framework. In the proposed system, the post processor verifies the output of the recognition module, which is further used for learning and thus to improve the performance over iteration. Experiments are conducted on about 500,000 annotated symbols from five books in Malayalam (an Indian language). We achieve an average improvement of 14% in classification accuracy.
  • Keywords
    document image processing; image classification; learning (artificial intelligence); optical character recognition; adaptation framework; book recognition; image classification; learning framework; optical character recognition; verification module; Books; Character recognition; Dictionaries; Image converters; Image recognition; Information technology; Natural languages; Optical character recognition software; Pattern recognition; Software libraries;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
  • Conference_Location
    Tampa, FL
  • ISSN
    1051-4651
  • Print_ISBN
    978-1-4244-2174-9
  • Electronic_ISBN
    1051-4651
  • Type

    conf

  • DOI
    10.1109/ICPR.2008.4761538
  • Filename
    4761538