DocumentCode
2483968
Title
Recognition of books by verification and retraining
Author
Neeba, N.V. ; Jawahar, C.V.
Author_Institution
Centre for Visual Inf. Technol., Int. Inst. of Inf. Technol., Hyderabad
fYear
2008
fDate
8-11 Dec. 2008
Firstpage
1
Lastpage
4
Abstract
The problem of character recognition in a book should be formulated significantly differently from that of a single page or word. An ideal approach to designing such a recognizer is to adapt the classifier to the font and style of the collection. In this paper, we propose a learning-based adaptation framework to recognize characters in a book. In the proposed system, the post-processor verifies the output of the recognition module, and the verified output is then used for learning, thereby improving performance over iterations. Experiments are conducted on about 500,000 annotated symbols from five books in Malayalam (an Indian language). We achieve an average improvement of 14% in classification accuracy.
Keywords
document image processing; image classification; learning (artificial intelligence); optical character recognition; adaptation framework; book recognition; learning framework; verification module; Books; Character recognition; Dictionaries; Image converters; Image recognition; Information technology; Natural languages; Optical character recognition software; Pattern recognition; Software libraries
fLanguage
English
Publisher
IEEE
Conference_Titel
Pattern Recognition, 2008. ICPR 2008. 19th International Conference on
Conference_Location
Tampa, FL
ISSN
1051-4651
Print_ISBN
978-1-4244-2174-9
Electronic_ISBN
1051-4651
Type
conf
DOI
10.1109/ICPR.2008.4761538
Filename
4761538