Title :
Character Recognition Using Conditional Random Field Based Recognition Engine
Author :
Ray, Avik ; Chandawala, Ankit ; Chaudhury, Santanu
Author_Institution :
Dept. of Electr. Eng., IIT Delhi, New Delhi, India
Abstract :
The paper presents a novel script independent CRF based inferencing framework for character recognition. In this framework we consider a word as a sequence of connected components. The connected components are obtained using different binarization schemes and different possible sequences are considered using a tree structure. CRF uses contextual information to learn perfect primitive sequences and finds the most probable labeling of the sequence of primitives using multiple hypothesis tree to form the correct sequence of alphabets. This approach is particularly suitable for degraded printed document images as it considers multiple alternate hypotheses for correct decision.
Keywords :
document image processing; inference mechanisms; optical character recognition; probability; trees (mathematics); CRF based inferencing framework; binarization schemes; character recognition; conditional random field based recognition engine; connected components sequence; degraded printed document images; tree structure; Accuracy; Character recognition; Engines; Hidden Markov models; Labeling; Optical character recognition software; Training;
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
DOI :
10.1109/ICDAR.2013.13