DocumentCode :
1994947
Title :
Methods, reports and survey for the comparison of diverse isolated character recognition results on the UNIPEN database
Author :
Ratzlaff, Eugene H.
Author_Institution :
IBM T.J. Watson Res. Center, Yorktown Heights, NY, USA
fYear :
2003
fDate :
3-6 Aug. 2003
Firstpage :
623
Abstract :
A framework of data organization methods and corresponding recognition results for UNIPEN databases is presented to enable the comparison of recognition results from different isolated character recognizers. A reproducible method for splitting the Train-R01/V07 data into an array of multi-writer and omni-writer training and testing pairs is proposed. Recognition results and uncertainties are provided for each pair, as well as results for the DevTest-R01/V02 character subsets, using an online scanning n-tuple recognizer. Several other published results are surveyed within this context. In sum, this report provides the reader multiple points of reference useful for comparing a number of published recognition results and a proposed framework that similarly allows private evaluation of unpublished recognition results.
Keywords :
character recognition; image classification; visual databases; DevTest-R01/V02 character subsets; Train-R01/V07 data; UNIPEN database; data organization methods; isolated character recognition; isolated character recognizers; multiwriter training; omni-writer training; online scanning n-tuple recognizer; Character recognition; Cleaning; Databases; Diversity methods; Sampling methods; Testing; Text analysis; Training data; Uncertainty;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2003. Proceedings. Seventh International Conference on
Print_ISBN :
0-7695-1960-1
Type :
conf
DOI :
10.1109/ICDAR.2003.1227737
Filename :
1227737
Link To Document :
بازگشت