DocumentCode :
1242084
Title :
Validation of image defect models for optical character recognition
Author :
Li, Yanhong ; Lopresti, Daniel ; Nagy, George ; Tomkins, Andrew
Author_Institution :
GARI Software, Livingston, NJ, USA
Volume :
18
Issue :
2
fYear :
1996
fDate :
2/1/1996
Firstpage :
99
Lastpage :
107
Abstract :
Considers the problem of evaluating character image generators that model distortions encountered in optical character recognition (OCR). While a number of such defect models have been proposed, the contention that they produce the desired result is typically argued in an ad hoc and informal way. The authors introduce a rigorous and more pragmatic definition of when a model is accurate: they say a defect model is validated if the OCR errors induced by the model are indistinguishable from the errors encountered when using real scanned documents. The authors describe four measures to quantify this similarity, and compare and contrast them using over ten million scanned and synthesized characters in three fonts. The measures differentiate effectively between different fonts and different scans of the same font regardless of the underlying text.
Keywords :
character sets; document image processing; optical character recognition; character image generators; distortions; fonts; image defect models; Character generation; Character recognition; Degradation; Error analysis; Facsimile; Image generation; Optical character recognition software; Optical distortion; Predictive models; Prototypes
fLanguage :
English
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
Publisher :
IEEE
ISSN :
0162-8828
Type :
jour
DOI :
10.1109/34.481536
Filename :
481536