DocumentCode :
1992756
Title :
Training on severely degraded text-line images
Author :
Sarkar, Prateek ; Baird, Henry S. ; Zhang, Xiaohu
Author_Institution :
Palo Alto Res. Center, CA, USA
fYear :
2003
fDate :
3-6 Aug. 2003
Firstpage :
38
Abstract :
We show that document image decoding (DID) supervised training algorithms, as a result of recent refinements, achieve high accuracy with low manual effort even under conditions of severe image degradation in both training and test data. We describe improvements in DID training of character template, set-width, and channel (noise) models. Large-scale experimental trials, using synthetically degraded images of text, have established two new and practically important advantages of DID algorithms: 1) high accuracy (> 99% characters correct) in decoding using models trained on even severely degraded images from the same distribution; and 2) greatly improved accuracy (< 1/10 the error rate) across a wide range of image degradations compared to untrained (idealized) models. This ability to train reliably on low-quality images that suffer from massive fragmentation and merging of characters, without the need for manual segmentation and labeling of character images, significantly reduces the manual effort of DID training.
Keywords :
decoding; document image processing; image coding; learning (artificial intelligence); optical character recognition; channel model; character image labeling; character merging; character segmentation; character template model; document image decoding; low-quality images; manual segmentation; massive fragmentation; set-width model; severely degraded text-line images; supervised training algorithms; synthetically degraded text images; test data; training data; Decoding; Degradation; Error analysis; Image recognition; Image segmentation; Labeling; Large-scale systems; Merging; Robustness; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2003. Proceedings. Seventh International Conference on
Print_ISBN :
0-7695-1960-1
Type :
conf
DOI :
10.1109/ICDAR.2003.1227624
Filename :
1227624
Link To Document :
بازگشت