DocumentCode :
442750
Title :
Learning to binarize document images using a decision cascade
Author :
Chou, Chien-Hsing ; Huang, Chih-Ching ; Lin, Wen-Hsiung ; Chang, Fu
Author_Institution :
Inst. of Information Sci., Academia Sinica, Taipei, Taiwan
Volume :
2
fYear :
2005
fDate :
11-14 Sept. 2005
Abstract :
In this article, we propose a special type of decision tree, called a decision cascade, for binarizing document images. Such images are produced by cameras, resulting in varying degrees of brightness over different parts of the images. Our method decides what action to take on each part of the input image in order to obtain a satisfactory binary result. The advantage of this approach lies in its ability to learn decision rules from training data that may be labeled with multiple identities. Tests on images produced under improperly illuminated conditions show that our method yields much better visual quality and OCR performance than using a global threshold for binarization.
Keywords :
cameras; decision trees; document image processing; cameras; decision cascade; decision rule learning; decision tree; document image binarization; Brightness; Cameras; Data mining; Decision trees; Gray-scale; Information science; Optical character recognition software; Optical scattering; Testing; Training data;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Image Processing, 2005. ICIP 2005. IEEE International Conference on
Print_ISBN :
0-7803-9134-9
Type :
conf
DOI :
10.1109/ICIP.2005.1530106
Filename :
1530106
Link To Document :
بازگشت