DocumentCode :
3485989
Title :
Ground-Truth Estimation in Multispectral Representation Space: Application to Degraded Document Image Binarization
Author :
Hedjam, Rachid ; Cheriet, Mohamed
Author_Institution :
Synchromedia Lab. for Multemedia Commun. in Telepresence, Montreal, QC, Canada
fYear :
2013
fDate :
25-28 Aug. 2013
Firstpage :
190
Lastpage :
194
Abstract :
Human ground-truthing is the manual labelling of samples (pixels for example) to generate reference data without any automatic algorithm help. Although a manual ground-truth is more accurate than a machine ground-truth, it still suffers from mislabeling and/or judgement errors. In this paper we propose a new method of ground-truth estimation using multispectral (MS) imaging representation space for the sake of document image binarization. Starting from the initial manual ground-truth, the proposed classification method aims to select automatically some samples with correct labels (well-labeled pixels) from each class for the training phase, then reassign new labels to the document image pixels. The classification scheme is based on the cooperation of multiple classifiers under some constraints. A real data set of MS historical document images and their ground-truth is created to demonstrate the effectiveness of the proposed method of ground-truth estimation.
Keywords :
document image processing; image classification; image representation; image resolution; MS historical document images; MS imaging representation space; degraded document image binarization; document image pixels; ground-truth estimation; human ground-truthing; image classification method; multispectral imaging representation space; Estimation; Image color analysis; Imaging; Labeling; Manuals; Text analysis; Training; Document image analysis; Document image binarization; Ground-truth estimation; Historical document images; Multispectral document imaging;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
ISSN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2013.45
Filename :
6628610
Link To Document :
بازگشت