Title :
Bleed-Through Removal by Learning a Discriminative Color Channel
Author :
Villegas, Mauricio ; Toselli, Alejandro Hector
Author_Institution :
PRHLT, Univ. Politec. de Valencia, Valencia, Spain
Abstract :
This paper proposes a novel bleed-through removal technique based on learning a color channel that is optimized so that the foreground text is enhanced while at the same time the variability of the background (including the bleed-through) is diminished. The technique is intended to be part of an interactive transcription system in which the objective is obtaining high quality transcriptions with the least human effort. Thus, instead of training the bleed-through removal to work in general for any document, the technique requires a user to label regions both as foreground text and as bleed-through, with the aim that the method is adapted to the characteristics of each document. The proposal is assessed using the handwritten recognition performance on a real 17th century manuscript.
Keywords :
document image processing; handwriting recognition; image colour analysis; interactive systems; learning (artificial intelligence); text detection; bleed-through removal; discriminative color channel; document; foreground text; handwritten recognition performance; interactive transcription system; learning; Gray-scale; Hidden Markov models; Histograms; Image color analysis; Ink; Text recognition; Vectors; Bleed-through; Handwritten Text Recognition; Scanned Document Noise Removal; Show-through;
Conference_Titel :
Frontiers in Handwriting Recognition (ICFHR), 2014 14th International Conference on
Conference_Location :
Heraklion
Print_ISBN :
978-1-4799-4335-7
DOI :
10.1109/ICFHR.2014.16