Title :
Unsupervised learning technique for binarization of gray scale text images
Author :
Srivastava, S. ; Sanyal, S.
Author_Institution :
Dept. Human Comput. Interaction, Indian Inst. of Inf. Technol., Allahabad, India
Abstract :
This paper suggests an unsupervised learning technique for binarization of gray scale text images based on fusion of existing approaches. The existing binarization approaches have though resolved issues of degraded contrast, varied intensity, shadow, smudges, bleed through and smear to a certain extent, yet none of the existing algorithms are capable to work out all the issues independently. However, these have also led to issues like broken characters and merged characters. Some of the standard algorithms in this field have been chosen as the core methods to produce partial binarization results. The decision on these partial binarized results has been fused using three different proposed approaches. The benchmark of performance evaluation are those set by DIBCo. The standard dataset along with the ground truth have also been taken from the same. Our binarized results produced from the above methods have given better results in contrast to DIBCo-2011 and have also surpassed the failure cases of DIBCo 2011. The Results and Analysis section comprehends the conclusion drawn.
Keywords :
image fusion; unsupervised learning; DIBCo; binarization approaches; gray scale text images; image fusion; unsupervised learning technique; Character recognition; Image resolution; Noise; Optical character recognition software; Patents; Standards; Unsupervised learning; Binarization; Broken Characters; Merged Characters; Unsupervised Learning;
Conference_Titel :
India Conference (INDICON), 2014 Annual IEEE
Conference_Location :
Pune
Print_ISBN :
978-1-4799-5362-2
DOI :
10.1109/INDICON.2014.7030453