Title :
Scene Character Reconstruction through Medial Axis
Author :
Shangxuan Tian ; Shivakumara, Palaiahnakote ; Trung Quy Phan ; Chew Lim Tan
Author_Institution :
Dept. of Comput. Sci., Nat. Univ. of Singapore, Singapore, Singapore
Abstract :
Character shape reconstruction for the scene character is challenging and interesting because scene character usually suffers from uneven illumination, complex background, perspective distortion. To address such ill conditions, we propose to utilize Histogram Gradient Division (HGD) and Reverse Gradient Orientation (RGO) to select Candidate Text Pixels (CTPs) for a given input character. Ring Radius Transform is applied on each pixel in a CTP image to obtain radius map where each pixel is assigned a value which is the radius to the nearest CTP. Candidate medial axis pixels are those having maximum radius values in their neighborhoods. We find such pixels on horizontal, vertical, principal diagonal and secondary diagonal directions to determine the respective medial axis pixels. The union of all medial axis pixels at each pixel location is considered as a candidate medial axis pixel of the character. Then color difference and k-means clustering are employed to eliminate false candidate medial axis. The potential medial axis values are used to reconstruct the shape of the character. The method is tested on 1025 characters of complex foreground and background from ICDAR 2003 dataset in terms of shape reconstruction and recognition rate. Experimental results demonstrate the effectiveness of our proposed method for complex foreground and background characters in terms of character recognition rate and reconstruction error.
Keywords :
character recognition; image colour analysis; image recognition; image reconstruction; image resolution; lighting; pattern clustering; shape recognition; transforms; visual databases; CTP image; HGD; ICDAR 2003 dataset; RGO; candidate medial axis pixels; candidate text pixels; character recognition rate; character shape reconstruction; color difference; complex background characters; complex foreground characters; false candidate medial axis elimination; histogram gradient division; illumination; k-means clustering; pixel location; radius map; reverse gradient orientation; ring radius transform; scene character reconstruction; shape recognition rate; Character recognition; Histograms; Image color analysis; Image edge detection; Image reconstruction; Optical character recognition software; Shape;
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
DOI :
10.1109/ICDAR.2013.275