• DocumentCode
    3490146
  • Title

    Scene Character Reconstruction through Medial Axis

  • Author

    Shangxuan Tian ; Shivakumara, Palaiahnakote ; Trung Quy Phan ; Chew Lim Tan

  • Author_Institution
    Dept. of Comput. Sci., Nat. Univ. of Singapore, Singapore, Singapore
  • fYear
    2013
  • fDate
    25-28 Aug. 2013
  • Firstpage
    1360
  • Lastpage
    1364
  • Abstract
    Character shape reconstruction for the scene character is challenging and interesting because scene character usually suffers from uneven illumination, complex background, perspective distortion. To address such ill conditions, we propose to utilize Histogram Gradient Division (HGD) and Reverse Gradient Orientation (RGO) to select Candidate Text Pixels (CTPs) for a given input character. Ring Radius Transform is applied on each pixel in a CTP image to obtain radius map where each pixel is assigned a value which is the radius to the nearest CTP. Candidate medial axis pixels are those having maximum radius values in their neighborhoods. We find such pixels on horizontal, vertical, principal diagonal and secondary diagonal directions to determine the respective medial axis pixels. The union of all medial axis pixels at each pixel location is considered as a candidate medial axis pixel of the character. Then color difference and k-means clustering are employed to eliminate false candidate medial axis. The potential medial axis values are used to reconstruct the shape of the character. The method is tested on 1025 characters of complex foreground and background from ICDAR 2003 dataset in terms of shape reconstruction and recognition rate. Experimental results demonstrate the effectiveness of our proposed method for complex foreground and background characters in terms of character recognition rate and reconstruction error.
  • Keywords
    character recognition; image colour analysis; image recognition; image reconstruction; image resolution; lighting; pattern clustering; shape recognition; transforms; visual databases; CTP image; HGD; ICDAR 2003 dataset; RGO; candidate medial axis pixels; candidate text pixels; character recognition rate; character shape reconstruction; color difference; complex background characters; complex foreground characters; false candidate medial axis elimination; histogram gradient division; illumination; k-means clustering; pixel location; radius map; reverse gradient orientation; ring radius transform; scene character reconstruction; shape recognition rate; Character recognition; Histograms; Image color analysis; Image edge detection; Image reconstruction; Optical character recognition software; Shape;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
  • Conference_Location
    Washington, DC
  • ISSN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2013.275
  • Filename
    6628836