• DocumentCode
    3673820
  • Title

    Adaptive clustering algorithm for optical character recognition

  • Author

    Abubakar Muhammad Ashir;Gaddafi Sani Shehu

  • Author_Institution
    Department of Mechanical and Mechatronic Engineering, Mevlana University, Konya, Turkey
  • fYear
    2015
  • fDate
    6/1/2015 12:00:00 AM
  • Abstract
    In analytical optical character recognition, effects of noise and overlapping character blocks constitute a major problem to feature extraction algorithms. This problem degrades the performance of the recognition stage. In this approach, an adaptive clustering algorithm and Hamming Distance computation are proposed to aid the extraction and recognition processes. Initially a line is picked from the segmented lines of a template. The line matrix is projected unto a characters´ vector which is used to compute the average width of a character in that line. This information is passed to adaptive clustering analysis. This algorithm check for characters with abnormal width, compute the total binary large objects and return each object as a separate character. Hence any block of characters mistaken for a single character can be detected and re-segmented. The recognition stage adaptively switches from correlation computation to Hamming Distance whenever the former is doomed to fail or produce undesirable results. The approach works well and an appreciable performance improvement is recorded. 5 test images of different font types are used with total of 739 characters. 89.04% successful recognition rate was recorded.
  • Keywords
    "Optical character recognition software","Character recognition","Feature extraction","Correlation","Clustering algorithms","Databases","Hamming distance"
  • Publisher
    ieee
  • Conference_Titel
    Electronics, Computers and Artificial Intelligence (ECAI), 2015 7th International Conference on
  • Print_ISBN
    978-1-4673-6646-5
  • Type

    conf

  • DOI
    10.1109/ECAI.2015.7301192
  • Filename
    7301192