• DocumentCode
    2174213
  • Title

    A fast recognition system for isolated Arabic characters

  • Author

    Cowell, John; Hussain, Dr Fiaz

  • Author_Institution
    Dept. of Comput. Sci., De Montfort Univ., Leicester, UK
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    650
  • Lastpage
    654
  • Abstract
    This paper presents a very fast multi-stage algorithm for the recognition of non-Latin script. Although the examples use Arabic script, the system could be adapted in minutes to deal with any character set, in particular non-Latin character sets for which no commercial OCR systems are available. The approach normalises isolated characters for size, extracts an image signature based on the number of black pixels in each row and column of the character, and compares these values with a set of signatures for typical characters of the set. This technique identifies not only the closest match but also the closeness of match to all other characters in the set, which is expressed in a triangular confusion matrix.
  • Keywords
    character recognition; pattern matching; Arabic script; confusion matrix; fast multistage algorithm; fonts; image signatures; isolated Arabic characters; normalisation; Computer science; Data mining; Feature extraction; Information systems; Optical character recognition software; Pattern recognition; Pixel; Robustness; Text recognition
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Information Visualisation, 2002. Proceedings. Sixth International Conference on
  • ISSN
    1093-9547
  • Print_ISBN
    0-7695-1656-4
  • Type

    conf

  • DOI
    10.1109/IV.2002.1028844
  • Filename
    1028844
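
The signature matching described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not specify the resampling method, the grid size, or the distance measure, so the nearest-neighbour resampling, the `size` parameter, and the sum of absolute differences used here are assumptions, as are all function names. The multi-stage aspect of the published algorithm is not modelled.

```python
def normalise(img, size=16):
    """Crop a binary image (rows of 0/1, 1 = black) to the bounding box of
    its black pixels and resample it to size x size.
    Nearest-neighbour resampling is an assumption of this sketch."""
    rows = [r for r, row in enumerate(img) if any(row)]
    cols = [c for c in range(len(img[0])) if any(row[c] for row in img)]
    if not rows:  # blank image: nothing to crop
        return [[0] * size for _ in range(size)]
    top, left = rows[0], cols[0]
    h = rows[-1] - top + 1
    w = cols[-1] - left + 1
    return [[img[top + (y * h) // size][left + (x * w) // size]
             for x in range(size)]
            for y in range(size)]


def signature(img):
    """Black-pixel count of every row, followed by that of every column."""
    row_counts = [sum(row) for row in img]
    col_counts = [sum(col) for col in zip(*img)]
    return row_counts + col_counts


def distance(sig_a, sig_b):
    """Sum of absolute differences (assumed measure; lower means closer)."""
    return sum(abs(a - b) for a, b in zip(sig_a, sig_b))


def rank_matches(img, references, size=16):
    """Return (distance, label) pairs for every reference character,
    closest first. `references` maps a label to a stored signature."""
    sig = signature(normalise(img, size))
    return sorted((distance(sig, ref), label) for label, ref in references.items())


def triangular_confusion(references):
    """Pairwise distances between reference signatures, one entry per
    unordered pair, i.e. the lower triangle of a confusion matrix."""
    labels = sorted(references)
    return {(a, b): distance(references[a], references[b])
            for i, a in enumerate(labels)
            for b in labels[:i]}
```

Because `rank_matches` returns a score for every reference character rather than only the best one, the caller can see how close the runner-up candidates are. Adapting the sketch to another character set would only require a different `references` dictionary.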