• DocumentCode
    1578446
  • Title

    Automatic Thai and English fonts identification without character recognition

  • Author

    Kruatrachue, Boontee ; Piyatrakul, Pongsakorn

  • Author_Institution
    Dept. of Comput. Eng., King Mongkut´´s Inst. of Technol., Bangkok, Thailand
  • Volume
    2
  • fYear
    2001
  • fDate
    6/23/1905 12:00:00 AM
  • Firstpage
    603
  • Abstract
    This paper describes a simple and fast algorithm to detect Thai and English characters in a document without doing actual characters recognition. The document is segmented into strings of letters separated by a blank, then each string is identified using characters features and their writing positions. This method achieves 100% accuracy if the characters have clear head feature. But if this feature is not used 90% of the strings still can be identified. This identification provides more information about the character set so that OCR can recognize faster with better accuracy
  • Keywords
    character sets; optical character recognition; English fonts identification; Thai fonts identification; characters features; optical font recognition; writing positions; Character recognition; Head; Information technology; Natural languages; Optical character recognition software; Writing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications, Computers and signal Processing, 2001. PACRIM. 2001 IEEE Pacific Rim Conference on
  • Conference_Location
    Victoria, BC
  • Print_ISBN
    0-7803-7080-5
  • Type

    conf

  • DOI
    10.1109/PACRIM.2001.953705
  • Filename
    953705