DocumentCode :
1578446
Title :
Automatic Thai and English fonts identification without character recognition
Author :
Kruatrachue, Boontee ; Piyatrakul, Pongsakorn
Author_Institution :
Dept. of Comput. Eng., King Mongkut's Inst. of Technol., Bangkok, Thailand
Volume :
2
fYear :
2001
fDate :
2001
Firstpage :
603
Abstract :
This paper describes a simple and fast algorithm for detecting Thai and English characters in a document without performing actual character recognition. The document is segmented into strings of letters separated by blanks, and each string is then identified using character features and their writing positions. The method achieves 100% accuracy if the characters have a clear head feature; if this feature is not used, 90% of the strings can still be identified. This identification provides more information about the character set, so that OCR can recognize text faster and with better accuracy.
Keywords :
character sets; optical character recognition; English fonts identification; Thai fonts identification; characters features; optical font recognition; writing positions; Character recognition; Head; Information technology; Natural languages; Optical character recognition software; Writing;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Communications, Computers and Signal Processing, 2001. PACRIM. 2001 IEEE Pacific Rim Conference on
Conference_Location :
Victoria, BC
Print_ISBN :
0-7803-7080-5
Type :
conf
DOI :
10.1109/PACRIM.2001.953705
Filename :
953705