DocumentCode
1578446
Title
Automatic Thai and English fonts identification without character recognition
Author
Kruatrachue, Boontee ; Piyatrakul, Pongsakorn
Author_Institution
Dept. of Comput. Eng., King Mongkut's Inst. of Technol., Bangkok, Thailand
Volume
2
fYear
2001
fDate
2001
Firstpage
603
Abstract
This paper describes a simple and fast algorithm for detecting Thai and English characters in a document without performing actual character recognition. The document is segmented into strings of letters separated by blanks, and each string is then identified using character features and their writing positions. The method achieves 100% accuracy if the characters have a clear head feature. If this feature is not used, 90% of the strings can still be identified. This identification provides more information about the character set, so that OCR can recognize text faster and with better accuracy.
Keywords
character sets; optical character recognition; English fonts identification; Thai fonts identification; characters features; optical font recognition; writing positions; Character recognition; Head; Information technology; Natural languages; Optical character recognition software; Writing
fLanguage
English
Publisher
IEEE
Conference_Titel
Communications, Computers and Signal Processing, 2001. PACRIM. 2001 IEEE Pacific Rim Conference on
Conference_Location
Victoria, BC
Print_ISBN
0-7803-7080-5
Type
conf
DOI
10.1109/PACRIM.2001.953705
Filename
953705