Title :
Thai handwriting legal amounts recognition
Author :
Chatwiriya, Watchara ; Klinkhachorn, P. ; Lass, Norman
Author_Institution :
Dept. of Comput. Eng., King Mongkut´´s Inst. of Technol., Bangkok, Thailand
Abstract :
A legal amount, or the word amount, is defined as the expression of a numerical value into words. It is used mostly in monetary documents. Thai legal amounts consist of a set of 17 words of 25 characters. This paper´s objective is to develop a system to recognize Thai handwriting legal amounts. The strategies to improve the character recognition rates at the preprocessing and recognition process are discussed including the division of input characters into smaller subgroups: the body and high characters group, using multiple feature combinations and using the classifiers that give answers as a list of possible answers with scores. And at the post processes, we improved overall recognition rate using prior knowledge such as lexicon and syntax. We proposed the concept of partial lexicon matching mainly relying on the recognition results of the body group characters. The syntax verification algorithms are based on the sequences of word types, the joining between word types, and special rules for specific words. Irregular handwriting, i.e., touching and broken characters, are detected. Also holistic word recognition is employed to recognize these characters without explicit character segmentation.
Keywords :
cheque processing; handwritten character recognition; image matching; Thai handwriting legal amounts recognition; body group; broken characters; character recognition rates; high characters group; holistic word recognition; input characters division; irregular handwriting detection; monetary documents; multiple feature combinations; partial lexicon matching; preprocessing process; recognition process; subgroups; syntax verification algorithms; touching characters; word type sequences; word types; Banking; Character recognition; Data preprocessing; Handwriting recognition; Law; Legal factors; Optical character recognition software; Pathology; Speech; Writing;
Conference_Titel :
System Theory, 2003. Proceedings of the 35th Southeastern Symposium on
Print_ISBN :
0-7803-7697-8
DOI :
10.1109/SSST.2003.1194596