Title :
Structure based modern Korean character set partitioning and pre-classification method of korean character recognition
Author :
Mingji Piao ; Sejin Kim ; Rongyi Cui
Author_Institution :
Dept. of Comput. Sci. & Technol., Yanbian Univ., Yanji, China
Abstract :
With respect to practical Korean documents, structure characteristics and structure classification of Korean character and a new pre-classification method of character recognition were studied in this paper. Firstly, the concept of character structure distance and its simple calculation method were proposed to describe the difference between structures, according to the unique linearizability of each character and the occurrence of basic graphemes. Furthermore, based on ID3 algorithm and the information gain of graphemes, the decision tree of structure classification was constructed. Finally, a decision tree based pre-classification algorithm was designed for printed character recognition. The experimental results show that basic graphemes v2, c3, v3 and v1 or c4 possess high contribution for character pre-classification, and the proposed method has reliable theoretical foundation and effective classification performance.
Keywords :
character recognition; decision trees; document image processing; image classification; ID3 algorithm; Korean character recognition; Korean documents; basic graphemes; character structure distance; decision tree; information gain; preclassification method; printed character recognition; structure based modern Korean character set partitioning; structure characteristics; structure classification; Character recognition; Educational institutions; Reliability; Korean character; character pre-classification; character structure classification; decision tree; information gain; structure distance;
Conference_Titel :
Computer Science and Information Processing (CSIP), 2012 International Conference on
Conference_Location :
Xi´an, Shaanxi
Print_ISBN :
978-1-4673-1410-7
DOI :
10.1109/CSIP.2012.6309086