Title :
Utilization of hierarchical, stochastic relationship modeling for Hangul character recognition
Author :
Kang, Kyung-Won ; Kim, Jin H.
Author_Institution :
Dept. of Comput. Sci., KAIST, Daejon, South Korea
Abstract :
In structural character recognition, a character is usually viewed as a set of strokes and the spatial relationships between them. Therefore, strokes and their relationships should be properly modeled for effective character representation. For this purpose, we propose a modeling scheme by which strokes as well as relationships are stochastically represented by utilizing the hierarchical characteristics of target characters. A character is defined by a multivariate random variable over the components and its probability distribution is learned from a training data set. To overcome difficulties of the learning due to the high order of the probability distribution (a problem of curse of dimensionality), the probability distribution is factorized and approximated by a set of lower-order probability distributions by applying the idea of relationship decomposition recursively to components and subcomponents. Based on the proposed method, a handwritten Hangul (Korean) character recognition system is developed. Recognition experiments conducted on a public database show the effectiveness of the proposed relationship modeling. The recognition accuracy increased by 5.5 percent in comparison to the most successful system ever reported.
Keywords :
handwritten character recognition; image matching; image segmentation; probability; stochastic processes; Korean character recognition; character representation; handwritten Hangul character recognition; hierarchical characteristics; image segmentation; multivariate random variable; probability distribution; stochastic relationship modeling; strokes relationships modeling; structural character recognition; target characters; training data set; Character recognition; Databases; Handwriting recognition; Noise robustness; Probability distribution; Random variables; Statistical analysis; Stochastic processes; Training data; Writing; Hangul character recognition.; Index Terms- Pattern recognition; handwritten character recognition; hierarchical character representation; stochastic relationship modeling; Algorithms; Artificial Intelligence; Automatic Data Processing; Documentation; Handwriting; Image Enhancement; Image Interpretation, Computer-Assisted; Information Storage and Retrieval; Korea; Models, Statistical; Pattern Recognition, Automated; Reproducibility of Results; Sensitivity and Specificity; Stochastic Processes;
Journal_Title :
Pattern Analysis and Machine Intelligence, IEEE Transactions on
DOI :
10.1109/TPAMI.2004.74