Title :
Word segmentation in handwritten Chinese text image based on component clustering techniques
Author :
Chen, Qingshan ; Zhen, Lixin
Author_Institution :
Shanghai Jiao Tong Univ., China
Abstract :
Segmentation of handwritten Chinese input into individual character is a crucial step in many connected handwriting recognition systems. In this paper, a new method is addressed to segment off-line handwritten Chinese text images. We first adopt the HMM method to produce the segmentation paths and apply two rules to reduce the redundant paths, then the left candidate paths dissect the text line into radicals or pseudo-radicals-components. In the second stage, we propose three new criteria -aspect ratio, gap ratio, longer edge criteria - to calculate the clustering cost matrix and use a dynamic programming technique to produce the optimal clustering scheme. A series of experiments show that our method is very effective for the word segmentation of the offline handwritten Chinese text image.
Keywords :
dynamic programming; handwritten character recognition; hidden Markov models; component clustering techniques; dynamic programming; handwritten Chinese text image segmentation; hidden Markov method; optimal clustering; word segmentation; Character recognition; Cost function; Dynamic programming; Handwriting recognition; Hidden Markov models; Image segmentation; Merging; Pattern recognition; Postal services; Writing;
Conference_Titel :
TENCON '02. Proceedings. 2002 IEEE Region 10 Conference on Computers, Communications, Control and Power Engineering
Print_ISBN :
0-7803-7490-8
DOI :
10.1109/TENCON.2002.1181307