Title :
A Novel Short Merged Off-line Handwritten Chinese Character String Segmentation Algorithm Using Hidden Markov Model
Author :
Jiang, Zhiwei ; Ding, Xiaoqing ; Liu, Changsong ; Wang, Yanwei
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
Abstract :
The hidden Markov model (HMM) is widely used to segment sequential data in speech recognition and DNA sequence analysis. By the same principle, it can also be used to segment short merged off-line handwritten Chinese character strings, a difficult problem that arises often in practice. Because HMMs are not yet a common method in this field, this paper introduces a novel HMM-based algorithm for this segmentation problem. The algorithm achieves performance adequate for practical use even when the 3755 character classes are merged into similar-character classes numbering only 1% of the original, and it also shows great potential for segmenting long text lines.
Keywords :
handwritten character recognition; hidden Markov models; image segmentation; optical character recognition; DNA sequence analysis; hidden Markov model; merged offline handwritten Chinese character string segmentation; sequential data; speech recognition; Algorithm design and analysis; Character recognition; Decoding; Handwriting recognition; Hidden Markov models; Merging; Training; HMM; merged handwritten Chinese characters; merging similar characters; string segmentation;
Conference_Title :
Document Analysis and Recognition (ICDAR), 2011 International Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4577-1350-7
Electronic_ISBN :
1520-5363
DOI :
10.1109/ICDAR.2011.140