DocumentCode :
1313346
Title :
Text-Line Extraction in Handwritten Chinese Documents Based on an Energy Minimization Framework
Author :
Koo, Hyung Il ; Cho, Nam Ik
Author_Institution :
Qualcomm R&D Center, Seoul, South Korea
Volume :
21
Issue :
3
fYear :
2012
fDate :
3/1/2012 12:00:00 AM
Firstpage :
1169
Lastpage :
1175
Abstract :
Text-line extraction in unconstrained handwritten documents remains a challenging problem due to nonuniform character scale, spatially varying text orientation, and the interference between text lines. In order to address these problems, we propose a new cost function that considers the interactions between text lines and the curvilinearity of each text line. Precisely, we achieve this goal by introducing normalized measures for them, which are based on an estimated line spacing. We also present an optimization method that exploits the properties of our cost function. Experimental results on a database consisting of 853 handwritten Chinese document images have shown that our method achieves a detection rate of 99.52% and an error rate of 0.32%, which outperforms conventional methods.
Keywords :
feature extraction; handwritten character recognition; natural language processing; text detection; energy minimization framework; handwritten Chinese document images; line spacing; nonuniform character scale; text lines; text orientation; text-line extraction; Cost function; Data mining; Image color analysis; Materials; Minimization; Robustness; State estimation; Handwritten Chinese document; state estimation in document images; text-line extraction;
fLanguage :
English
Journal_Title :
Image Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1057-7149
Type :
jour
DOI :
10.1109/TIP.2011.2166972
Filename :
6008643
Link To Document :
بازگشت