Title :
Text-Line Extraction in Handwritten Chinese Documents Based on an Energy Minimization Framework
Author :
Koo, Hyung Il ; Cho, Nam Ik
Author_Institution :
Qualcomm R&D Center, Seoul, South Korea
fDate :
3/1/2012 12:00:00 AM
Abstract :
Text-line extraction in unconstrained handwritten documents remains a challenging problem due to nonuniform character scale, spatially varying text orientation, and the interference between text lines. In order to address these problems, we propose a new cost function that considers the interactions between text lines and the curvilinearity of each text line. Precisely, we achieve this goal by introducing normalized measures for them, which are based on an estimated line spacing. We also present an optimization method that exploits the properties of our cost function. Experimental results on a database consisting of 853 handwritten Chinese document images have shown that our method achieves a detection rate of 99.52% and an error rate of 0.32%, which outperforms conventional methods.
Keywords :
feature extraction; handwritten character recognition; natural language processing; text detection; energy minimization framework; handwritten Chinese document images; line spacing; nonuniform character scale; text lines; text orientation; text-line extraction; Cost function; Data mining; Image color analysis; Materials; Minimization; Robustness; State estimation; Handwritten Chinese document; state estimation in document images; text-line extraction;
Journal_Title :
Image Processing, IEEE Transactions on
DOI :
10.1109/TIP.2011.2166972