DocumentCode
153404
Title
Graph Model Optimization Based Historical Chinese Character Segmentation Method
Author
Jingning Ji ; Liangrui Peng ; Bohan Li
Author_Institution
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
fYear
2014
fDate
7-10 April 2014
Firstpage
282
Lastpage
286
Abstract
Historical Chinese document recognition technology is important for digital library. However, historical Chinese character segmentation remains a difficult problem due to the complex structure of Chinese characters and various writing styles. This paper presents a novel method for historical Chinese character segmentation based on graph model. After a preliminary over-segmentation stage, the system applies a merging process. The candidate segmentation positions are denoted by the nodes of a graph, and the merging process is regarded as selecting an optimal path of the graph. The weight of edge in the graph is calculated by the cost function which considers geometric features and recognition confidence. Experimental results show that the proposed method is effective with a detection rate of 94.6% and an accuracy rate of 96.1% on a test set of practical historical Chinese document samples.
Keywords
character recognition; character sets; digital libraries; document image processing; graph theory; image segmentation; merging; cost function; digital library; geometric features; graph model optimization; historical Chinese character segmentation method; historical Chinese document recognition technology; merging process; Accuracy; Character recognition; Cost function; Image edge detection; Image segmentation; Merging; Text analysis; character segmentation; graph model; historical Chinese document;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis Systems (DAS), 2014 11th IAPR International Workshop on
Conference_Location
Tours
Print_ISBN
978-1-4799-3243-6
Type
conf
DOI
10.1109/DAS.2014.57
Filename
6831014
Link To Document