• DocumentCode
    153404
  • Title

    Graph Model Optimization Based Historical Chinese Character Segmentation Method

  • Author

    Jingning Ji ; Liangrui Peng ; Bohan Li

  • Author_Institution
    Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
  • fYear
    2014
  • fDate
    7-10 April 2014
  • Firstpage
    282
  • Lastpage
    286
  • Abstract
    Historical Chinese document recognition technology is important for digital library. However, historical Chinese character segmentation remains a difficult problem due to the complex structure of Chinese characters and various writing styles. This paper presents a novel method for historical Chinese character segmentation based on graph model. After a preliminary over-segmentation stage, the system applies a merging process. The candidate segmentation positions are denoted by the nodes of a graph, and the merging process is regarded as selecting an optimal path of the graph. The weight of edge in the graph is calculated by the cost function which considers geometric features and recognition confidence. Experimental results show that the proposed method is effective with a detection rate of 94.6% and an accuracy rate of 96.1% on a test set of practical historical Chinese document samples.
  • Keywords
    character recognition; character sets; digital libraries; document image processing; graph theory; image segmentation; merging; cost function; digital library; geometric features; graph model optimization; historical Chinese character segmentation method; historical Chinese document recognition technology; merging process; Accuracy; Character recognition; Cost function; Image edge detection; Image segmentation; Merging; Text analysis; character segmentation; graph model; historical Chinese document;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis Systems (DAS), 2014 11th IAPR International Workshop on
  • Conference_Location
    Tours
  • Print_ISBN
    978-1-4799-3243-6
  • Type

    conf

  • DOI
    10.1109/DAS.2014.57
  • Filename
    6831014