• DocumentCode
    1634635
  • Title
    A Tool for Ground-Truthing Text Lines and Characters in Off-Line Handwritten Chinese Documents
  • Author
    Yin, Fei ; Wang, Qiu-Feng ; Liu, Cheng-Lin
  • Author_Institution
    Nat. Lab. of Pattern Recognition (NLPR), Chinese Acad. of Sci., Beijing, China
  • fYear
    2009
  • Firstpage
    951
  • Lastpage
    955
  • Abstract
    Annotating the regions, text lines and characters of document images is an important but tedious and expensive task. A ground-truthing tool can substantially reduce the human burden in this process. This paper describes GTLC, an automated recognition-based tool for finding the best alignment between the text transcript and the connected components of an unconstrained handwritten document image. The alignment process is formulated as an optimization problem involving candidate character segmentation and recognition. We have validated the effectiveness of this tool and have used it for annotating a large number of handwritten Chinese documents.
  • Keywords
    document image processing; handwritten character recognition; image segmentation; natural languages; optimisation; text analysis; GTLC; alignment process; automated recognition-based tool; character recognition; character segmentation; ground-truthing text line tool; handwritten document image; offline handwritten Chinese document; optimization problem; Algorithm design and analysis; Carbon capture and storage; Character recognition; Handwriting recognition; Hidden Markov models; Image segmentation; Pattern recognition; Shape; Text analysis; Text recognition; Annotation; Document image; Ground-truthing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Document Analysis and Recognition, 2009. ICDAR '09. 10th International Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4244-4500-4
  • Electronic_ISBN
    1520-5363
  • Type
    conf
  • DOI
    10.1109/ICDAR.2009.93
  • Filename
    5277560