Title :
Graphics Extraction from Heterogeneous Online Documents with Hierarchical Random Fields
Author :
Delaye, Adrien ; Cheng-Lin Liu
Author_Institution :
Nat. Lab. of Pattern Recognition, Inst. of Autom., Beijing, China
Abstract :
Graphical objects are important elements of freely handwritten notes but their segmentation from the document is challenging due to their irregular properties. This paper introduces an original solution for automatically segmenting diagrams and drawings from unstructured online documents. We propose a multi-scale representation of the document modeled as a hierarchical Conditional Random Field to predict the detection of graphical elements at the stroke level. An experimental evaluation with realistic documents highlights the benefit of the hierarchical model in comparison with a flat Conditional Random Field and demonstrates the robustness of our system.
Keywords :
document image processing; feature extraction; image representation; image segmentation; diagram segmentation; document multiscale representation; document segmentation; drawing segmentation; flat conditional random field; graphical element detection; graphical objects; graphics extraction; handwritten notes; heterogeneous online documents; hierarchical conditional random field; unstructured online documents; Conferences; Feature extraction; Graphics; Handwriting recognition; Mathematical model; Text analysis;
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
DOI :
10.1109/ICDAR.2013.202