DocumentCode :
3488860
Title :
Graphics Extraction from Heterogeneous Online Documents with Hierarchical Random Fields
Author :
Delaye, Adrien ; Cheng-Lin Liu
Author_Institution :
Nat. Lab. of Pattern Recognition, Inst. of Autom., Beijing, China
fYear :
2013
fDate :
25-28 Aug. 2013
Firstpage :
1007
Lastpage :
1011
Abstract :
Graphical objects are important elements of freely handwritten notes but their segmentation from the document is challenging due to their irregular properties. This paper introduces an original solution for automatically segmenting diagrams and drawings from unstructured online documents. We propose a multi-scale representation of the document modeled as a hierarchical Conditional Random Field to predict the detection of graphical elements at the stroke level. An experimental evaluation with realistic documents highlights the benefit of the hierarchical model in comparison with a flat Conditional Random Field and demonstrates the robustness of our system.
Keywords :
document image processing; feature extraction; image representation; image segmentation; diagram segmentation; document multiscale representation; document segmentation; drawing segmentation; flat conditional random field; graphical element detection; graphical objects; graphics extraction; handwritten notes; heterogeneous online documents; hierarchical conditional random field; unstructured online documents; Conferences; Feature extraction; Graphics; Handwriting recognition; Mathematical model; Text analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2013 12th International Conference on
Conference_Location :
Washington, DC
ISSN :
1520-5363
Type :
conf
DOI :
10.1109/ICDAR.2013.202
Filename :
6628767
Link To Document :
بازگشت