DocumentCode
2195913
Title
A New Hierarchical Handwritten Document Layout Extraction Based on Conditional Random Field Modeling
Author
Montreuil, Florent ; Nicolas, Stephane ; Grosicki, Emmanuéle ; Heutte, Laurent
Author_Institution
DGA, Centre d´´Expertise Parisien, Arcueil, France
fYear
2010
fDate
16-18 Nov. 2010
Firstpage
31
Lastpage
36
Abstract
In this study we describe a new approach to extract layout of unconstrained handwritten letters such as those sent by individuals to companies. The proposed model uses a hierarchical combination of Conditional Random Fields (CRFs) which gives access to various levels of the layout interpretation. The analysis proceeds by decreasing the resolution and increasing the abstraction of the document, starting from high resolution analysis (pixel level), to a low resolution of the layout structure. Informations of high resolution are used to bring a specific prior knowledge of the layout like presence of textual information. Experiments have been performed on the RIMES database composed of more than 5000 handwritten letters. Good results have been reported showing the capacity of our approach to extract simultaneously the physical and logical layouts.
Keywords
document image processing; feature extraction; handwritten character recognition; image resolution; random processes; text analysis; RIMES database; conditional random field modeling; document layout extraction; handwritten letters; layout interpretation; resolution analysis; textual information; Conditional Random Field; Handwritten Document Structure; Layout Extraction;
fLanguage
English
Publisher
ieee
Conference_Titel
Frontiers in Handwriting Recognition (ICFHR), 2010 International Conference on
Conference_Location
Kolkata
Print_ISBN
978-1-4244-8353-2
Type
conf
DOI
10.1109/ICFHR.2010.13
Filename
5693496
Link To Document