DocumentCode
2196705
Title
Alpha-Numerical Sequences Extraction in Handwritten Documents
Author
Thomas, Simon ; Chatelain, Clément ; Heutte, Laurent ; Paquet, Thierry
Author_Institution
LITIS, Univ. de Rouen, St. Etienne du Rouvray, France
fYear
2010
fDate
16-18 Nov. 2010
Firstpage
232
Lastpage
237
Abstract
In this paper, we introduce a system for extracting alpha-numerical sequences (keywords, numerical fields or alpha-numerical sequences) from unconstrained handwritten documents. Contrary to most approaches presented in the literature, our system relies on a global handwriting line model describing two kinds of information: i) the relevant information and ii) the irrelevant information, represented by a shallow parsing model. The shallow parsing of isolated text lines allows quick information extraction from any document while at the same time rejecting irrelevant information. Results on a public database of French incoming mail show the efficiency of the approach.
Keywords
feature extraction; handwriting recognition; information retrieval; alpha numerical sequences extraction; handwriting line model; handwritten documents; information extraction; irrelevant information representation; isolated text lines; literature; shallow parsing model;
fLanguage
English
Publisher
IEEE
Conference_Titel
Frontiers in Handwriting Recognition (ICFHR), 2010 International Conference on
Conference_Location
Kolkata
Print_ISBN
978-1-4244-8353-2
Type
conf
DOI
10.1109/ICFHR.2010.44
Filename
5693529