• DocumentCode
    2196705
  • Title

    Alpha-Numerical Sequences Extraction in Handwritten Documents

  • Author

    Thomas, Simon ; Chatelain, Clément ; Heutte, Laurent ; Paquet, Thierry

  • Author_Institution
    LITIS, Univ. de Rouen, St. Etienne du Rouvray, France
  • fYear
    2010
  • fDate
    16-18 Nov. 2010
  • Firstpage
    232
  • Lastpage
    237
  • Abstract
    In this paper, we introduce an alpha-numerical sequences extraction system (keywords, numerical fields or alpha-numerical sequences) in unconstrained handwritten documents. Contrary to most of the approaches presented in the literature, our system relies on a global handwriting line model describing two kinds of information : i) the relevant information and ii) the irrelevant information represented by a shallow parsing model. The shallow parsing of isolated text lines allows quick information extraction in any document while rejecting at the same time irrelevant information. Results on a public french incoming mails database show the efficiency of the approach.
  • Keywords
    feature extraction; handwriting recognition; information retrieval; alpha numerical sequences extraction; handwriting line model; handwritten documents; information extraction; irrelevant information representation; isolated text lines; literature; shallow parsing model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Frontiers in Handwriting Recognition (ICFHR), 2010 International Conference on
  • Conference_Location
    Kolkata
  • Print_ISBN
    978-1-4244-8353-2
  • Type

    conf

  • DOI
    10.1109/ICFHR.2010.44
  • Filename
    5693529