• Title of article

    A synthesised word approach to word retrieval in handwritten documents

  • Author/Authors

    Liang, Y. and Fairhurst, M.C. and Guest, R.M.

  • Issue Information
    Journal with serial issue number, year 2012
  • Pages
    12
  • From page
    4225
  • To page
    4236
  • Abstract
    Recent technological advances have enhanced the computer-based indexing and searching of digitised printed books. The performance now achievable in this domain, however, does not at present extend to handwritten texts which inherently contain more significant letter-based variation within their content. Furthermore, in most studies that address the handwritten text retrieval problem, a large training dataset is required which, very often, influences the context and search lexicon. In this paper a novel method is described to overcome the training data problem using a character-based modelling (termed grapheme spectrum) approach and a word modelling technique (termed synthesised word) enabling the retrieval of keywords that have not explicitly been seen in the training set. When tested on an illustrative historical manuscript the performance of the proposed word retrieval technique shows a clear advantage over existing methods.
  • Keywords
    Handwriting analysis, Digital archives, Handwritten word retrieval, Word spotting, Handwriting recognition, Historical manuscript analysis, Information retrieval
  • Journal title
    PATTERN RECOGNITION
  • Serial Year
    2012
  • Record number

    1734966