• DocumentCode
    3695097
  • Title

    Aligning transcript of historical documents using energy minimization

  • Author

    Rafi Cohen;Irina Rabaev;Jihad El-Sana;Klara Kedem;Itshak Dinstein

  • Author_Institution
    Department of Computer Science, Ben-Gurion University, Beer-Sheva, Israel
  • fYear
    2015
  • Firstpage
    266
  • Lastpage
    270
  • Abstract
    An ongoing considerable effort for digitizing historical manuscripts has produced images of original manuscripts, some accompanied by transcripts. Aligning the text in the input image with the text in the transcript will allow learning, training and evaluating recognition algorithms. Here we propose a system that computes the alignment by formulating the problem as an energy minimization task, where the alignment is performed between the input line image to a synthetic one. The energy function works at a connected component level and it combines a visual similarity measure and a learned distance metric that separates between inter-word and intra-word connected components.
  • Keywords
    Image recognition
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2015 13th International Conference on
  • Type

    conf

  • DOI
    10.1109/ICDAR.2015.7333765
  • Filename
    7333765