• DocumentCode
    3695071
  • Title

    Combining handwriting and speech recognition for transcribing historical handwritten documents

  • Author

    Emilio Granell;Carlos-D. Martínez-Hinarejos

  • Author_Institution
    Pattern Recognition and Human Language Technology Research Center, Universitat Politè
  • fYear
    2015
  • Firstpage
    126
  • Lastpage
    130
  • Abstract
    Transcription of historical documents is an interesting task for libraries in order to make available their funds. In the lasts years, the use of Handwritten Text Recognition allowed paleographs to speed up the manual transcription process, since they are able to correct on a draft transcription. Another alternative is obtaining the draft transcription by dictating the contents to an Automatic Speech Recognition system. When both sources (image and speech) are available, a multimodal combination is possible, and an iterative process can be used in order to refine the final hypothesis. In this work, a multimodal combination based on confusion networks is presented. Results on two different sets of data, with different difficulty level, show that the proposed technique provides similar or better draft transcriptions than a previously proposed approach, allowing for a faster transcription process.
  • Keywords
    "Iterative decoding","Acoustics","Proposals","Laplace equations","Integrated optics","Optical imaging"
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2015 13th International Conference on
  • Type

    conf

  • DOI
    10.1109/ICDAR.2015.7333739
  • Filename
    7333739