• DocumentCode
    3695096
  • Title

    Investigation of Segmental Conditional Random Fields for large vocabulary handwriting recognition

  • Author

    Mahdi Hamdani;M. Ali Basha Shaik;Patrick Doetsch;Hermann Ney

  • Author_Institution
    Human Language Technology and Pattern Recognition Group - RWTH Aachen University, Germany
  • fYear
    2015
  • Firstpage
    261
  • Lastpage
    265
  • Abstract
    Multiple types of models are used in handwriting recognition and can be broadly categorized into generative and discriminative models. Gaussian Hidden Markov Models are used successfully in most of the systems. Discriminative training can be applied to these models to improve them further. Alternatively, Segmental Conditional Random Fields have the advantage of being discriminative as well as segmental. The novelty of this work is the investigation of Segmental Conditional Random Fields for handwriting recognition. In addition, Multi-Layer Perceptrons and Long Short Term Memory Recurrent Neural Networks are compared for the observations generation in this framework. Various types of features are investigated in the segmental models for handwriting recognition. Furthermore, class-based language model features are proposed to extend this model. Visual features based on moments are extracted at a word level to make the model more robust. Experimental results on English handwriting show a relative reduction of 13.7% in terms of word error rate w.r.t. the baseline system. The proposed system also outperforms the Gaussian Hidden Markov Models trained discriminatively using the minimum phone error criterion by a relative reduction of 6.9% in terms of word error rate.
  • Keywords
    "Hidden Markov models","Adaptation models","Handwriting recognition","Computer architecture","Speech recognition","Computational modeling","Markov processes"
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2015 13th International Conference on
  • Type

    conf

  • DOI
    10.1109/ICDAR.2015.7333764
  • Filename
    7333764