• DocumentCode
    1662089
  • Title

    Tandem HMM with convolutional neural network for handwritten word recognition

  • Author

    Bluche, Theodore ; Ney, Hermann ; Kermorvant, Christopher

  • Author_Institution
    A2iA SA, France
  • fYear
    2013
  • Firstpage
    2390
  • Lastpage
    2394
  • Abstract
    In this paper, we investigate the combination of hidden Markov models and convolutional neural networks for handwritten word recognition. The convolutional neural networks have been successfully applied to various computer vision tasks, including handwritten character recognition. In this work, we show that they can replace Gaussian mixtures to compute emission probabilities in hidden Markov models (hybrid combination), or serve as feature extractor for a standard Gaussian HMM system (tandem combination). The proposed systems outperform a basic HMM based on either decorrelated pixels or handcrafted features. We validated the approach on two publicly available databases, and we report up to 60% (Rimes) and 35% (IAM) relative improvement compared to a Gaussian HMM based on pixel values. The final systems give comparable results to recurrent neural networks, which are the best systems since 2009.
  • Keywords
    computer vision; handwritten character recognition; hidden Markov models; neural nets; computer vision tasks; convolutional neural network; decorrelated pixels; emission probabilities; feature extractor; handcrafted features; handwritten character recognition; handwritten word recognition; hidden Markov models; hybrid combination; standard Gaussian HMM system; tandem HMM; tandem combination; Databases; Feature extraction; Handwriting recognition; Hidden Markov models; Neural networks; Principal component analysis; Training; Convolutional Neural Network; Handwriting recognition; Hidden Markov Model;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on
  • Conference_Location
    Vancouver, BC
  • ISSN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2013.6638083
  • Filename
    6638083