• DocumentCode
    248616
  • Title

    Image character recognition using deep convolutional neural network learned from different languages

  • Author

    Jinfeng Bai ; Zhineng Chen ; Bailan Feng ; Bo Xu

  • Author_Institution
    Interactive Digital Media Technol. Res. Center, Inst. of Autom., Beijing, China
  • fYear
    2014
  • fDate
    27-30 Oct. 2014
  • Firstpage
    2560
  • Lastpage
    2564
  • Abstract
    This paper proposes a shared-hidden-layer deep convolutional neural network (SHL-CNN) for image character recognition. In SHL-CNN, the hidden layers are made common across characters from different languages, performing a universal feature extraction process that aims at learning common character traits existed in different languages such as strokes, while the final softmax layer is made language dependent, trained based on characters from the destination language only. This paper is the first attempt to introduce the SHL-CNN framework to image character recognition. Under the SHL-CNN framework, we discuss several issues including architecture of the network, training of the network, from which a suitable SHL-CNN model for image character recognition is empirically learned. The effectiveness of the learned SHL-CNN is verified on both English and Chinese image character recognition tasks, showing that the SHL-CNN can reduce recognition errors by 16-30% relatively compared with models trained by characters of only one language using conventional CNN, and by 35.7% relatively compared with state-of-the-art methods. In addition, the shared hidden layers learned are also useful for unseen image character recognition tasks.
  • Keywords
    character recognition; feature extraction; image recognition; learning (artificial intelligence); natural language processing; neural nets; Chinese image character recognition; English image character recognition; SHL-CNN framework; SHL-CNN model; character traits; destination language; hidden layers; shared-hidden-layer deep convolutional neural network; softmax layer; universal feature extraction process; unseen image character recognition; Character recognition; Databases; Feature extraction; Image recognition; Neural networks; Optical character recognition software; Training; deep convolutional neural network; image character recognition; multi-task learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Image Processing (ICIP), 2014 IEEE International Conference on
  • Conference_Location
    Paris
  • Type

    conf

  • DOI
    10.1109/ICIP.2014.7025518
  • Filename
    7025518