DocumentCode
248616
Title
Image character recognition using deep convolutional neural network learned from different languages
Author
Jinfeng Bai ; Zhineng Chen ; Bailan Feng ; Bo Xu
Author_Institution
Interactive Digital Media Technol. Res. Center, Inst. of Autom., Beijing, China
fYear
2014
fDate
27-30 Oct. 2014
Firstpage
2560
Lastpage
2564
Abstract
This paper proposes a shared-hidden-layer deep convolutional neural network (SHL-CNN) for image character recognition. In SHL-CNN, the hidden layers are made common across characters from different languages, performing a universal feature extraction process that aims at learning common character traits existed in different languages such as strokes, while the final softmax layer is made language dependent, trained based on characters from the destination language only. This paper is the first attempt to introduce the SHL-CNN framework to image character recognition. Under the SHL-CNN framework, we discuss several issues including architecture of the network, training of the network, from which a suitable SHL-CNN model for image character recognition is empirically learned. The effectiveness of the learned SHL-CNN is verified on both English and Chinese image character recognition tasks, showing that the SHL-CNN can reduce recognition errors by 16-30% relatively compared with models trained by characters of only one language using conventional CNN, and by 35.7% relatively compared with state-of-the-art methods. In addition, the shared hidden layers learned are also useful for unseen image character recognition tasks.
Keywords
character recognition; feature extraction; image recognition; learning (artificial intelligence); natural language processing; neural nets; Chinese image character recognition; English image character recognition; SHL-CNN framework; SHL-CNN model; character traits; destination language; hidden layers; shared-hidden-layer deep convolutional neural network; softmax layer; universal feature extraction process; unseen image character recognition; Character recognition; Databases; Feature extraction; Image recognition; Neural networks; Optical character recognition software; Training; deep convolutional neural network; image character recognition; multi-task learning;
fLanguage
English
Publisher
ieee
Conference_Titel
Image Processing (ICIP), 2014 IEEE International Conference on
Conference_Location
Paris
Type
conf
DOI
10.1109/ICIP.2014.7025518
Filename
7025518
Link To Document