DocumentCode :
1864804
Title :
Text recognition using deep BLSTM networks
Author :
Ray, Anupama ; Rajeswar, Sai ; Chaudhury, Santanu
Author_Institution :
Dept. of Electr. Eng., Indian Inst. of Technol. Delhi, New Delhi, India
fYear :
2015
fDate :
4-7 Jan. 2015
Firstpage :
1
Lastpage :
6
Abstract :
This paper presents a Deep Bidirectional Long Short Term Memory (LSTM) based Recurrent Neural Network architecture for text recognition. This architecture uses Connectionist Temporal Classification (CTC) for training to learn the labels of an unsegmented sequence with unknown alignment. This work is motivated by the results of Deep Neural Networks for isolated numeral recognition and improved speech recognition using Deep BLSTM based approaches. Deep BLSTM architecture is chosen due to its ability to access long range context, learn sequence alignment and work without the need of segmented data. Due to the use of CTC and forward backward algorithms for alignment of output labels, there are no unicode re-ordering issues, thus no need of lexicon or postprocessing schemes. This is a script independent and segmentation free approach. This system has been implemented for the recognition of unsegmented words of printed Oriya text. This system achieves 4.18% character level error and 12.11% word error rate on printed Oriya text.
Keywords :
image classification; image segmentation; learning (artificial intelligence); recurrent neural nets; text detection; CTC; connectionist temporal classification; deep BLSTM network; deep bidirectional long short term memory; deep neural networks; forward backward algorithms; isolated numeral recognition; label learning; long range context; output label alignment; printed Oriya text; recurrent neural network architecture; script independent approach; segmentation free approach; sequence alignment learning; speech recognition; text recognition; unsegmented word recognition; Computer architecture; Logic gates; Recurrent neural networks; Speech recognition; Text recognition; Training; Deep Neural Networks; Long Short Term Memory; Recurrent Neural Network; Text Recognition;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advances in Pattern Recognition (ICAPR), 2015 Eighth International Conference on
Conference_Location :
Kolkata
Type :
conf
DOI :
10.1109/ICAPR.2015.7050699
Filename :
7050699
Link To Document :
بازگشت