DocumentCode :
3695242
Title :
Curriculum learning for printed text line recognition of ligature-based scripts
Author :
Adnan Ul-Hasan;Faisal Shafaity;Marcus Liwicki
Author_Institution :
Department of Computer Science, University of Kaiserslautern, Germany
fYear :
2015
Firstpage :
1001
Lastpage :
1005
Abstract :
This paper introduces a novel curriculum learning strategy for ligature-based scripts. Long Short-Term Memory Networks require thousands or even millions of iterations on target symbols, depending upon the complexity of the target data, to converge when trained for sequence transcription because they have to localize the individual symbols along with the recognition. Curriculum learning reduces the number of target symbols to be visited before the network converges. In this paper, we propose a ligature-based complexity measure to define the sampling order of the training data. Experiments performed on UPTI database show that the curriculum learning using our strategy can reduce the total number of target symbols before convergence for printed Urdu Nastaleeq OCR task.
Keywords :
"Logic gates","Target recognition","Integrated optics"
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2015 13th International Conference on
Type :
conf
DOI :
10.1109/ICDAR.2015.7333912
Filename :
7333912
Link To Document :
بازگشت