Title :
Curriculum learning for printed text line recognition of ligature-based scripts
Author :
Adnan Ul-Hasan;Faisal Shafaity;Marcus Liwicki
Author_Institution :
Department of Computer Science, University of Kaiserslautern, Germany
Abstract :
This paper introduces a novel curriculum learning strategy for ligature-based scripts. Long Short-Term Memory Networks require thousands or even millions of iterations on target symbols, depending upon the complexity of the target data, to converge when trained for sequence transcription because they have to localize the individual symbols along with the recognition. Curriculum learning reduces the number of target symbols to be visited before the network converges. In this paper, we propose a ligature-based complexity measure to define the sampling order of the training data. Experiments performed on UPTI database show that the curriculum learning using our strategy can reduce the total number of target symbols before convergence for printed Urdu Nastaleeq OCR task.
Keywords :
"Logic gates","Target recognition","Integrated optics"
Conference_Titel :
Document Analysis and Recognition (ICDAR), 2015 13th International Conference on
DOI :
10.1109/ICDAR.2015.7333912