DocumentCode :
134569
Title :
Skeleton extraction: Comparison of five methods on the Arabic IFN/ENIT database
Author :
Al-Shatnawi, Atallah M. ; Omar, K. ; AlFawwaz, Bader M. ; Zeki, Akram M.
Author_Institution :
Dept. of Inf. Syst., Al-albayt Univ., Mafraq, Jordan
fYear :
2014
fDate :
26-27 March 2014
Firstpage :
50
Lastpage :
59
Abstract :
Thinning “Skeletonization” is a very crucial stage in the Arabic Character Recognition (ACR) system. It simplifies the text shape and reduces the amount of data that needs to be handled and it is usually used as a pre-processing stage for recognition and storage systems. The skeleton of Arabic text can be used for: baseline detection, character segmentation, and features extraction, and ultimately supporting the classification. In this paper, five of the state of the art thinning algorithms are selected and implemented. The five algorithms are: SPTA, Zhang-Suen parallel thinning algorithm, Huang-Wan-Liu thinning algorithm, thinning and skeletonization based morphological operation algorithms. The five selected algorithms are applied on the IFN/ENIT dataset. The results obtained by the five methods are discussed and analyzed against the IFN/ENIT dataset based on preserving shape and the text connectivity, preventing spurious tails, maintaining one pixel width skeleton and avoiding the necking problem as well as running time efficiently. In addition to that some performance measurement for checking text connectivity, spurious tails and calculating the stroke thickness are proposed and carried out.
Keywords :
character recognition; feature extraction; image classification; image segmentation; image thinning; ACR system; Arabic IFN/ENIT database; Arabic character recognition system; Arabic text skeleton; Huang-Wan-Liu thinning algorithm; IFN/ENIT dataset; SPTA; Zhang-Suen parallel thinning algorithm; baseline detection; character segmentation; classification; data handling; features extraction; performance measurement; shape preservation; skeleton extraction; skeletonization based morphological operation algorithm; spurious tail prevention; storage system; text connectivity preservation; text shape; thinning based morphological operation algorithm; thinning skeletonization; Algorithm design and analysis; Classification algorithms; Educational institutions; Image edge detection; Information systems; Shape; Skeleton; Arabic Character Recognition; Huang-Wan-L; SPTA; Skeleton; Text connectivity; Thinning; Zhang-Suen; morphological;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Information Technology (CSIT), 2014 6th International Conference on
Conference_Location :
Amman
Print_ISBN :
978-1-4799-3998-5
Type :
conf
DOI :
10.1109/CSIT.2014.6805978
Filename :
6805978
Link To Document :
بازگشت