Title :
Re-evaluating chain-code as features for Bangla script
Author :
Alam, Md Nafiul ; Naser, M.A.
Author_Institution :
Islamic Univ. of Technol., Gazipur, Bangladesh
Abstract :
Most of the characters in Bangla script have similar shapes. Since chain-code is one kind of shape descriptor, we argue that it would not be able to distinguish between two similar characters and hence not be able to provide good recognition rate. Even though, the chain-code has widely been used as feature for Bangla script, none of the literature has considered the fact that chain-code may not be compatible with the script itself. We assume that chain-code cannot provide the variation necessary to describe the contours of similar characters, especially which are almost identical in the first place. We validated our proposal through a statistical test called one-way Analysis of Variance (ANOVA), to verify whether chain-codes of two similar looking characters are truly different. The results substantiate our assumption, suggesting that chain-code based features may not be the best features for Bangla character recognition.
Keywords :
feature extraction; natural language processing; optical character recognition; statistical analysis; ANOVA; Bangla character recognition; Bangla script characters; Bangla script features; analysis of variance; reevaluating chain code; shape descriptor; statistical test; Accuracy; Analysis of variance; Character recognition; Feature extraction; Histograms; Optical character recognition software; Shape; Analysis of Variance (ANOVA); Bangla script; Chain-code; F-distribution; Optical Character Recognition (OCR);
Conference_Titel :
Electrical Information and Communication Technology (EICT), 2013 International Conference on
Conference_Location :
Khulna
Print_ISBN :
978-1-4799-2297-0
DOI :
10.1109/EICT.2014.6777865