DocumentCode
3449590
Title
An Approach to Script Identification in Multi-language Text Image
Author
Mingji Piao ; Rongyi Cui
Author_Institution
Dept. of Comput. Sci. & Technol., Intell. Inf. Process. Lab., Yanji, China
fYear
2013
fDate
1-3 Nov. 2013
Firstpage
248
Lastpage
251
Abstract
A character level script identification method to identify Korean, Chinese and English scripts using PCA is proposed in this paper. First, the space of eigenvectors was constructed by using PCA, and the segmented character was reconstructed by projecting the character into the space. Second, relative entropy between original and reconstructed image is computed for vertical and horizontal histogram. Finally, the written language was identified according to Euclidean distance and relative entropy between original and reconstructed image. The experiment results show that proposed method achieved 99.78% high accuracy for correct segmentation which effectively solved the script identification problem for multi-language text image contains Korean, Chinese and English.
Keywords
eigenvalues and eigenfunctions; entropy codes; handwritten character recognition; image reconstruction; image segmentation; principal component analysis; text detection; Chinese script identification; English script identification; Euclidean distance; Korean script identification; PCA; character level script identification method; character projection; eigenvector space construction; horizontal histogram; multilanguage text image; original image; reconstructed image; relative entropy; segmented character reconstruction; vertical histogram; written language identification; Abstracts; Entropy; Euclidean distance; Histograms; Image reconstruction; Image segmentation; Principal component analysis; Euclidean distance; character segmentation; principal component analysis; relative entropy; script identificationt;
fLanguage
English
Publisher
ieee
Conference_Titel
Intelligent Networks and Intelligent Systems (ICINIS), 2013 6th International Conference on
Conference_Location
Shenyang
Print_ISBN
978-1-4799-2808-8
Type
conf
DOI
10.1109/ICINIS.2013.70
Filename
6754719
Link To Document