DocumentCode :
2631771
Title :
Binarization by Local K-means Clustering for Korean Text Extraction
Author :
Lai, Anh-Nga ; Lee, Gueesang
Author_Institution :
Dept. of Comput. Sci., Chonnam Nat. Univ., Gwangju
fYear :
2008
fDate :
16-19 Dec. 2008
Firstpage :
117
Lastpage :
122
Abstract :
Text information in a natural scene is very useful and important for understanding images. Detection and extraction of text information in such natural images have been used in many applications. Many conditions of natural scene make the problem of text segmentation quite intractable. In this paper, an effective method for the segmentation and binarization of Korean texts from signboard images is proposed, which is robust in blurred images, uneven illumination and strong boundary text images. The proposed approach is based on local K-means clustering on separate words in text region. Firstly, detected text region is divided into local areas with relatively uniform illumination, and then using 3-means clustering with Euclidean distance has been applied to segment text from the background. By dividing the region of interest into local areas, the effect of uneven lighting has been minimized. The comparison with Otsu´s method and 2-means clustering based on intensity will be representation in some metrics. Natural images from the test database, collected from mobile devices, are used in the experiment and the results show the performance of the proposed method.
Keywords :
feature extraction; image restoration; image segmentation; natural language processing; pattern clustering; text analysis; Korean text extraction; Otsu method; binarization; blurred images; local K-means clustering; natural images; signboard images; text information; text segmentation; Cameras; Colored noise; Data mining; Image edge detection; Image segmentation; Layout; Lighting; Robustness; Support vector machine classification; Support vector machines; text binarization; text boundary and text clustering; text detection; uneven lighting;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Signal Processing and Information Technology, 2008. ISSPIT 2008. IEEE International Symposium on
Conference_Location :
Sarajevo
Print_ISBN :
978-1-4244-3554-8
Electronic_ISBN :
978-1-4244-3555-5
Type :
conf
DOI :
10.1109/ISSPIT.2008.4775658
Filename :
4775658
Link To Document :
بازگشت