DocumentCode :
3073598
Title :
Multi-class Enhanced Image Mining of Heterogeneous Textual Images Using Multiple Image Features
Author :
Chitrakala, S. ; Shamini, P. ; Manjula, D.
Author_Institution :
Easwari Eng. Coll., Anna Univ., Chennai
fYear :
2009
fDate :
6-7 March 2009
Firstpage :
496
Lastpage :
501
Abstract :
Image mining deals with the extraction of implicit knowledge, image data relationship, or other patterns not explicitly stored in the images. This paper proposes an enhanced image classifier to extract patterns from images containing text using a combination of features. Image containing text can be divided into the following types: scene text image, caption text image and document image. A total of eight features including intensity histogram features and GLCM texture features are used to classify the images. In the first level of classification, the histogram features are extracted from grayscale images to separate document image from the others. In the second stage, the GLCM features are extracted from binary images to classify scene text and caption text images. In both stages, the decision tree classifier (DTC) is used for the classification. Experimental results have been obtained for a dataset of about 60 images of different types. This technique of classification has not been attempted before and its applications include preprocessing for indexing of images, for simplifying and speeding up content based image retrieval (CBIR) techniques and in areas of machine vision.
Keywords :
data mining; decision trees; document image processing; feature extraction; image classification; image enhancement; image texture; statistical analysis; text analysis; GLCM texture feature extraction; binary image; caption text image; decision tree classifier; document image; grayscale image; image classification; image data relationship; intensity histogram feature extraction; knowledge extraction; multiclass enhanced heterogeneous textual image mining; scene text image; Classification tree analysis; Content based retrieval; Data mining; Decision trees; Feature extraction; Gray-scale; Histograms; Image retrieval; Indexing; Layout; GLCM features; caption text; decision tree; image classification; image features; scene text;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advance Computing Conference, 2009. IACC 2009. IEEE International
Conference_Location :
Patiala
Print_ISBN :
978-1-4244-2927-1
Electronic_ISBN :
978-1-4244-2928-8
Type :
conf
DOI :
10.1109/IADCC.2009.4809061
Filename :
4809061
Link To Document :
بازگشت