Title :
Content-based classification of graphical document images
Author :
Khare, A. ; Jeph, P. ; Ghosh, H.
Author_Institution :
TCS Innovation Labs., Tata Consultancy Services Ltd., Gurgaon, India
Abstract :
We present a computer vision based approach for classifying graphical document images by matching distinct visual patterns present in them. To accomplish this task the image is first decomposed into congruous segments, some of which contain distinct patterns followed by image matching to identify the presence of a specific pattern in the image. We have used clustering based image segmentation to extract distinctive patterns and PCA-SIFT image features for robust image matching. We have used R-Tree based feature indexing for faster retrieval of images. We have done our experiments on advertisement images which contain company´s trademarks and finally classify them based on their advertiser.
Keywords :
computer vision; content-based retrieval; document image processing; feature extraction; image classification; image matching; image segmentation; tree searching; PCA-SIFT image; R-Tree; advertisement images; computer vision; congruous segments; content-based classification; feature extraction; graphical document image classification; image matching; image segmentation; indexing; trademarks; visual pattern matching; Document image processing; Image segmentation; Pattern classification; Pattern matching; Tree searching;
Conference_Titel :
Visual Information Processing (EUVIP), 2010 2nd European Workshop on
Conference_Location :
Paris
Print_ISBN :
978-1-4244-7288-8
DOI :
10.1109/EUVIP.2010.5699113