DocumentCode :
2307402
Title :
Content-based classification of graphical document images
Author :
Khare, A. ; Jeph, P. ; Ghosh, H.
Author_Institution :
TCS Innovation Labs., Tata Consultancy Services Ltd., Gurgaon, India
fYear :
2010
fDate :
5-6 July 2010
Firstpage :
241
Lastpage :
246
Abstract :
We present a computer vision based approach for classifying graphical document images by matching distinct visual patterns present in them. To accomplish this task the image is first decomposed into congruous segments, some of which contain distinct patterns followed by image matching to identify the presence of a specific pattern in the image. We have used clustering based image segmentation to extract distinctive patterns and PCA-SIFT image features for robust image matching. We have used R-Tree based feature indexing for faster retrieval of images. We have done our experiments on advertisement images which contain company´s trademarks and finally classify them based on their advertiser.
Keywords :
computer vision; content-based retrieval; document image processing; feature extraction; image classification; image matching; image segmentation; tree searching; PCA-SIFT image; R-Tree; advertisement images; computer vision; congruous segments; content-based classification; feature extraction; graphical document image classification; image matching; image segmentation; indexing; trademarks; visual pattern matching; Document image processing; Image segmentation; Pattern classification; Pattern matching; Tree searching;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Visual Information Processing (EUVIP), 2010 2nd European Workshop on
Conference_Location :
Paris
Print_ISBN :
978-1-4244-7288-8
Type :
conf
DOI :
10.1109/EUVIP.2010.5699113
Filename :
5699113
Link To Document :
بازگشت