DocumentCode :
3299754
Title :
Profile Based Information Retrieval from Printed Document Images
Author :
Abirami, S. ; Manjula, D.
Author_Institution :
Dept. of Comput. Sci. & Eng., Anna Univ., Chennai
fYear :
2007
fDate :
14-17 Aug. 2007
Firstpage :
268
Lastpage :
272
Abstract :
This paper presents profile-based information retrieval from collections of printed document images. Keywords are valuable indexing tools, and if they can be identified at the image level, extensive computation during recognition can be avoided. Printed documents are scanned to produce document images. Instead of converting entire document images into their text equivalents, word profiles are extracted and used to match word images in bilingual (English and Tamil) document images. During retrieval, the same profile is extracted from the user-specified word and matched against the word images in the document. This yields faster results, even for quality-degraded documents. This kind of information retrieval (keyword-based search) can be adopted in digital libraries, which work with digitized documents instead of text processing. It promotes efficient search in document images irrespective of language.
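The abstract does not specify which profiles are used or how they are compared, so the following is only a minimal sketch of the general idea of matching word images by profile. It assumes binarized word images (rows of 0/1 pixels, 1 = ink), upper and lower contour profiles, nearest-neighbour resampling, and a mean absolute difference with an arbitrary threshold. All function names and parameters here are hypothetical and are not taken from the paper.

```python
from typing import List, Sequence

Image = Sequence[Sequence[int]]  # rows of 0/1 pixels, 1 = ink (assumed format)


def upper_profile(img: Image) -> List[int]:
    """Per column: number of rows from the top edge to the first ink pixel."""
    height, width = len(img), len(img[0])
    profile = []
    for x in range(width):
        depth = height  # column with no ink gets the full height
        for y in range(height):
            if img[y][x]:
                depth = y
                break
        profile.append(depth)
    return profile


def lower_profile(img: Image) -> List[int]:
    """Per column: number of rows from the bottom edge to the first ink pixel."""
    return upper_profile(list(img)[::-1])


def resample(profile: List[int], length: int) -> List[int]:
    """Nearest-neighbour resampling so words of different widths are comparable."""
    n = len(profile)
    return [profile[min(n - 1, i * n // length)] for i in range(length)]


def word_signature(img: Image, length: int = 32) -> List[float]:
    """Concatenate height-normalized upper and lower profiles."""
    height = len(img)
    combined = resample(upper_profile(img), length) + resample(lower_profile(img), length)
    return [v / height for v in combined]


def profile_distance(a: List[float], b: List[float]) -> float:
    """Mean absolute difference between two signatures of equal length."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def find_matches(query: Image, words: Sequence[Image], threshold: float = 0.1) -> List[int]:
    """Indices of word images whose signature is within `threshold` of the query."""
    q = word_signature(query)
    return [i for i, w in enumerate(words)
            if profile_distance(q, word_signature(w)) <= threshold]
```

In this sketch the query word would first be rendered to an image so that the same signature can be computed for it and for each segmented word image. Because nothing here depends on character identity, the same comparison applies to English and Tamil words alike.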
Keywords :
digital libraries; document image processing; information retrieval; bilingual document images; keyword based search; printed document images; profile based information retrieval; text processing; Character recognition; Image converters; Image recognition; Image retrieval; Image segmentation; Information retrieval; Natural languages; Optical character recognition software; Pixel; Software libraries; Document Images; Word Profiles
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computer Graphics, Imaging and Visualisation, 2007. CGIV '07
Conference_Location :
Bangkok
Print_ISBN :
0-7695-2928-3
Type :
conf
DOI :
10.1109/CGIV.2007.67
Filename :
4293683