Title :
Computer Aided Indexing of Historical Manuscripts
Author :
Shahab, S.A. ; Al-Khatib, Wasfi G. ; Mahmoud, Sabri A.
Author_Institution :
Dept. of Inf. & Comput. Sci., King Fahd Univ. of Pet. & Miner., Dhahran
Abstract :
Arabic manuscripts represent a rich source of knowledge that has been highly underutilized. Huge repositories of historical artifacts are yet to be typeset and published in book-form. Given vast content of these manuscripts, it is important to develop indexing systems that support content-based retrieval from historical manuscripts. In this paper, we propose a computer aided retrieval and indexing system for Arabic historical manuscripts. The proposed system extracts meaningful information (features) that is used in indexing. Some preprocessing steps are also implemented in order to enhance the quality of document images. More than one form of a similarity measure has been tested. The developed prototype system has shown encouraging results with respect to the word matching rates achieved
Keywords :
content-based retrieval; document image processing; history; image retrieval; indexing; optical character recognition; string matching; Arabic historical manuscript; computer aided indexing system; computer aided retrieval system; content-based retrieval; document image quality; word matching; Character recognition; Content based retrieval; Data mining; Feature extraction; Image retrieval; Indexing; Information retrieval; Optical character recognition software; Shape; Typesetting;
Conference_Titel :
Computer Graphics, Imaging and Visualisation, 2006 International Conference on
Conference_Location :
Sydney, Qld.
Print_ISBN :
0-7695-2606-3
DOI :
10.1109/CGIV.2006.31