Title :
A neural re-ranking method for searching ancient Arabic documents on the Web
Author :
Sari, Toufik ; Chemam, Chaouki
Author_Institution :
Lab. de Gestion Electron. de Documents (LabGED), Badji Mokhtar Univ., Annaba, Algeria
fDate :
Nov. 29 2011-Dec. 1 2011
Abstract :
Web users want a quick and accurate access to images. The method currently used by search engines is the analysis of text surrounding an image which usually causes errors. Since there is a huge gap between the content of the image and the textual description associated. Hence, realizing a search engine for images in the web considering their contents became therefore mandatory. In this paper, we propose a method for collecting images of old Arabic documents from the Web. This work focuses mainly on content based image retrieval by texture feature using a neural network for classification and trying to integrate the user in the search loop. The system begins with the formulation of a query text, which is expanded and sent to a conventional search engine. Then, the obtained results are filtered by a neural network and finally displayed to the user for agreement. The experiments with various query texts shown good performances and hundreds of old Arabic documents were collected.
Keywords :
Internet; classification; content-based retrieval; document image processing; image retrieval; natural language processing; neural nets; search engines; text analysis; Web users; World Wide Web; ancient Arabic documents; classification; content based image retrieval; image collection; neural network; neural reranking method; query text; search engines; search loop; text analysis; textual description; texture feature; Image retrieval; Indexes; Matrix converters; Search engines; Symmetric matrices; Visualization;
Conference_Titel :
Innovation in Information & Communication Technology (ISIICT), 2011 Fourth International Symposium on
Conference_Location :
Amman
Print_ISBN :
978-1-61284-672-9
DOI :
10.1109/ISIICT.2011.6149609