Title :
Arabic Information Retrieval System Based on Noun Phrases
Author :
Ataa Allah, F. ; Boulaknadel, S. ; El qadi, A. ; Aboutajdine, D.
Author_Institution :
Fac. of Sci., Mohamed V-Agdal Univ., Rabat
Abstract :
In a rich information context, an information retrieval system must be able to ensure the best results. For this, the aim of our study consists in extracting the knowledge based on document textual contents by associating the analysis smoothness of a linguistic approach to the statistical approach capacity treating large corpus. The statistical approach is based on text mining, mainly on the latent semantic analysis technique; while the linguistic approach is based on the noun phrases which are more susceptible to be used like textual entities in representing the text information than the simple terms. By experimentation in Arabic documents, specialized in the environment field, we show the use of noun phrase impact on the information retrieval system precision
Keywords :
computational linguistics; data mining; grammars; information retrieval systems; natural languages; statistical analysis; text analysis; Arabic information retrieval system; document textual contents; knowledge extraction; latent semantic analysis; linguistic approach; noun phrases; statistical approach; text information representation; text mining; Content based retrieval; Data mining; Indexing; Information analysis; Information retrieval; Natural languages; Performance analysis; Performance evaluation; Shape; Text mining;
Conference_Titel :
Information and Communication Technologies, 2006. ICTTA '06. 2nd
Conference_Location :
Damascus
Print_ISBN :
0-7803-9521-2
DOI :
10.1109/ICTTA.2006.1684645