Title of article :
A Rule-Based Extensible Stemmer for Information Retrieval with Application to Arabic
Author/Authors :
Harmanani, Haidar Lebanese American University - Computer Science and Mathematics Division, Lebanon , Keirouz, Walid American University of Beirut - Department of Computer Science, Lebanon , Raheel, Saeed Lebanese American University - Computer Science and Mathematics Division, Lebanon
From page :
265
To page :
272
Abstract :
This paper presents a new and extensible method for information retrieval and content analysis in Natural Languages (NL). The proposed method is stem-based; stems are extracted based on a set of language dependent rules that are interpreted by a rule engine. The rule engine allows the system to be adapted to any natural language by modifying the NL semantic rules and grammar. The system has been fully tested using Arabic, and partially using English, Hebrew, and Persian. We have validated our approach using a database-based prototype
Keywords :
Natural language processing , information retrieval , stemming
Journal title :
The International Arab Journal of Information Technology (IAJIT)
Journal title :
The International Arab Journal of Information Technology (IAJIT)
Record number :
2543347
Link To Document :
بازگشت