DocumentCode
2719108
Title
Arabic stemming with two dictionaries
Author
Kchaou, Zied ; Kanoun, Slim
Author_Institution
Res. Group on Intell. Machines, Univ. of Sfax, Sfax
fYear
2008
fDate
16-18 Dec. 2008
Firstpage
688
Lastpage
691
Abstract
We propose an approach to stemming Arabic words that is similar to Khoja's but uses two dictionaries, one of roots and one of radicals. Its advantage is that it reduces words derived from a radical to that radical, and words derived from a root to that root, with great reliability and consistency, and it solves the problem of handicapped radicals and roots in Khoja's approach. We tested our approach on a large corpus of Arabic texts covering several subject areas.
Keywords
dictionaries; natural language processing; text analysis; Arabic text corpus; Arabic word stemming; Khoja approach; dictionary; handicapped radical; handicapped root; Dictionaries; Electric breakdown; Indexing; Information retrieval; Machine intelligence; Natural languages; Pattern matching; Testing; Vocabulary
fLanguage
English
Publisher
IEEE
Conference_Titel
Innovations in Information Technology, 2008. IIT 2008. International Conference on
Conference_Location
Al Ain
Print_ISBN
978-1-4244-3396-4
Electronic_ISBN
978-1-4244-3397-1
Type
conf
DOI
10.1109/INNOVATIONS.2008.4781780
Filename
4781780