DocumentCode :
2432729
Title :
A new stemmer for Farsi language
Author :
Estahbanati, Somayye ; Javidan, Reza
Author_Institution :
Sci. & Res. Branch, Dept. of Comput. Eng., Islamic Azad Univ., Khoozestan, Iran
fYear :
2011
fDate :
15-16 June 2011
Firstpage :
25
Lastpage :
29
Abstract :
In this paper, we report on the design and implementation of a stemmer for the Farsi language, according to combination of Kazem Taghva´s method and improved Krovetz´s method. The first method removes the suffixes and prefixes according to the word´s structure. And the second method is based on saving the information in a Database. This paper reports a kind of combination of these methods. The results of our evaluation on a small Farsi document collection show a significant improvement in precision/recall.
Keywords :
document handling; natural language processing; Farsi document collection; Farsi language; Kazem Taghva method; Krovetz method; stemmer; Algorithm design and analysis; Computers; Databases; Europe; Information retrieval; Internet; Morphology; Farsi language; Persian Language; Stemming; algorithm;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Science and Software Engineering (CSSE), 2011 CSI International Symposium on
Conference_Location :
Tehran
Print_ISBN :
978-1-61284-206-6
Type :
conf
DOI :
10.1109/CSICSSE.2011.5963993
Filename :
5963993
Link To Document :
بازگشت