DocumentCode
2760967
Title
A structural rule-based stemmer for Persian
Author
Rahimtoroghi, Elaheh ; Faili, Hesham ; Shakery, Azadeh
Author_Institution
Sch. of Electr. & Comput. Eng., Univ. of Tehran, Tehran, Iran
fYear
2010
fDate
4-6 Dec. 2010
Firstpage
574
Lastpage
578
Abstract
This paper presents a new stemmer for Persian language. We used a structural approach for stemming which uses the structure of words and morphological rules of the language to recognize the stem of each word. We composed 33 rules to describe a structural rule-based stemmer. The rules are written based on the morphology of Persian language and its word derivation structure. For evaluation, we used our stemmer in an information retrieval system. The results demonstrated that by enhancing the system with this stemmer, the information retrieval system´s precision increases, by the factor of 4.78% and the indexing file size decreases by the factor of 6%.
Keywords
information retrieval systems; natural language processing; Persian language; Persian word derivation structure; information retrieval system; structural rule-based stemmer; Computers; Educational institutions; Indexing; Information retrieval; Morphology; Speech; Information Retrieval; Natural Language Processing; Persian Language; Stemming;
fLanguage
English
Publisher
ieee
Conference_Titel
Telecommunications (IST), 2010 5th International Symposium on
Conference_Location
Tehran
Print_ISBN
978-1-4244-8183-5
Type
conf
DOI
10.1109/ISTEL.2010.5734090
Filename
5734090
Link To Document