• DocumentCode
    2760967
  • Title

    A structural rule-based stemmer for Persian

  • Author

    Rahimtoroghi, Elaheh ; Faili, Hesham ; Shakery, Azadeh

  • Author_Institution
    Sch. of Electr. & Comput. Eng., Univ. of Tehran, Tehran, Iran
  • fYear
    2010
  • fDate
    4-6 Dec. 2010
  • Firstpage
    574
  • Lastpage
    578
  • Abstract
    This paper presents a new stemmer for Persian language. We used a structural approach for stemming which uses the structure of words and morphological rules of the language to recognize the stem of each word. We composed 33 rules to describe a structural rule-based stemmer. The rules are written based on the morphology of Persian language and its word derivation structure. For evaluation, we used our stemmer in an information retrieval system. The results demonstrated that by enhancing the system with this stemmer, the information retrieval system´s precision increases, by the factor of 4.78% and the indexing file size decreases by the factor of 6%.
  • Keywords
    information retrieval systems; natural language processing; Persian language; Persian word derivation structure; information retrieval system; structural rule-based stemmer; Computers; Educational institutions; Indexing; Information retrieval; Morphology; Speech; Information Retrieval; Natural Language Processing; Persian Language; Stemming;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Telecommunications (IST), 2010 5th International Symposium on
  • Conference_Location
    Tehran
  • Print_ISBN
    978-1-4244-8183-5
  • Type

    conf

  • DOI
    10.1109/ISTEL.2010.5734090
  • Filename
    5734090