• DocumentCode
    256338
  • Title

    Integrating effective rules to improve arabic text stemming

  • Author

    Cherif, Walid ; Madani, Abdellah ; Kissi, Mohamed

  • Author_Institution
    Dept. of Comput., Chouaib Doukkali Univ., El-Jadida, Morocco
  • fYear
    2014
  • fDate
    14-16 April 2014
  • Firstpage
    1077
  • Lastpage
    1081
  • Abstract
    Nowadays, with the growth in the use of search engines, the extension of spying programs and anti -terrorism prevention, several researches focused on text analysis. In this sense, lemmatization and stemming are two common requirements of these researches. They include reducing different grammatical forms of a word and bring them to a common base form. In what follows, we will discuss these treatment methods on arabic text, especially the Khoja Stemmer, show their limits and provide new tools to improve it.
  • Keywords
    linguistics; search engines; terrorism; text analysis; Arabic text stemming; Khoja Stemmer; anti-terrorism prevention; lemmatization; search engines; spying programs; text analysis; arabic language; automatic language processing; lemmatization; light-stemming; stemming;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Multimedia Computing and Systems (ICMCS), 2014 International Conference on
  • Conference_Location
    Marrakech
  • Print_ISBN
    978-1-4799-3823-0
  • Type

    conf

  • DOI
    10.1109/ICMCS.2014.6911275
  • Filename
    6911275