• DocumentCode
    2719108
  • Title

    Arabic stemming with two dictionaries

  • Author

    Kchaou, Zied ; Kanoun, Slim

  • Author_Institution
    Res. Group on Intell. Machines, Univ. of Sfax, Sfax
  • fYear
    2008
  • fDate
    16-18 Dec. 2008
  • Firstpage
    688
  • Lastpage
    691
  • Abstract
    We propose an approach to stemming Arabic words similar to the approach of Khoja, but with two dictionaries, one of roots and another of radicals. Our approach has the advantage of reducing the words that are inspired by their radicals to their radical and words which are inspired by their roots to their roots with great reliability and consistency and solves the problem of the handicapped radicals and roots in Khoja. We tested our approach on a large corpus of Arabic texts covering several areas.
  • Keywords
    dictionaries; natural language processing; text analysis; Arabic text corpus; Arabic word stemming; Khoja approach; dictionary; handicapped radical; handicapped root; Dictionaries; Electric breakdown; Indexing; Information retrieval; Machine intelligence; Natural languages; Pattern matching; Testing; Vocabulary;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Innovations in Information Technology, 2008. IIT 2008. International Conference on
  • Conference_Location
    Al Ain
  • Print_ISBN
    978-1-4244-3396-4
  • Electronic_ISBN
    978-1-4244-3397-1
  • Type

    conf

  • DOI
    10.1109/INNOVATIONS.2008.4781780
  • Filename
    4781780