• DocumentCode
    2617827
  • Title

    Arabic verb pattern extraction

  • Author

    Saad, E.M. ; Awadalla, M.H. ; Alajmi, A.

  • Author_Institution
    Commun. & Electron. Dept., Helwan Univ., Cairo, Egypt
  • fYear
    2010
  • fDate
    10-13 May 2010
  • Firstpage
    642
  • Lastpage
    645
  • Abstract
    Arabic is a highly inflected language, and therefore stemming and root extraction represent a challenge to researchers. A new method is presented for extracting the stem and lemma of Arabic text. Stemming sometimes affects the semantics of a word, whereas lemmatization preserves the meaning of a word. The approach is based on pattern extraction. It uses a special encoding based on dividing letters into original and non-original letters. Codes are automatically generated for each pattern and then matched against the input text to extract the root, pattern, and lemma of a word. A comparison with other methods shows promising results, with accuracy of up to 96%.
  • Keywords
    computational linguistics; data handling; feature extraction; pattern recognition; Arabic text stem extraction; Arabic verb pattern extraction; inflected language; Morphological Analyzer; Natural Language Processing; Root Extraction;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Information Sciences Signal Processing and their Applications (ISSPA), 2010 10th International Conference on
  • Conference_Location
    Kuala Lumpur
  • Print_ISBN
    978-1-4244-7165-2
  • Type

    conf

  • DOI
    10.1109/ISSPA.2010.5605427
  • Filename
    5605427