DocumentCode
2617827
Title
Arabic verb pattern extraction
Author
Saad, E.M. ; Awadalla, M.H. ; Alajmi, A.
Author_Institution
Commun. & Electron. Dept., Helwan Univ., Cairo, Egypt
fYear
2010
fDate
10-13 May 2010
Firstpage
642
Lastpage
645
Abstract
Arabic is a highly inflected language, and therefore the processes of stemming and root extracting represent a challenge to researches. A new method is presented for extracting Arabic text stem, and lemma. Stemming sometimes affects the semantic of a word, where as lemma preserve the meaning of a word. The approach is based on pattern extraction. It uses a special encoding based on dividing letters into original and non-original letters. Codes are automatically generated for each pattern and then match against input text to extract root, pattern, and lemma of a word. A comparison with other methods reveals a promising result with accuracy up to 96%.
Keywords
computational linguistics; data handling; feature extraction; pattern recognition; Arabic text stem extraction; Arabic verb pattern extraction; inflected language; Morphological Analyzer; Natural Language Processing; Root Extraction;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Sciences Signal Processing and their Applications (ISSPA), 2010 10th International Conference on
Conference_Location
Kuala Lumpur
Print_ISBN
978-1-4244-7165-2
Type
conf
DOI
10.1109/ISSPA.2010.5605427
Filename
5605427
Link To Document