DocumentCode :
638304
Title :
Studying the impact of various features on the performance of Conditional Random Field-based Arabic Named Entity Recognition
Author :
Morsi, Alia ; Rafea, Ahmed
Author_Institution :
Dept. of Comput. Sci. & Eng., American Univ. in Cairo, Cairo, Egypt
fYear :
2013
fDate :
27-30 May 2013
Firstpage :
1
Lastpage :
5
Abstract :
The task of Named Entity Recognition (NER) is crucial to Natural Language Processing (NLP). NER can be defined as the computational identification and classification of Named Entities in running text. The importance of NER stems from the variety of Natural Language Processing applications where accurate NLP would be highly useful. Such include machine translation and information extraction. In this paper, we present a series of experiments to explore the impact of using various feature sets on NER results for Modern Standard Arabic (MSA) text. We rely on language independent features and we employ CRF based models for all our experiments. We create our own baseline model to use its results for comparison. Our best result is a 68.05 F-Measure, which is 10.82 points above our baseline.
Keywords :
language translation; natural language processing; CRF based models; F-measure; MSA; NER; NLP; conditional random field-based Arabic named entity recognition; information extraction; language independent features; machine translation; modern standard Arabic text; named entity recognition; natural language processing; Computational linguistics; Data models; Natural language processing; Organizations; Standards; Testing; Training; Conditional Random Fields (CRF); Named Entity Recognition (NER); Natural Language Processing (NLP);
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computer Systems and Applications (AICCSA), 2013 ACS International Conference on
Conference_Location :
Ifrane
ISSN :
2161-5322
Type :
conf
DOI :
10.1109/AICCSA.2013.6616423
Filename :
6616423
Link To Document :
بازگشت