DocumentCode :
2915288
Title :
A contribution to Arabic Named Entity Recognition
Author :
Koulali, Rim ; Meziane, Abdelkrim
Author_Institution :
LARI Lab., Mohammed I Univ., Oujda, Morocco
fYear :
2012
fDate :
21-23 Nov. 2012
Firstpage :
46
Lastpage :
52
Abstract :
Named Entity Recognition (NER) is a clue task for improving the automatic text processing, required in a wide variety of applications. NER techniques range from hand-crafted rules to machine learning approaches. In this paper, we describe the development and the implementation of an Arabic Named Entity Recognition (ANER) System, based on machine learning approach. We use SVM classifier with a set of dependent and independent language features. We also investigate the use of patterns to ameliorate the ANER task by implementing an automatic pattern extractor framework based on Part Of Speech (POS) Information and linguistic filters. Finally, we explore the impact of several features combinations on the performances of the developed system. Our system achieves an overall average F-measure value of 83,20%. To measure the effectiveness of the developed ANER system in Topic Detection (TD) context, we conduct several experiments using NEs as features. The obtained results were very encouraging.
Keywords :
feature extraction; learning (artificial intelligence); natural language processing; pattern classification; support vector machines; text analysis; ANER system; Arabic named entity recognition; F-measure value; NER technique; POS information; SVM classifier; automatic pattern extractor framework; automatic text processing; feature combination; independent language feature; linguistic filter; machine learning; part of speech information; topic detection; Context; Feature extraction; Machine learning; Organizations; Support vector machines; Training; Vocabulary; arabic named entity recognition; automatic natural language processing; hybrid approach; jaccard indicator; pattern extractor; svm; topic detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
ICT and Knowledge Engineering (ICT & Knowledge Engineering), 2012 10th International Conference on
Conference_Location :
Bangkok
ISSN :
2157-0981
Print_ISBN :
978-1-4673-2316-1
Type :
conf
DOI :
10.1109/ICTKE.2012.6408570
Filename :
6408570
Link To Document :
بازگشت