DocumentCode :
2100426
Title :
An experimental study for some supervised lexical disambiguation methods of arabic language
Author :
Merhbene, Laroussi ; Zouaghi, Anis ; Zrigui, M.
Author_Institution :
LAtice Lab., Univ. of Monastir, Monastir, Tunisia
fYear :
2013
fDate :
24-26 Oct. 2013
Firstpage :
1
Lastpage :
6
Abstract :
In this paper we propose an experimental study for some supervised algorithms to disambiguate arabic words. Due to the lack of linguistic data for the Arabic language, we work on non-annotated corpus and with the help of four annotators; we were able to annotate the different samples containing the ambiguous words. Since that, we test the naïve Bayes algorithm, the decision lists and the exemplar based algorithm. During the experimental study, we test the influence of the window size on the disambiguation quality, the derivation and the technique of smoothing for the (2n+1)-grams. We find that the exemplar based algorithm achieves the best rate of precision.
Keywords :
Bayes methods; computational linguistics; natural language processing; Arabic language; Arabic words; disambiguation quality; exemplar based algorithm; linguistic data; naïve Bayes algorithm; nonannotated corpus; supervised algorithm; supervised lexical disambiguation method; Smoothing methods; Decision list; Exemplar based algorithm; Supervised algorithms; Training data; Window size; naïve Bayes;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information and Communication Technology and Accessibility (ICTA), 2013 Fourth International Conference on
Conference_Location :
Hammamet
Type :
conf
DOI :
10.1109/ICTA.2013.6815307
Filename :
6815307
Link To Document :
بازگشت