Title :
Towards author identification of Arabic text articles
Author :
Otoom, Ahmed Fawzi ; Abdullah, Emad E. ; Jaafer, Shifaa ; Hamdallh, Aseel ; Amer, Dana
Author_Institution :
Fac. of Prince Al-Hussein Bin Abdullah II for Inf. Technol., Hashemite Univ., Zarqa, Jordan
Abstract :
We target the problem of identifying the author of an Arabic text article. Our main aim is to develop an intelligent system that is capable of classifying a new article into one of seven classes that belong to seven different authors. For this purpose, we propose a novel dataset consisting of 12 features and 456 instances belonging to the 7 authors. In addition, we combine the proposed feature set with strong classification algorithms to assist in distinguishing between the different authors. Our results show that the proposed dataset has proved successful with a classification performance accuracy of 82% with the hold-out test.
Keywords :
classification; learning (artificial intelligence); natural language processing; Arabic text article; author identification; classification algorithm; feature set; intelligent system; Accuracy; Classification algorithms; Feature extraction; Support vector machines; Syntactics; Testing; Writing; Arabic text features; authorship identification; functional trees; support vector machines;
Conference_Titel :
Information and Communication Systems (ICICS), 2014 5th International Conference on
Conference_Location :
Irbid
Print_ISBN :
978-1-4799-3022-7
DOI :
10.1109/IACS.2014.6841971