DocumentCode :
564876
Title :
Arabie text classification using Learning Vector Quantization
Author :
Azara, Mohammed ; Fatayer, Tamer ; El-Halees, Alaa
Author_Institution :
Fac. Inf. Technol., Islamic Univ., Gaza City, Palestinian Authority
fYear :
2012
fDate :
14-16 May 2012
Abstract :
One of the several benefits of text classification is to automatically assign document in predefined category. Researchers using LVQ algorithm in English and Persian [1, 2] and don´t be attention for Arabic language. So in our research, we used neural network approach for classify Arabic text by using Learning Vector Quantization (LVQ) algorithm. This algorithm is based on Kohonen self organizing map (SOM) that is able to organize big-size document collections according to textual similarities. Also, LVQ algorithm requires less training examples and its faster than other classification methods. We select Arabic documents from different domains. After that we select suitable preprocessing methods such as term weighting schemes, and Arabic morphological analysis (stemming and light stemming), these preprocessing prepared dataset that need for classification. Then, we compared the results obtained from different LVQ improvement versions (LVQ2.1, LVQ3, OLVQ1 and OLVQ3). The results showed that the LVQ´s algorithms especially LVQ2.1 algorithm achieved high accuracy and less time compared to other LVQ´s algorithms.
Keywords :
Accuracy; Artificial neural networks; Classification algorithms; Educational institutions; Text categorization; Vector quantization; Vectors; ANN; Arabic Text Categorization; Arabic Text Classification; Arabic Text Mining; Artificial Neural Network; LVQ; Learning Vector Quantization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Informatics and Systems (INFOS), 2012 8th International Conference on
Conference_Location :
Giza, Egypt
Print_ISBN :
978-1-4673-0828-1
Type :
conf
Filename :
6236606
Link To Document :
بازگشت