Title :
A comparative study of Neural networks architectures on Arabic text categorization using feature extraction
Author :
Harrag, Fouzi ; Al-Salman, Abdul Malik Salman ; Benmohammed, Mohammed
Author_Institution :
Comput. Sci. Dept., Farhat ABBAS Univ., Setif, Algeria
Abstract :
In this paper, we present a model based on the Neural Network (NN) for classifying Arabic texts. We propose the use of Singular Value Decomposition (SVD) as a preprocessor of NN with the aim of further reducing data in terms of both size and dimensionality. Indeed, the use of SVD makes data more amenable to classification and the convergence training process faster. Specifically, the effectiveness of the Multilayer Perceptron (MLP) and the Radial Basis Function (RBF) classifiers are implemented. Experiments are conducted using an in-house corpus of Arabic texts. Precision, recall and F-measure are used to quantify categorization effectiveness. The results show that the proposed SVD-Supported MLP/RBF ANN classifier is able to achieve high effectiveness. Experimental results also show that the MLP classifier outperforms the RBF classifier and that the SVD-supported NN classifier is better than the basic NN, as far as Arabic text categorization is concerned.
Keywords :
feature extraction; multilayer perceptrons; neural net architecture; radial basis function networks; singular value decomposition; text analysis; ANN classifier; arabic text categorization; classification; convergence; convergence training process; feature extraction; multilayer perceptron; neural network architecture; preprocessor; radial basis function classifier; singular value decomposition; Accuracy; Artificial neural networks; Computational modeling; Matrix decomposition; Support vector machine classification; Text categorization; Training; Arabic Text Categorization; MLP; Natural Language Processing; Neural Network; RBF; Singular Value Decomposition;
Conference_Titel :
Machine and Web Intelligence (ICMWI), 2010 International Conference on
Conference_Location :
Algiers
Print_ISBN :
978-1-4244-8608-3
DOI :
10.1109/ICMWI.2010.5648051