DocumentCode :
3539057
Title :
Neural Network for Arabic text classification
Author :
Harrag, Fouzi ; El-Qawasmah, Eyas
Author_Institution :
Comput. Sci. Dept., Farhat ABBAS Univ., Setif, Algeria
fYear :
2009
fDate :
4-6 Aug. 2009
Firstpage :
778
Lastpage :
783
Abstract :
This paper proposes the application of Artificial Neural Network for the classification of Arabic language documents. The automatic classification of Arabic documents using ANN has not been explored in detail so far. In this paper, an Arabic corpus is used to construct and test the ANN model. Methods of document representation, assigning weights that reflect the importance of each term are discussed. Each Arabic document is represented by the term weighting scheme. As the number of unique words in the collection set is big, the Singular Value Decomposition (SVD) has been used to select the most relevant features for the classification. The experimental results show that ANN model using SVD achieves 88.33% which is better than the performance of basic ANN which yields 85.75% on Arabic document classification.
Keywords :
classification; natural language processing; neural nets; singular value decomposition; text analysis; ANN; Arabic corpus; Arabic language documents; Arabic text classification; SVD; artificial neural network; automatic classification; document representation; singular value decomposition; unique words; weight assignment; Artificial neural networks; Computer science; Machine learning algorithms; Matrix decomposition; Natural languages; Neural networks; Singular value decomposition; Support vector machine classification; Support vector machines; Text categorization; Arabic Language; Neural Network; Singular Value Decomposition; Text Categorization;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Applications of Digital Information and Web Technologies, 2009. ICADIWT '09. Second International Conference on the
Conference_Location :
London
Print_ISBN :
978-1-4244-4456-4
Electronic_ISBN :
978-1-4244-4457-1
Type :
conf
DOI :
10.1109/ICADIWT.2009.5273841
Filename :
5273841
Link To Document :
بازگشت