DocumentCode :
1611460
Title :
Principal component analysis neural network for textual document categorization and dimension reduction
Author :
Jaffali, Soufiene ; Jamoussi, Salma
Author_Institution :
Syst. & Adv. Comput. Lab., Univ. of Sfax, Sfax, Tunisia
fYear :
2012
Firstpage :
835
Lastpage :
839
Abstract :
This manuscript presents the study and application of the method of principal component analysis (PCA) in the field of text mining. We began by studying the theoretical basis behind this method and we have focused on two of its variants namely the neural PCA and kernel PCA. We used neural PCA for automatic categorization of text documents through an extraction of semantic concepts. The second contribution of our work is the use of PCA (neuronal and kernel) for the dimension reduction of textual documents through the automatic classification.
Keywords :
data mining; data reduction; neural nets; principal component analysis; text analysis; automatic classification; automatic text documents categorization; dimension reduction; kernel PCA; neural PCA; principal component analysis neural network; semantic concepts extraction; text mining; textual document categorization; Covariance matrix; Eigenvalues and eigenfunctions; Electronic mail; Kernel; Neurons; Principal component analysis; Vectors;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Sciences of Electronics, Technologies of Information and Telecommunications (SETIT), 2012 6th International Conference on
Conference_Location :
Sousse
Print_ISBN :
978-1-4673-1657-6
Type :
conf
DOI :
10.1109/SETIT.2012.6482024
Filename :
6482024
Link To Document :
بازگشت