Title :
Principal component analysis neural network for textual document categorization and dimension reduction
Author :
Jaffali, Soufiene ; Jamoussi, Salma
Author_Institution :
Syst. & Adv. Comput. Lab., Univ. of Sfax, Sfax, Tunisia
Abstract :
This manuscript presents the study and application of the method of principal component analysis (PCA) in the field of text mining. We began by studying the theoretical basis behind this method and we have focused on two of its variants namely the neural PCA and kernel PCA. We used neural PCA for automatic categorization of text documents through an extraction of semantic concepts. The second contribution of our work is the use of PCA (neuronal and kernel) for the dimension reduction of textual documents through the automatic classification.
Keywords :
data mining; data reduction; neural nets; principal component analysis; text analysis; automatic classification; automatic text documents categorization; dimension reduction; kernel PCA; neural PCA; principal component analysis neural network; semantic concepts extraction; text mining; textual document categorization; Covariance matrix; Eigenvalues and eigenfunctions; Electronic mail; Kernel; Neurons; Principal component analysis; Vectors;
Conference_Titel :
Sciences of Electronics, Technologies of Information and Telecommunications (SETIT), 2012 6th International Conference on
Conference_Location :
Sousse
Print_ISBN :
978-1-4673-1657-6
DOI :
10.1109/SETIT.2012.6482024