Title of article :
AUTHORSHIP ATTRIBUTION USING PRINCIPAL COMPONENT ANALYSIS AND COMPETITIVE NEURAL NETWORKS
Author/Authors :
Can, Mehmet International University of Sarajevo - Faculty of Engineering and Natural Sciences Hrasniæka Cesta, Bosnia and Herzegovina
From page :
21
To page :
36
Abstract :
Feature extraction is a common problem in statistical pattern recognition. It refers to a process whereby a data space is transformed into a feature space that, in theory, has exactly the same dimension as the original data space. However, the transformation is designed in such a way that the data set may be represented by a reduced number of effective features and yet retain most of the intrinsic information content of the data; in other words, the data set undergoes a dimensionality reduction. Principal component analysis is one of these processes. In this paper the data collected by counting selected syntactic characteristics in around a thousand paragraphs of each of the sample books underwent a principal component analysis. Authors of texts identified by the competitive neural networks, which use these effective features.
Keywords :
principal components , authorship attribution , stylometry , text categorization , stylistic features , syntactic characteristics , multilayer preceptor , competitive learning , artificial neural network.
Journal title :
mathematical and computational applications
Journal title :
mathematical and computational applications
Record number :
2569213
Link To Document :
بازگشت