Title :
Author attribution using a graph based representation
Author :
Castillo, Esteban ; Vilarino, Darnes ; Cervantes, Ofelia ; Pinto, David
Author_Institution :
Dept. of Comput. Sci., Univ. de las Americas Puebla, Puebla, Mexico
Abstract :
Authorship attribution is the task of determining the real author of a given anonymous document. Although different approaches exist in the literature, this problem has rarely been addressed with document representations that employ graph structures. Instead, most research works handle it with simple sequences of n words (n-grams), such as bigrams and trigrams. This paper explores the use of graphs for representing document sentences and applies these structures in experiments on authorship attribution. The experiments attain an accuracy of approximately 79%, showing that the graph-based representation could be a way of encapsulating various levels of natural language description in a simple structure.
Keywords :
graph theory; natural language processing; text analysis; anonymous document; author attribution; document sentence representation; graph based representation; graph structures; natural language descriptions; Feature extraction; Kernel; Semantics; Support vector machines; Syntactics; Topology; Writing;
Conference_Title :
Electronics, Communications and Computers (CONIELECOMP), 2015 International Conference on
Conference_Location :
Cholula
DOI :
10.1109/CONIELECOMP.2015.7086940