DocumentCode
2454700
Title
Author attribution using a graph based representation
Author
Castillo, Esteban ; Vilarino, Darnes ; Cervantes, Ofelia ; Pinto, David
Author_Institution
Dept. of Comput. Sci., Univ. de las Americas Puebla, Puebla, Mexico
fYear
2015
fDate
25-27 Feb. 2015
Firstpage
135
Lastpage
142
Abstract
Authorship attribution is the task of determining the real author of a given anonymous document. Although different approaches exist in the literature, the problem has rarely been addressed with document representations that employ graph structures. Most research works in the literature instead handle it with simple sequences of n words (n-grams), such as bigrams and trigrams. This paper explores the use of graphs for representing document sentences. These structures are used in experiments on the authorship attribution problem. The experiments presented here attain an accuracy of approximately 79%, showing that the graph-based representation could be a way of encapsulating various levels of natural language description in a simple structure.
Keywords
graph theory; natural language processing; text analysis; anonymous document; author attribution; document sentence representation; graph based representation; graph structures; natural language descriptions; Feature extraction; Kernel; Semantics; Support vector machines; Syntactics; Topology; Writing;
fLanguage
English
Publisher
IEEE
Conference_Titel
Electronics, Communications and Computers (CONIELECOMP), 2015 International Conference on
Conference_Location
Cholula
Type
conf
DOI
10.1109/CONIELECOMP.2015.7086940
Filename
7086940