DocumentCode :
3071543
Title :
Authorship attribution using committee machines with k-nearest neighbors rated voting
Author :
Kusakci, Ali Osman
Author_Institution :
Ind. Eng. Dept., Int. Univ. of Sarajevo, Sarajevo, Bosnia-Herzegovina
fYear :
2012
fDate :
20-22 Sept. 2012
Firstpage :
161
Lastpage :
166
Abstract :
Authorship attribution, namely determination of the author of a text, may become an extraordinarily complex and sensitive job due to its relatively difficult feature extraction phase and highly nonlinear nature. This paper proposes a classification tool using committee machines consisting of multilayered perceptron neural networks (MLP) to identify the author of a text. Each expert is an individual MLP learning complex input-output relation composed of 14 lexical, stylometric attributes extracted from the corpus. The resulting mapping after training is used to identify the texts in German language written by two different authors. Unlike other committee based classification tools individual answers of the experts are combined with a novel voting method, k-nearest neighbors rated voting. The proposed method shows very promising results when benchmarked with simple majority voting technique.
Keywords :
feature extraction; learning (artificial intelligence); multilayer perceptrons; natural language processing; pattern classification; text analysis; German language; MLP learning complex input-output relation; authorship attribution; classification tool; committee based classification tools; committee machines; feature extraction phase; k-nearest neighbor rated voting; k-nearest neighbors rated voting; multilayered perceptron neural networks; sensitive job; stylometric attribute extraction; Accuracy; Artificial neural networks; Collaboration; Feature extraction; Neurons; Training; Training data; Artificial neural networks; author identification; committee machines; k-nearest neighbors rated voting;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Network Applications in Electrical Engineering (NEUREL), 2012 11th Symposium on
Conference_Location :
Belgrade
Print_ISBN :
978-1-4673-1569-2
Type :
conf
DOI :
10.1109/NEUREL.2012.6419997
Filename :
6419997
Link To Document :
بازگشت