DocumentCode
3071543
Title
Authorship attribution using committee machines with k-nearest neighbors rated voting
Author
Kusakci, Ali Osman
Author_Institution
Ind. Eng. Dept., Int. Univ. of Sarajevo, Sarajevo, Bosnia-Herzegovina
fYear
2012
fDate
20-22 Sept. 2012
Firstpage
161
Lastpage
166
Abstract
Authorship attribution, namely determination of the author of a text, may become an extraordinarily complex and sensitive job due to its relatively difficult feature extraction phase and highly nonlinear nature. This paper proposes a classification tool using committee machines consisting of multilayered perceptron neural networks (MLP) to identify the author of a text. Each expert is an individual MLP learning complex input-output relation composed of 14 lexical, stylometric attributes extracted from the corpus. The resulting mapping after training is used to identify the texts in German language written by two different authors. Unlike other committee based classification tools individual answers of the experts are combined with a novel voting method, k-nearest neighbors rated voting. The proposed method shows very promising results when benchmarked with simple majority voting technique.
Keywords
feature extraction; learning (artificial intelligence); multilayer perceptrons; natural language processing; pattern classification; text analysis; German language; MLP learning complex input-output relation; authorship attribution; classification tool; committee based classification tools; committee machines; feature extraction phase; k-nearest neighbor rated voting; k-nearest neighbors rated voting; multilayered perceptron neural networks; sensitive job; stylometric attribute extraction; Accuracy; Artificial neural networks; Collaboration; Feature extraction; Neurons; Training; Training data; Artificial neural networks; author identification; committee machines; k-nearest neighbors rated voting;
fLanguage
English
Publisher
ieee
Conference_Titel
Neural Network Applications in Electrical Engineering (NEUREL), 2012 11th Symposium on
Conference_Location
Belgrade
Print_ISBN
978-1-4673-1569-2
Type
conf
DOI
10.1109/NEUREL.2012.6419997
Filename
6419997
Link To Document