DocumentCode
3736956
Title
An experimental study of stylometry in Bangla literature
Author
Prapti Das;Rishmita Tasmim;Sabir Ismail
Author_Institution
Department of Computer Science & Engineering, Shahjalal University of Science & Technology, Sylhet-3114, Bangladesh
fYear
2015
Firstpage
575
Lastpage
580
Abstract
Every writer has a different style of writing of their own. By analyzing various kinds of features we can identify and specify some characteristics in a writer´s writing which is known as stylogenetics. In this paper we gathered Bangla blogs written by four different Bangladeshi writers. Using machine learning methods we tried to identify special Stylometry features in their writing style. We analyzed various features in their writings, for example, percentage of unique words, word length, sentence length, and frequency of some parts of speech, number of suffix, frequency of first word, second word, second last word and last word of a sentence, counting average number of question marks per document, frequency of word by its position in a sentence etc. We gathered statistical data from analyzing those features and tried to find the variance among these writers using the statistical data.
Publisher
ieee
Conference_Titel
Electrical Information and Communication Technology (EICT), 2015 2nd International Conference on
Print_ISBN
978-1-4673-9256-3
Type
conf
DOI
10.1109/EICT.2015.7392018
Filename
7392018
Link To Document