• DocumentCode
    3736956
  • Title

    An experimental study of stylometry in Bangla literature

  • Author

    Prapti Das;Rishmita Tasmim;Sabir Ismail

  • Author_Institution
    Department of Computer Science & Engineering, Shahjalal University of Science & Technology, Sylhet-3114, Bangladesh
  • fYear
    2015
  • Firstpage
    575
  • Lastpage
    580
  • Abstract
    Every writer has a different style of writing of their own. By analyzing various kinds of features we can identify and specify some characteristics in a writer´s writing which is known as stylogenetics. In this paper we gathered Bangla blogs written by four different Bangladeshi writers. Using machine learning methods we tried to identify special Stylometry features in their writing style. We analyzed various features in their writings, for example, percentage of unique words, word length, sentence length, and frequency of some parts of speech, number of suffix, frequency of first word, second word, second last word and last word of a sentence, counting average number of question marks per document, frequency of word by its position in a sentence etc. We gathered statistical data from analyzing those features and tried to find the variance among these writers using the statistical data.
  • Publisher
    ieee
  • Conference_Titel
    Electrical Information and Communication Technology (EICT), 2015 2nd International Conference on
  • Print_ISBN
    978-1-4673-9256-3
  • Type

    conf

  • DOI
    10.1109/EICT.2015.7392018
  • Filename
    7392018