• DocumentCode
    146493
  • Title
    A pragmatic validation of stylometric techniques using BPA
  • Author
    Pateriya, Pushpendra Kumar ; Lakshmi ; Raj, Gaurav
  • Author_Institution
    Dept. of Comput. Sci., Lovely Prof. Univ., Phagwara, India
  • fYear
    2014
  • fDate
    25-26 Sept. 2014
  • Firstpage
    124
  • Lastpage
    131
  • Abstract
    There are many modes of communication, but electronic communication is currently the most prominent, and the Internet is the backbone for all of it. In digital forensics, identifying the author of a document is a major question: who the author is, what their demographic background is, and how they are linked to other documents. The major challenges in digital forensic investigation are therefore author identification of messages and non-repudiation. In this paper, we use stylometry-based extraction of human writing features as a solution to the author identification problem. Stylometry is not only a way of identifying human writing patterns; it can also be used for gender identification. This paper highlights some ways to address problems such as anonymous email messages and email abuse, including in digital forensics. In this paper, 62 stylistic features were collected for different users using the C language. For each user, 22 samples of 150 words were used to train a neural network with the Back Propagation Algorithm (BPA). Across different variations of the experimental setup, an accuracy of 98.312% was achieved.
  • Keywords
    C language; Internet; backpropagation; digital forensics; document handling; electronic mail; feature extraction; neural nets; BPA; anonymous email messages; author identification; backpropagation algorithm; electronic communication; email abuse; human gender identification; human writing pattern identification; neural network; nonrepudiation; stylistic features; stylometric technique pragmatic validation; stylometry based human writing feature extraction; MATLAB; Pragmatics; Testing; Training; Writing; Back Propagation Algorithm (BPA); Non-repudiation; Stylometry
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    2014 5th International Conference - Confluence The Next Generation Information Technology Summit (Confluence)
  • Conference_Location
    Noida
  • Print_ISBN
    978-1-4799-4237-4
  • Type
    conf
  • DOI
    10.1109/CONFLUENCE.2014.6949275
  • Filename
    6949275