• DocumentCode
    3045238
  • Title
    On Identifying Authors with Style
  • Author
    Stuart, Lauren M. ; Tazhibayeva, Saltanat ; Wagoner, Amy R. ; Taylor, J.M.
  • Author_Institution
    CERIAS (Center for Education and Research in Information Assurance and Security), Purdue University, West Lafayette, IN, USA
  • fYear
    2013
  • fDate
    13-16 Oct. 2013
  • Firstpage
    3048
  • Lastpage
    3053
  • Abstract
    Stylometry is the quantified (often statistical) analysis of author style as a set of (usually morphosyntactic) features expressed in several documents by the author. The focus of this paper is a task to which stylometry is often applied: authorship attribution, the problem of identifying or confirming the author of a text based on a known body of work. We analyze a feature set previously introduced in the field, using a tool and corpus already available. Decomposing the set, we identify the features that seem to have contributed the most to accurate performance. In re-composing the set under different objectives (first for English-only document sets, and then for possible multi-language use), we identify smaller feature combinations that work well together and yield accurate performance. We then outline our continuing work based on the results we obtain.
  • Keywords
    document handling; statistical analysis; English-only document sets; author style identification; authorship attribution; morphosyntactic features; multilanguage use; quantified analysis; stylometry; Accuracy; Complexity theory; Diamonds; Error analysis; Measurement uncertainty; Writing; stylistics
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    Systems, Man, and Cybernetics (SMC), 2013 IEEE International Conference on
  • Conference_Location
    Manchester
  • Type
    conf
  • DOI
    10.1109/SMC.2013.520
  • Filename
    6722273