• DocumentCode
    3089019
  • Title

    A fuzzy based approach to stylometric analysis of blogger´s age and gender

  • Author

    Goswami, Suparna ; Shishodia, Mayank Singh

  • Author_Institution
    Indian Inst. of Technol., Kharagpur, Kharagpur, India
  • fYear
    2012
  • fDate
    4-7 Dec. 2012
  • Firstpage
    47
  • Lastpage
    51
  • Abstract
    Fuzzy logic deals with partial truth. A fuzzy based approach to blog analysis, on the basis of various feature words, allows us to determine the degree to which a blogger´s style belongs to a particular age or gender group. Each blog was represented by a set of normalized word frequencies of selected feature words in it. Using membership values obtained from applying Fuzzy C-Means (FCM) algorithm to these blog representations, we can call the blogger´s style to belong weakly, fairly, strongly or very strongly to a particular class. The advantage of using fuzzy logic for this problem is that a weak belonging to a particular class means that there is a decent belonging to the other class (es). Hence when a search or query is carried out, no useful blog will be left out of the results for that other class (es).
  • Keywords
    Web sites; fuzzy logic; fuzzy set theory; pattern clustering; text analysis; FCM algorithm; age group; blog analysis; blog representation; blogger age; blogger style; bloggergender; feature words; fuzzy C-means algorithm; fuzzy based approach; fuzzy logic; gender group; membership value; normalized word frequency; stylometric analysis; Decision support systems; Helium; Hybrid intelligent systems; Mercury (metals); Rail to rail outputs; age; blog; clustering; fuzzy c-means; fuzzy logic; gender; stylometrics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Hybrid Intelligent Systems (HIS), 2012 12th International Conference on
  • Conference_Location
    Pune
  • Print_ISBN
    978-1-4673-5114-0
  • Type

    conf

  • DOI
    10.1109/HIS.2012.6421307
  • Filename
    6421307