• DocumentCode
    3290576
  • Title

    Automatic detection of gender on the blogs

  • Author

    Belbachir, Faiza ; Henni, Khadidja ; Zaoui, Lynda

  • Author_Institution
    LSSD Lab., USTO-MB, France
  • fYear
    2013
  • fDate
    27-30 May 2013
  • Firstpage
    1
  • Lastpage
    4
  • Abstract
    In this paper, we are interested in defining the gender of blogger while using only texts written from bloggers. For that purpose, we offer a number of features based on specific words, which were categorized into classes. For each blog, a score is calculated based on these characteristics, thereby determining the gender of its author. The evaluation was made on a corpus of 681,288 Blogs (140 million words) tagged as men or women. In our work, this collection will be taken as a reference. The obtained results show gender detection over 82% compared to the referenced collection.
  • Keywords
    Web sites; gender issues; automatic gender detection; blogs; specific words; Blogs; Dictionaries; Games; Grammar; Internet; Laboratories; Information retrieval; blogs; gender detection; social network;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Systems and Applications (AICCSA), 2013 ACS International Conference on
  • Conference_Location
    Ifrane
  • ISSN
    2161-5322
  • Type

    conf

  • DOI
    10.1109/AICCSA.2013.6616510
  • Filename
    6616510