DocumentCode
3290576
Title
Automatic detection of gender on the blogs
Author
Belbachir, Faiza ; Henni, Khadidja ; Zaoui, Lynda
Author_Institution
LSSD Lab., USTO-MB, France
fYear
2013
fDate
27-30 May 2013
Firstpage
1
Lastpage
4
Abstract
In this paper, we are interested in defining the gender of blogger while using only texts written from bloggers. For that purpose, we offer a number of features based on specific words, which were categorized into classes. For each blog, a score is calculated based on these characteristics, thereby determining the gender of its author. The evaluation was made on a corpus of 681,288 Blogs (140 million words) tagged as men or women. In our work, this collection will be taken as a reference. The obtained results show gender detection over 82% compared to the referenced collection.
Keywords
Web sites; gender issues; automatic gender detection; blogs; specific words; Blogs; Dictionaries; Games; Grammar; Internet; Laboratories; Information retrieval; blogs; gender detection; social network;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Systems and Applications (AICCSA), 2013 ACS International Conference on
Conference_Location
Ifrane
ISSN
2161-5322
Type
conf
DOI
10.1109/AICCSA.2013.6616510
Filename
6616510
Link To Document