DocumentCode
175879
Title
Discriminating gender on Chinese microblog: A study of online behaviour, writing style and preferred vocabulary
Author
Li Li ; Maosong Sun ; Zhiyuan Liu
Author_Institution
Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
fYear
2014
fDate
19-21 Aug. 2014
Firstpage
812
Lastpage
817
Abstract
As user attributes are useful for applications such as personalized recommendation, adverting and so on, user attribute predication on Twitter has attracted intensive attentions in recent years. Although Chinese micro-blogging services are different from Twitter on various aspects such as language, user behaviours and so on, few efforts have been made on Chinese micro-blogging services. In this paper, we propose a gender prediction model for Chinese microblog which exploits features including online behaviour, writing style, and preferred vocabulary. Experimental results on Sina Weibo, which is one of the most popular micro-blogging services in China, show that our model achieves the state-of-the-art accuracy 94.3%. We also find significant distinctions between male and female microblog users on online behaviour, writing style and preferred vocabulary, which would be helpful for improving personalized applications.
Keywords
Internet; Web sites; gender issues; vocabulary; Chinese microblogging services; Twitter; gender prediction model; online behaviour; personalized recommendation; preferred vocabulary; writing style; Accuracy; Fans; Feature extraction; Predictive models; Twitter; Vocabulary; Writing; Chinese microblog; gender prediction; user behaviour analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Natural Computation (ICNC), 2014 10th International Conference on
Conference_Location
Xiamen
Print_ISBN
978-1-4799-5150-5
Type
conf
DOI
10.1109/ICNC.2014.6975942
Filename
6975942
Link To Document