• DocumentCode
    175879
  • Title

    Discriminating gender on Chinese microblog: A study of online behaviour, writing style and preferred vocabulary

  • Author

    Li Li ; Maosong Sun ; Zhiyuan Liu

  • Author_Institution
    Dept. of Comput. Sci. & Technol., Tsinghua Univ., Beijing, China
  • fYear
    2014
  • fDate
    19-21 Aug. 2014
  • Firstpage
    812
  • Lastpage
    817
  • Abstract
    As user attributes are useful for applications such as personalized recommendation, adverting and so on, user attribute predication on Twitter has attracted intensive attentions in recent years. Although Chinese micro-blogging services are different from Twitter on various aspects such as language, user behaviours and so on, few efforts have been made on Chinese micro-blogging services. In this paper, we propose a gender prediction model for Chinese microblog which exploits features including online behaviour, writing style, and preferred vocabulary. Experimental results on Sina Weibo, which is one of the most popular micro-blogging services in China, show that our model achieves the state-of-the-art accuracy 94.3%. We also find significant distinctions between male and female microblog users on online behaviour, writing style and preferred vocabulary, which would be helpful for improving personalized applications.
  • Keywords
    Internet; Web sites; gender issues; vocabulary; Chinese microblogging services; Twitter; gender prediction model; online behaviour; personalized recommendation; preferred vocabulary; writing style; Accuracy; Fans; Feature extraction; Predictive models; Twitter; Vocabulary; Writing; Chinese microblog; gender prediction; user behaviour analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Computation (ICNC), 2014 10th International Conference on
  • Conference_Location
    Xiamen
  • Print_ISBN
    978-1-4799-5150-5
  • Type

    conf

  • DOI
    10.1109/ICNC.2014.6975942
  • Filename
    6975942