• DocumentCode
    621170
  • Title

    Inferring cellular user demographic information using homophily on call graphs

  • Author

    Yi Wang ; Hui Zang ; Faloutsos, Michalis

  • Author_Institution
    Univ. of California Riverside, Riverside, CA, USA
  • fYear
    2013
  • fDate
    14-19 April 2013
  • Firstpage
    211
  • Lastpage
    216
  • Abstract
    Homophily refers to the phenomenon where people who are socially-connected share many characteristics including demographic and behavioral properties. The goal of this paper is to see whether homophily exists in call networks and if so, to what degree we can infer a cellphone user´s demographic properties by knowing the demographic information of the people that s/he talks to. We focus on three types of demographic information: a) home location, b) age group, and c) income level. The novelty is two-folds. First, we use both communication metrics and structural properties of call graphs to identify those “important” friends for each user with whom (s)he is most likely to be in homophily. Second, we assess the importance of different time slices such as weekdays, or nights and weekends for capturing different user relationships. We conduct our study on a real data trace with 20M subscribers during one month from a nationwide cellular carrier. Our first contribution is that we quantify the extent of homophily on the call graph and identify the correlations between homophily and communication and structural features. As a second contribution, we develop effective methods to infer demographic information for a cellular user using linear regression to select the most homophily-like friend of her/him. We find that we can predict home location within 20km radius with 80% accuracy, and age group and income level with 78% and 72% accuracy, respectively.
  • Keywords
    cellular radio; graphs; mobile computing; social networking (online); age group; behavioral properties; call graph; call network homophily; communication metric; demographic properties; home location; important friends; income level; inferring cellular user demographic information; socially connected user; structural properties; Accuracy; Communication networks; Conferences; Correlation; Linear regression; Prediction algorithms; Social network services;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Communications Workshops (INFOCOM WKSHPS), 2013 IEEE Conference on
  • Conference_Location
    Turin
  • Print_ISBN
    978-1-4799-0055-8
  • Type

    conf

  • DOI
    10.1109/INFCOMW.2013.6562897
  • Filename
    6562897