• DocumentCode
    245067
  • Title

    Bayesian Heteroskedastic Choice Modeling on Non-identically Distributed Linkages

  • Author

    Liang Hu ; Wei Cao ; Jian Cao ; Guandong Xu ; Longbing Cao ; Zhiping Gu

  • Author_Institution
    Shanghai Jiaotong Univ., Shanghai, China
  • fYear
    2014
  • fDate
    14-17 Dec. 2014
  • Firstpage
    851
  • Lastpage
    856
  • Abstract
    Choice modeling (CM) aims to describe and predict choices according to attributes of subjects and options. If we presume each choice making as the formation of link between subjects and options, immediately CM can be bridged to link analysis and prediction (LAP) problem. However, such a mapping is often not trivial and straightforward. In LAP problems, the only available observations are links among objects but their attributes are often inaccessible. Therefore, we extend CM into a latent feature space to avoid the need of explicit attributes. Moreover, LAP is usually based on binary linkage assumption that models observed links as positive instances and unobserved links as negative instances. Instead, we use a weaker assumption that treats unobserved links as pseudo negative instances. Furthermore, most subjects or options may be quite heterogeneous due to the long-tail distribution, which is failed to capture by conventional LAP approaches. To address above challenges, we propose a Bayesian heteroskedastic choice model to represent the non-identically distributed linkages in the LAP problems. Finally, the empirical evaluation on real-world datasets proves the superiority of our approach.
  • Keywords
    belief networks; data analysis; statistical distributions; Bayesian heteroskedastic choice modeling; CM; LAP problems; link analysis and prediction; long-tail distribution; nonidentically distributed linkages; pseudo negative instances; Adaptation models; Bayes methods; Biological system modeling; Couplings; Data models; Predictive models; Vectors; heteroskedastic choice model; link analysis and prediction; non-IID Bayesian analysis; parallel Gibbs sampling;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2014 IEEE International Conference on
  • Conference_Location
    Shenzhen
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4799-4303-6
  • Type

    conf

  • DOI
    10.1109/ICDM.2014.84
  • Filename
    7023412