• DocumentCode
    549105
  • Title

    On Bayesian interpretation of fact-finding in information networks

  • Author

    Wang, Dong ; Abdelzaher, Tarek ; Ahmadi, Hossein ; Pasternack, Jeff ; Roth, Dan ; Gupta, Manish ; Han, Jiawei ; Fatemieh, Omid ; Le, Hieu ; Aggarwal, Charu C.

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Illinois, Urbana, IL, USA
  • fYear
    2011
  • fDate
    5-8 July 2011
  • Firstpage
    1
  • Lastpage
    8
  • Abstract
    When information sources are unreliable, information networks have been used in data mining literature to uncover facts from large numbers of complex relations between noisy variables. The approach relies on topology analysis of graphs, where nodes represent pieces of (unreliable) information and links represent abstract relations. Such topology analysis was often empirically shown to be quite powerful in extracting useful conclusions from large amounts of poor-quality information. However, no systematic analysis was proposed for quantifying the accuracy of such conclusions. In this paper, we present, for the first time, a Bayesian interpretation of the basic mechanism used in fact-finding from information networks. This interpretation leads to a direct quantification of the accuracy of conclusions obtained from information network analysis. Hence, we provide a general foundation for using information network analysis not only to heuristically extract likely facts, but also to quantify, in an analytically-founded manner, the probability that each fact or source is correct. Such probability constitutes a measure of quality of information (QoI). Hence, the paper presents a new foundation for QoI analysis in information networks, that is of great value in deriving information from unreliable sources. The framework is applied to a representative fact-finding problem, and is validated by extensive simulation where analysis shows significant improvement over past work and great correspondence with ground truth.
  • Keywords
    Bayes methods; data mining; information networks; probability; Bayesian interpretation; abstract relations; data mining literature; fact-finding; information network analysis; information sources; noisy variables; probability; quality of information; topology analysis; Accuracy; Bayesian methods; Data mining; Equations; Mathematical model; Network topology; Silicon; Bayesian inference; Information networks; sensors;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Fusion (FUSION), 2011 Proceedings of the 14th International Conference on
  • Conference_Location
    Chicago, IL
  • Print_ISBN
    978-1-4577-0267-9
  • Type

    conf

  • Filename
    5977540