On Bayesian interpretation of fact-finding in information networks

Dong Wang, Tarek Abdelzaher, Hossein Ahmadi, Jeff Pasternack, Dan Roth, Manish Gupta, Jiawei Han, Omid Fatemieh, Hieu Le, Charu C. Aggarwal

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

When information sources are unreliable, information networks have been used in data mining literature to uncover facts from large numbers of complex relations between noisy variables. The approach relies on topology analysis of graphs, where nodes represent pieces of (unreliable) information and links represent abstract relations. Such topology analysis was often empirically shown to be quite powerful in extracting useful conclusions from large amounts of poor-quality information. However, no systematic analysis was proposed for quantifying the accuracy of such conclusions. In this paper, we present, for the first time, a Bayesian interpretation of the basic mechanism used in fact-finding from information networks. This interpretation leads to a direct quantification of the accuracy of conclusions obtained from information network analysis. Hence, we provide a general foundation for using information network analysis not only to heuristically extract likely facts, but also to quantify, in an analytically-founded manner, the probability that each fact or source is correct. Such probability constitutes a measure of quality of information (QoI). Hence, the paper presents a new foundation for QoI analysis in information networks, that is of great value in deriving information from unreliable sources. The framework is applied to a representative fact-finding problem, and is validated by extensive simulation where analysis shows significant improvement over past work and great correspondence with ground truth.

Original languageEnglish (US)
Title of host publicationFusion 2011 - 14th International Conference on Information Fusion
StatePublished - 2011
Event14th International Conference on Information Fusion, Fusion 2011 - Chicago, IL, United States
Duration: Jul 5 2011Jul 8 2011

Publication series

NameFusion 2011 - 14th International Conference on Information Fusion

Other

Other14th International Conference on Information Fusion, Fusion 2011
Country/TerritoryUnited States
CityChicago, IL
Period7/5/117/8/11

Keywords

  • Bayesian inference
  • Information networks
  • Sensors

ASJC Scopus subject areas

  • Information Systems

Fingerprint

Dive into the research topics of 'On Bayesian interpretation of fact-finding in information networks'. Together they form a unique fingerprint.

Cite this