TY - GEN
T1 - From truth discovery to trustworthy opinion discovery
T2 - 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2016
AU - Wan, Mengting
AU - Chen, Xiangyu
AU - Kaplan, Lance
AU - Han, Jiawei
AU - Gao, Jing
AU - Zhao, Bo
N1 - Publisher Copyright:
© 2016 ACM.
PY - 2016/8/13
Y1 - 2016/8/13
N2 - In this era of information explosion, conficts are often encountered when information is provided by multiple sources. Traditional truth discovery task aims to identify the truth - the most trustworthy information, from conficting sources in different scenarios. In this kind of tasks, truth is regarded as a fixed value or a set of fixed values. However, in a number of real-world cases, objective truth existence cannot be ensured and we can only identify single or multiple reliable facts from opinions. Different from traditional truth discovery task, we address this uncertainty and introduce the concept of trustworthy opinion of an entity, treat it as a random variable, and use its distribution to describe consistency or controversy, which is particularly difficult for data which can be numerically measured, i.e. quantitative information. In this study, we focus on the quantitative opinion, propose an uncertainty-aware approach called Kernel Density Estimation from Multiple Sources (KDEm) to estimate its probability distribution, and summarize trustworthy information based on this distribution. Experiments indicate that KDEm not only has outstanding performance on the classical numeric truth discovery task, but also shows good performance on multi-modality detection and anomaly detection in the uncertain-opinion setting.
AB - In this era of information explosion, conficts are often encountered when information is provided by multiple sources. Traditional truth discovery task aims to identify the truth - the most trustworthy information, from conficting sources in different scenarios. In this kind of tasks, truth is regarded as a fixed value or a set of fixed values. However, in a number of real-world cases, objective truth existence cannot be ensured and we can only identify single or multiple reliable facts from opinions. Different from traditional truth discovery task, we address this uncertainty and introduce the concept of trustworthy opinion of an entity, treat it as a random variable, and use its distribution to describe consistency or controversy, which is particularly difficult for data which can be numerically measured, i.e. quantitative information. In this study, we focus on the quantitative opinion, propose an uncertainty-aware approach called Kernel Density Estimation from Multiple Sources (KDEm) to estimate its probability distribution, and summarize trustworthy information based on this distribution. Experiments indicate that KDEm not only has outstanding performance on the classical numeric truth discovery task, but also shows good performance on multi-modality detection and anomaly detection in the uncertain-opinion setting.
KW - Kernel density estimation
KW - Source reliability
KW - Truth discovery
UR - http://www.scopus.com/inward/record.url?scp=84984993582&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84984993582&partnerID=8YFLogxK
U2 - 10.1145/2939672.2939837
DO - 10.1145/2939672.2939837
M3 - Conference contribution
AN - SCOPUS:84984993582
T3 - Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
SP - 1885
EP - 1894
BT - KDD 2016 - Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
PB - Association for Computing Machinery
Y2 - 13 August 2016 through 17 August 2016
ER -