TY - GEN
T1 - Gauging the internet doctor
T2 - Workshop on Data Mining for Medicine and HealthCare, DMMH'11 - Held with the KDD Conference, the 17th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD-2011
AU - Vinod Vydiswaran, V. G.
AU - Zhai, Cheng Xiang
AU - Roth, Dan
PY - 2011
Y1 - 2011
N2 - As more and more content is published and consumed online, it is imperative to know if an information nugget found on the Web is trustworthy or not. This is especially important for online medical information as it affects the most vulnerable group of users looking for medical help online. In this paper, we study the feasibility of automatically assessing the trustworthiness of a medical claim based on community knowledge, and propose techniques to assign a reliability score for an information nugget based on support over a community-generated collection. Specifically, we model the trustworthiness of a medical claim based on experiences shared by users in health forums and mailing lists. The proposed claim scores can be used to rank related claims on their relative trustworthiness. We further extend the notion of trustworthiness to a site (or equivalently, a database of claims from the site) and propose a scheme to rank sites based on aggregating the trust scores of claims from the site. Our experiments show that community knowledge can be exploited to help users distinguish reliable medical claims from unreliable ones. The proposed techniques can be applied to other domains where similar corpora are available.
AB - As more and more content is published and consumed online, it is imperative to know if an information nugget found on the Web is trustworthy or not. This is especially important for online medical information as it affects the most vulnerable group of users looking for medical help online. In this paper, we study the feasibility of automatically assessing the trustworthiness of a medical claim based on community knowledge, and propose techniques to assign a reliability score for an information nugget based on support over a community-generated collection. Specifically, we model the trustworthiness of a medical claim based on experiences shared by users in health forums and mailing lists. The proposed claim scores can be used to rank related claims on their relative trustworthiness. We further extend the notion of trustworthiness to a site (or equivalently, a database of claims from the site) and propose a scheme to rank sites based on aggregating the trust scores of claims from the site. Our experiments show that community knowledge can be exploited to help users distinguish reliable medical claims from unreliable ones. The proposed techniques can be applied to other domains where similar corpora are available.
KW - Forum credibility
KW - Relation retrieval
KW - Trustworthiness
UR - http://www.scopus.com/inward/record.url?scp=80053196181&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=80053196181&partnerID=8YFLogxK
U2 - 10.1145/2023582.2023589
DO - 10.1145/2023582.2023589
M3 - Conference contribution
AN - SCOPUS:80053196181
SN - 9781450308434
T3 - Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
SP - 42
EP - 51
BT - Workshop on Data Mining for Medicine and HealthCare, DMMH'11 - Held with the KDD Conference, the 17th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD-2011
Y2 - 21 August 2011 through 21 August 2011
ER -