TY - GEN
T1 - Errors, misunderstandings, and attacks
T2 - 19th ACM Internet Measurement Conference, IMC 2019
AU - Alrizah, Mshabab
AU - Xing, Xinyu
AU - Zhu, Sencun
AU - Wang, Gang
N1 - Funding Information:
We would like to thank our shepherd Georgios Smaragdakis and the anonymous reviewers for their helpful feedback. This project was in part supported by NSF grants CNS-1750101, CNS-1717028, CNS-1618684, CNS-1718459. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of any funding agencies.
Publisher Copyright:
© 2019 Association for Computing Machinery. ACM ISBN 978-1-4503-6948-0/19/10...$15.00
PY - 2019/10/21
Y1 - 2019/10/21
N2 - Ad-blocking systems such as Adblock Plus rely on crowdsourcing to build and maintain filter lists, which are the basis for determining which ads to block on web pages. In this work, we seek to advance our understanding of the ad-blocking community as well as the errors and pitfalls of the crowdsourcing process. To do so, we collected and analyzed a longitudinal dataset that covered the dynamic changes of popular filter-list EasyList for nine years and the error reports submitted by the crowd in the same period. Our study yielded a number of significant findings regarding the characteristics of FP and FN errors and their causes. For instances, we found that false positive errors (i.e., incorrectly blocking legitimate content) still took a long time before they could be discovered (50% of them took more than a month) despite the community effort. Both EasyList editors and website owners were to blame for the false positives. In addition, we found that a great number of false negative errors (i.e., failing to block real advertisements) were either incorrectly reported or simply ignored by the editors. Furthermore, we analyzed evasion attacks from ad publishers against ad-blockers. In total, our analysis covers 15 types of attack methods including 8 methods that have not been studied by the research community. We show how ad publishers have utilized them to circumvent ad-blockers and empirically measure the reactions of ad blockers. Through in-depth analysis, our findings are expected to help shed light on any future work to evolve ad blocking and optimize crowdsourcing mechanisms.
AB - Ad-blocking systems such as Adblock Plus rely on crowdsourcing to build and maintain filter lists, which are the basis for determining which ads to block on web pages. In this work, we seek to advance our understanding of the ad-blocking community as well as the errors and pitfalls of the crowdsourcing process. To do so, we collected and analyzed a longitudinal dataset that covered the dynamic changes of popular filter-list EasyList for nine years and the error reports submitted by the crowd in the same period. Our study yielded a number of significant findings regarding the characteristics of FP and FN errors and their causes. For instances, we found that false positive errors (i.e., incorrectly blocking legitimate content) still took a long time before they could be discovered (50% of them took more than a month) despite the community effort. Both EasyList editors and website owners were to blame for the false positives. In addition, we found that a great number of false negative errors (i.e., failing to block real advertisements) were either incorrectly reported or simply ignored by the editors. Furthermore, we analyzed evasion attacks from ad publishers against ad-blockers. In total, our analysis covers 15 types of attack methods including 8 methods that have not been studied by the research community. We show how ad publishers have utilized them to circumvent ad-blockers and empirically measure the reactions of ad blockers. Through in-depth analysis, our findings are expected to help shed light on any future work to evolve ad blocking and optimize crowdsourcing mechanisms.
UR - http://www.scopus.com/inward/record.url?scp=85074855325&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85074855325&partnerID=8YFLogxK
U2 - 10.1145/3355369.3355588
DO - 10.1145/3355369.3355588
M3 - Conference contribution
AN - SCOPUS:85074855325
T3 - Proceedings of the ACM SIGCOMM Internet Measurement Conference, IMC
SP - 230
EP - 244
BT - IMC 2019 - Proceedings of the 2019 ACM Internet Measurement Conference
PB - Association for Computing Machinery
Y2 - 21 October 2019 through 23 October 2019
ER -