skip to main content
10.1145/3131365.3131387acmconferencesArticle/Chapter ViewAbstractPublication PagesimcConference Proceedingsconference-collections
research-article

The ad wars: retrospective measurement and analysis of anti-adblock filter lists

Published: 01 November 2017 Publication History

Abstract

The increasing popularity of adblockers has prompted online publishers to retaliate against adblock users by deploying anti-adblock scripts, which detect adblock users and bar them from accessing content unless they disable their adblocker. To circumvent anti-adblockers, adblockers rely on manually curated anti-adblock filter lists for removing anti-adblock scripts. Anti-adblock filter lists currently rely on informal crowdsourced feedback from users to add/remove filter list rules. In this paper, we present the first comprehensive study of anti-adblock filter lists to analyze their effectiveness against anti-adblockers. Specifically, we compare and contrast the evolution of two popular anti-adblock filter lists. We show that these filter lists are implemented very differently even though they currently have a comparable number of filter list rules. We then use the Internet Archive's Wayback Machine to conduct a retrospective coverage analysis of these filter lists on Alexa top-5K websites over the span of last five years. We find that the coverage of these filter lists has considerably improved since 2014 and they detect anti-adblockers on about 9% of Alexa top-5K websites. To improve filter list coverage and speedup addition of new filter rules, we also design and implement a machine learning based method to automatically detect anti-adblock scripts using static JavaScript code analysis.

References

[1]
Acceptable ads program. https://adblockplus.org/acceptable-ads.
[2]
AdBlock. https://getadblock.com/.
[3]
AdBlock, Chrome web store. https://chrome.google.com/webstore/detail/adblock/gighmmpiobklfepjocnamgkkbiglidom?hl=en-US.
[4]
AdblockPlus. https://adblockplus.org/.
[5]
Adblock Plus, Chrome web store. https://chrome.google.com/webstore/detail/adblock-plus/cfhdojbkjhnklbpkdaibdccddilifddb.
[6]
Adblock Plus, Mozilla Firefox add-on. https://addons.mozilla.org/en-US/firefox/addon/adblock-plus/.
[7]
Adblock rules list parser. https://github.com/shawa/adblockparser.
[8]
Anti-Adblock Killer. https://github.com/reek/anti-adblock-killer.
[9]
Anti-Adblock Killer List Forum. https://github.com/reek/anti-adblock-killer/issues.
[10]
BlockAdBlock. https://github.com/sitexw/BlockAdBlock/blob/master/blockadblock.js.
[11]
Blockzilla. https://zpacman.github.io/Blockzilla/.
[12]
Brave Browser. https://brave.com/.
[13]
Cliqz Browser. https://cliqz.com/us/.
[14]
Coalition for Better Ads. https://www.betterads.org/.
[15]
Disconnect.me filter list. https://disconnect.me/.
[16]
EasyList. https://easylist.to/.
[17]
EasyList Forum. https://forums.lanik.us.
[18]
EasyList variants. https://easylist.to/pages/other-supplementary-filter-lists-and-easylist-variants.html.
[19]
Fanboy's Enhanced Tracking List. https://fanboy.co.nz/.
[20]
Filter lists syntax. https://adblockplus.org/en/filter-cheatsheet.
[21]
Firebug. http://getfirebug.com/.
[22]
Ghostery. https://www.ghostery.com/.
[23]
Ghostery, Chrome web store. https://chrome.google.com/webstore/detail/ghostery/mlomiejdfkolichcflejclcbmpeaniij?hl=en-US.
[24]
Ghostery, Mozilla Firefox add-on. https://addons.mozilla.org/en-US/firefox/addon/ghostery/.
[25]
HAR File. https://en.wikipedia.org/wiki/.har.
[26]
McAfee's URL categorization service. https://www.trustedsource.org/.
[27]
Mozilla Firefox. https://www.mozilla.org/en-US/firefox/.
[28]
Mozilla Firefox tracker blocking. https://testpilot.firefox.com/experiments/tracking-protection.
[29]
NetExport. https://getfirebug.com/wiki/index.php/Firebug_Extensions.
[30]
NoTrack Blocklist. https://github.com/quidsup/notrack.
[31]
PageFair, 2017 Global Adblock Report. https://pagefair.com/downloads/2017/01/PageFair-2017--Adblock-Report.pdf.
[32]
Privacy Badger. https://www.eff.org/privacybadger.
[33]
Privacy Badger, Chrome web store. https://chrome.google.com/webstore/detail/privacy-badger/pkehgijcmpdhfbdbbnkijodmdjhbjlgp?hl=en-US.
[34]
Privacy Badger, Mozilla Firefox add-on. https://addons.mozilla.org/en-US/firefox/addon/privacy-badger17/.
[35]
Selenium. http://docs.seleniumhq.org/.
[36]
Truth In Advertising, Federal Trade Commission. https://www.ftc.gov/news-events/media-resources/truth-advertising/.
[37]
Warning removal list. https://easylist-downloads.adblockplus.org/antiadblockfilters.txt.
[38]
Wayback Machine. https://archive.org/web/.
[39]
Wayback Machine API. https://archive.org/help/wayback_api.php.
[40]
Wayback Machine Archive Details. https://archive.org/about/.
[41]
YourAdChoices. http://youradchoices.com/.
[42]
A. Bosworth. A New Way to Control the Ads You See on Facebook, and an Update on Ad Blocking. https://newsroom.fb.com/news/2016/08/a-new-way-to-control-the-ads-you-see-on-facebook-and-an-update-on-ad-blocking/, 2016.
[43]
C. Curtsinger, B. Livshits, B. Zorn, and C. Seifert. ZOZZLE: Fast and Precise In-Browser JavaScript Malware Detection. In USENIX Security Symposium, 2011.
[44]
S. Englehardt and A. Narayanan. Online Tracking: A 1-million-site Measurement and Analysis. In ACM Conference on Computer and Communications Security (CCS), 2016.
[45]
S. Englehardt, D. Reisman, C. Eubank, P. Zimmerman, J. Mayer, A. Narayanan, and E. W. Felten. Cookies That Give You Away: The Surveillance Implications of Web Tracking. In World Wide Web (WWW) Conference, 2015.
[46]
Y. Freund and R. E. Schapire. A decision-theoretic generalization of on-line learning and an application to boosting. In Journal of Computer and System Sciences, 1997.
[47]
M. Graham. robots.txt meant for search engines don't work well for web archives. https://blog.archive.org/2017/04/17/robots-txt-meant-for-search-engines-dont- work- well- for-web-archives/, 2017.
[48]
D. Gugelmann, M. Happe, B. Ager, and V. Lenders. An Automated Approach for Complementing Ad Blockers' Blacklists. In Privacy Enhancing Technologies Symposium (PETS), 2015.
[49]
S. Ihm and V. S. Pai. Towards Understanding Modern Web Traffic. In ACM Internet Measurement Conference (IMC), 2011.
[50]
M. Ikram, H. J. Asghar, M. A. Kaafar, A. Mahanti, and B. Krishnamurthy. Towards Seamless Tracking-Free Web:Improved Detection of Trackers via One-class Learning. In Privacy Enhancing Technologies Symposium (PETS), 2017.
[51]
B. Krishnamurthy and C. E. Wills. Privacy Diffusion on the Web: A Longitudinal Perspective. In World Wide Web (WWW) Conference, 2009.
[52]
A. Lerner, T. Kohno, and F. Roesner. Rewriting History: Changing the Archived Web from the Present. In ACM Conference on Computer and Communications Security (CCS), 2017.
[53]
A. Lerner, A. K. Simpson, T. Kohno, and F. Roesner. Internet Jones and the Raiders of the Lost Trackers: An Archaeological Study of Web Tracking from 1996 to 2016. In USENIX Security Symposium, 2016.
[54]
X. Li, L. Wang, and E. Sung. AdaBoost with SVM-based component classifiers. In Engineering Applications of Artificial Intelligence, 2007.
[55]
M. Malloy, M. McNamara, A. Cahn, and P. Barford. Ad Blockers: Global Prevalence and Impact. In ACM Internet Measurement Conference (IMC), 2016.
[56]
J. Marshall. The Rise of the Anti-Ad Blockers. https://www.wsj.com/articles/the-rise-of-the-anti-ad-blockers-1465805039, 2016.
[57]
J. R. Mayer and J. C. Mitchell. Third-Party Web Tracking: Policy and Technology. In IEEE Symposium on Security and Privacy, 2012.
[58]
G. Merzdovnik, M. Huber, D. Buhov, N. Nikiforakis, S. Neuner, M. Schmiedecker, and E. Weippl. Block Me If You Can: A Large-Scale Study of Tracker-Blocking Tools. In IEEE European Symposium on Security and Privacy, 2017.
[59]
M. H. Mughees, Z. Qian, and Z. Shafiq. Detecting Anti Ad-blockers in the Wild. In Privacy Enhancing Technologies Symposium (PETS), 2017.
[60]
R. Nithyanand, S. Khattak, M. Javed, N. Vallina-Rodriguez, M. Falahrastegar, J. E. Powles, E. D. Cristofaro, H. Haddadi, and S. J. Murdoch. Adblocking and Counter-Blocking: A Slice of the Arms Race. In USENIX Workshop on Free and Open Communications on the Internet, 2016.
[61]
E. Pujol, O. Hohlfeld, and A. Feldmann. Annoyed Users: Ads and Ad-Block Usage in the Wild. In ACM Internet Measurement Conference (IMC), 2015.
[62]
M. Z. Rafique, T. V. Goethem, W. Joosen, C. Huygens, and N. Nikiforakis. It's Free for a Reason: Exploring the Ecosystem of Free Live Streaming Services. In Network and Distributed System Security Symposium (NDSS), 2016.
[63]
S. Ramaswamy. Building a better web for everyone. https://www.blog.google/topics/journalism-news/building-better-web-everyone/, 2017.
[64]
F. Roesner, T. Kohno, and D. Wetherall. Detecting and Defending Against Third-Party Tracking on the Web. In USENIX Symposium on Networked Systems Design and Implementation (NDSI), 2012.
[65]
G. Storey, D. Reisman, J. Mayer, and A. Narayanan. The Future of Ad Blocking: An Analytical Framework and New Techniques. In arXiv:1705.08568, 2017.
[66]
R. J. Walls, E. D. Kilmer, N. Lageman, and P. D. McDanie. Measuring the Impact and Perception of Acceptable Advertisements. In ACM Internet Measurement Conference (IMC), 2015.
[67]
J. Wilander. Apple Safari Intelligent Tracking Prevention. https://webkit.org/blog/7675/intelligent-tracking-prevention, 2017.
[68]
Y. Yang and J. O. Pedersen. A Comparative Study on Feature Selection in Text Categorization. In International Conference on Machine Learning, 1997.
[69]
Z. Yu, S. Macbeth, K. Modi, and J. M. Pujol. Tracking the Trackers. In World Wide Web (WWW) Conference, 2016.

Cited By

View all

Index Terms

  1. The ad wars: retrospective measurement and analysis of anti-adblock filter lists

      Recommendations

      Comments

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      IMC '17: Proceedings of the 2017 Internet Measurement Conference
      November 2017
      509 pages
      ISBN:9781450351188
      DOI:10.1145/3131365
      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Sponsors

      In-Cooperation

      • USENIX Assoc: USENIX Assoc

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 01 November 2017

      Permissions

      Request permissions for this article.

      Check for updates

      Author Tags

      1. adblocking
      2. anti-adblocking
      3. javascript
      4. machine learning
      5. privacy
      6. static code analysis
      7. the wayback machine

      Qualifiers

      • Research-article

      Conference

      IMC '17
      IMC '17: Internet Measurement Conference
      November 1 - 3, 2017
      London, United Kingdom

      Acceptance Rates

      Overall Acceptance Rate 277 of 1,083 submissions, 26%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)74
      • Downloads (Last 6 weeks)8
      Reflects downloads up to 31 Dec 2024

      Other Metrics

      Citations

      Cited By

      View all

      View Options

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media