Taster's choice: A comparative analysis of spam feeds

Andreas Pitsillidis, Chris Kanich, Geoffrey M. Voelker, Kirill Levchenko, Stefan Savage

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

E-mail spam has been the focus of a wide variety of measurement studies, at least in part due to the plethora of spam data sources available to the research community. However, there has been little attention paid to the suitability of such data sources for the kinds of analyses they are used for. In spite of the broad range of data available, most studies use a single "spam feed" and there has been little examination of how such feeds may differ in content. In this paper we provide this characterization by comparing the contents of ten distinct contemporaneous feeds of spam-advertised domain names. We document significant variations based on how such feeds are collected and show how these variations can produce differences in findings as a result.

Original language: English (US)
Title of host publication: IMC 2012 - Proceedings of the ACM Internet Measurement Conference
Pages: 427-440
Number of pages: 14
State: Published - 2012
Externally published: Yes
Event: 2012 ACM Internet Measurement Conference, IMC 2012 - Boston, MA, United States
Duration: Nov 14 2012 to Nov 16 2012

Publication series

Name: Proceedings of the ACM SIGCOMM Internet Measurement Conference, IMC

Other

Other: 2012 ACM Internet Measurement Conference, IMC 2012
Country: United States
City: Boston, MA
Period: 11/14/12 to 11/16/12

Keywords

  • domain blacklists
  • measurement
  • spam e-mail

ASJC Scopus subject areas

  • Software
  • Computer Networks and Communications
