Predicting web site audience demographics using content and design cues

Iljoo Kim, Gautam Pant

Research output: Contribution to journalArticlepeer-review

Abstract

The size and dynamism of the Web pose challenges for all its stakeholders, which include producers and consumers of content as well as advertisers who want to place advertisements next to relevant content. A critical piece of information for producers/publishers of content as well as advertisers is the demographics of the consumers who are likely to visit a given web site. In this article we explore predictive models that attempt to deduce the demographics of the audience of a web site using cues embedded in the design or the content of its homepage. We find that it is possible to effectively predict different types of demographics of consumers of web sites on the basis of the suggested approach. Through a statistical analysis we observe that several design elements and the content differ significantly among web sites dominated by consumers of different demographic classes. We also suggest the use of an ensemble classifier that combines the content and design cues with the goal of further improving the prediction performance.

Original languageEnglish (US)
Pages (from-to)718-730
Number of pages13
JournalInformation and Management
Volume56
Issue number5
DOIs
StatePublished - Jul 2019
Externally publishedYes

Keywords

  • Audience demographics
  • Machine learning
  • Predictive analytics
  • Text classification

ASJC Scopus subject areas

  • Management Information Systems
  • Information Systems
  • Information Systems and Management

Fingerprint

Dive into the research topics of 'Predicting web site audience demographics using content and design cues'. Together they form a unique fingerprint.

Cite this