Abstract
The size and dynamism of the Web pose challenges for all its stakeholders, which include producers and consumers of content as well as advertisers who want to place advertisements next to relevant content. A critical piece of information for producers/publishers of content as well as advertisers is the demographics of the consumers who are likely to visit a given web site. In this article we explore predictive models that attempt to deduce the demographics of the audience of a web site using cues embedded in the design or the content of its homepage. We find that it is possible to effectively predict different types of demographics of consumers of web sites on the basis of the suggested approach. Through a statistical analysis we observe that several design elements and the content differ significantly among web sites dominated by consumers of different demographic classes. We also suggest the use of an ensemble classifier that combines the content and design cues with the goal of further improving the prediction performance.
Original language | English (US) |
---|---|
Pages (from-to) | 718-730 |
Number of pages | 13 |
Journal | Information and Management |
Volume | 56 |
Issue number | 5 |
DOIs | |
State | Published - Jul 2019 |
Externally published | Yes |
Keywords
- Audience demographics
- Machine learning
- Predictive analytics
- Text classification
ASJC Scopus subject areas
- Management Information Systems
- Information Systems
- Information Systems and Management