TY - JOUR
T1 - Data Quality Measures for Computational Research
T2 - Ensuring Informed Decisions with Emerging Data Sources
AU - Malthouse, Edward C.
AU - Maslowska, Ewa
AU - Strycharz, Joanna
AU - Block, Martin
AU - Araujo, Theo
N1 - Publisher Copyright:
© 2024, American Academy of Advertising.
PY - 2024/10/3
Y1 - 2024/10/3
AB - The proliferation of computational advertising (CA) and other technological developments in artificial intelligence have greatly expanded the types of data used in advertising research, thereby creating new data types. The advertising community needs ways to evaluate the quality of CA data. Although traditional frameworks for evaluating quality are still relevant, they must be updated for these new conditions. Data quality discussions are actively occurring in other fields, including marketing, machine learning, and computational social science. This article provides a short history of advertising data and a summary of the total survey error (TSE) and validity approaches to quality. Three approaches—collaborative, independent, and synthetic—for advertising scholars to access CA data are identified and discussed. This article reviews how TSE and validity can be used to evaluate newer CA data situations and provides a bridge between CA data terminology and quality considerations discussed in different literatures. It proposes and develops two new quality criteria: audit validity, referring to whether the data can be independently validated and the findings replicated, and normative validity, which describes the ethical and responsible collection and use of data, avoiding harms and preserving privacy of individuals involved. Finally, this article provides a checklist for advertising scholars.
UR - http://www.scopus.com/inward/record.url?scp=85205458057&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85205458057&partnerID=8YFLogxK
U2 - 10.1080/00913367.2024.2403609
DO - 10.1080/00913367.2024.2403609
M3 - Review article
AN - SCOPUS:85205458057
SN - 0091-3367
VL - 53
SP - 644
EP - 660
JO - Journal of Advertising
JF - Journal of Advertising
IS - 5
ER -