TY - JOUR
T1 - Dimensions of uncertainty
T2 - a spatiotemporal review of five COVID-19 datasets
AU - Halpern, Dylan
AU - Lin, Qinyun
AU - Wang, Ryan
AU - Yang, Stephanie
AU - Goldstein, Steve
AU - Kolak, Marynia
N1 - The US Covid Atlas project is funded in part by the Robert Wood Johnson Foundation. This research was also supported by the National Institutes of Health through the NIH HEAL Initiative under award number U2CDA050098. This research was made possible by the open source and open access efforts of the New York Times, Johns Hopkins University, and continuing public access of data from the CDC and USAFacts. Thanks to 1Point3Acres for continued data use permissions.
PY - 2021/10/25
Y1 - 2021/10/25
N2 - COVID-19 surveillance across the United States is essential to tracking and mitigating the pandemic, but data representing cases and deaths may be impacted by attribute, spatial, and temporal uncertainties. COVID-19 case and death data are essential to understanding the pandemic and serve as key inputs for prediction models that inform policy-decisions; consistent information across datasets is critical to ensuring coherent findings. We implement an exploratory data analytic approach to characterize, synthesize, and visualize spatial-temporal dimensions of uncertainty across commonly used datasets for case and death metrics (Johns Hopkins University, the New York Times, USAFacts, and 1Point3Acres). We scrutinize data consistency to assess where and when disagreements occur, potentially indicating underlying uncertainty. We observe differences in cumulative case and death rates to highlight discrepancies and identify spatial patterns. Data are assessed using pairwise agreement (Cohen’s kappa) and agreement across all datasets (Fleiss’ kappa) to summarize changes over time. Findings suggest highest agreements between CDC, JHU, and NYT datasets. We find nine discrete type-components of information uncertainty for COVID-19 datasets reflecting various complex processes. Understanding processes and indicators of uncertainty in COVID-19 data reporting is especially relevant to public health professionals and policymakers to accurately understand and communicate information about the pandemic.
AB - COVID-19 surveillance across the United States is essential to tracking and mitigating the pandemic, but data representing cases and deaths may be impacted by attribute, spatial, and temporal uncertainties. COVID-19 case and death data are essential to understanding the pandemic and serve as key inputs for prediction models that inform policy-decisions; consistent information across datasets is critical to ensuring coherent findings. We implement an exploratory data analytic approach to characterize, synthesize, and visualize spatial-temporal dimensions of uncertainty across commonly used datasets for case and death metrics (Johns Hopkins University, the New York Times, USAFacts, and 1Point3Acres). We scrutinize data consistency to assess where and when disagreements occur, potentially indicating underlying uncertainty. We observe differences in cumulative case and death rates to highlight discrepancies and identify spatial patterns. Data are assessed using pairwise agreement (Cohen’s kappa) and agreement across all datasets (Fleiss’ kappa) to summarize changes over time. Findings suggest highest agreements between CDC, JHU, and NYT datasets. We find nine discrete type-components of information uncertainty for COVID-19 datasets reflecting various complex processes. Understanding processes and indicators of uncertainty in COVID-19 data reporting is especially relevant to public health professionals and policymakers to accurately understand and communicate information about the pandemic.
KW - COVID-19
KW - Data uncertainty
KW - epidemiology
KW - health informatics
KW - public health
UR - http://www.scopus.com/inward/record.url?scp=85118108010&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85118108010&partnerID=8YFLogxK
U2 - 10.1080/15230406.2021.1975311
DO - 10.1080/15230406.2021.1975311
M3 - Article
AN - SCOPUS:85118108010
SN - 1523-0406
VL - 51
SP - 200
EP - 221
JO - Cartography and Geographic Information Science
JF - Cartography and Geographic Information Science
IS - 2
ER -