TY - JOUR
T1 - Graphical Model Inference with Erosely Measured Data
AU - Zheng, Lili
AU - Allen, Genevera I.
N1 - The authors gratefully acknowledge support by NSF NeuroNex-1707400, NIH 1R01GM140468, and NSF DMS-2210837. The authors also thank Gautam Dasarathy for helpful discussion during the preparation of this article.
PY - 2024
Y1 - 2024
N2 - In this article, we investigate the Gaussian graphical model inference problem in a novel setting that we call erose measurements, referring to irregularly measured or observed data. For graphs, this results in different node pairs having vastly different sample sizes which frequently arises in data integration, genomics, neuroscience, and sensor networks. Existing works characterize the graph selection performance using the minimum pairwise sample size, which provides little insights for erosely measured data, and no existing inference method is applicable. We aim to fill in this gap by proposing the first inference method that characterizes the different uncertainty levels over the graph caused by the erose measurements, named GI-JOE (Graph Inference when Joint Observations are Erose). Specifically, we develop an edge-wise inference method and an affiliated FDR control procedure, where the variance of each edge depends on the sample sizes associated with corresponding neighbors. We prove statistical validity under erose measurements, thanks to careful localized edge-wise analysis and disentangling the dependencies across the graph. Finally, through simulation studies and a real neuroscience data example, we demonstrate the advantages of our inference methods for graph selection from erosely measured data. Supplementary materials for this article are available online.
AB - In this article, we investigate the Gaussian graphical model inference problem in a novel setting that we call erose measurements, referring to irregularly measured or observed data. For graphs, this results in different node pairs having vastly different sample sizes which frequently arises in data integration, genomics, neuroscience, and sensor networks. Existing works characterize the graph selection performance using the minimum pairwise sample size, which provides little insights for erosely measured data, and no existing inference method is applicable. We aim to fill in this gap by proposing the first inference method that characterizes the different uncertainty levels over the graph caused by the erose measurements, named GI-JOE (Graph Inference when Joint Observations are Erose). Specifically, we develop an edge-wise inference method and an affiliated FDR control procedure, where the variance of each edge depends on the sample sizes associated with corresponding neighbors. We prove statistical validity under erose measurements, thanks to careful localized edge-wise analysis and disentangling the dependencies across the graph. Finally, through simulation studies and a real neuroscience data example, we demonstrate the advantages of our inference methods for graph selection from erosely measured data. Supplementary materials for this article are available online.
KW - FDR control
KW - Graph selection
KW - Graph structure inference
KW - Missing data
KW - Uneven measurements
UR - http://www.scopus.com/inward/record.url?scp=85174540200&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85174540200&partnerID=8YFLogxK
U2 - 10.1080/01621459.2023.2256503
DO - 10.1080/01621459.2023.2256503
M3 - Article
C2 - 39328784
AN - SCOPUS:85174540200
SN - 0162-1459
VL - 119
SP - 2282
EP - 2293
JO - Journal of the American Statistical Association
JF - Journal of the American Statistical Association
IS - 547
ER -