TY - JOUR
T1 - Leveraging epigenomes and three-dimensional genome organization for interpreting regulatory variation
AU - Baur, Brittany
AU - Shin, Junha
AU - Schreiber, Jacob
AU - Zhang, Shilu
AU - Zhang, Yi
AU - Manjunath, Mohith
AU - Song, Jun S.
AU - Noble, William Stafford
AU - Roy, Sushmita
N1 - Funding: This work was supported by the Genomics Sciences Training Program at UW-Madison (NHGRI 5T32HG002760) for BB, NHGRI R01 grant R01-HG010045-01 for SR and BB, the Center for Predictive Computational Phenotyping (NIH BD2K U54 AI117924) for BB and SR, NIH award U24HG009446 for WSN and JSc, NIH R01 grant R01-CA163336 for JSS, YZ, and MM, and James McDonell Foundation Grant 3194-133-349500-4-AAB5159 for SR and JSh. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
PY - 2023/7
Y1 - 2023/7
N2 - Understanding the impact of regulatory variants on complex phenotypes is a significant challenge because the genes and pathways that are targeted by such variants and the cell type context in which regulatory variants operate are typically unknown. Cell-type-specific long-range regulatory interactions that occur between a distal regulatory sequence and a gene offer a powerful framework for examining the impact of regulatory variants on complex phenotypes. However, high-resolution maps of such long-range interactions are available only for a handful of cell types. Furthermore, identifying specific gene subnetworks or pathways that are targeted by a set of variants is a significant challenge. We have developed LHiC-Reg, a Random Forests regression method to predict high-resolution contact counts in new cell types, and a network-based framework to identify candidate cell-type-specific gene networks targeted by a set of variants from a genome-wide association study (GWAS). We applied our approach to predict interactions in 55 Roadmap Epigenomics Mapping Consortium cell types, which we used to interpret regulatory single nucleotide polymorphisms (SNPs) in the NHGRI-EBI GWAS catalogue. Using our approach, we performed an in-depth characterization of fifteen different phenotypes including schizophrenia, coronary artery disease (CAD) and Crohn’s disease. We found differentially wired subnetworks consisting of known as well as novel gene targets of regulatory SNPs. Taken together, our compendium of interactions and the associated network-based analysis pipeline leverages long-range regulatory interactions to examine the context-specific impact of regulatory variation in complex phenotypes.
AB - Understanding the impact of regulatory variants on complex phenotypes is a significant challenge because the genes and pathways that are targeted by such variants and the cell type context in which regulatory variants operate are typically unknown. Cell-type-specific long-range regulatory interactions that occur between a distal regulatory sequence and a gene offer a powerful framework for examining the impact of regulatory variants on complex phenotypes. However, high-resolution maps of such long-range interactions are available only for a handful of cell types. Furthermore, identifying specific gene subnetworks or pathways that are targeted by a set of variants is a significant challenge. We have developed LHiC-Reg, a Random Forests regression method to predict high-resolution contact counts in new cell types, and a network-based framework to identify candidate cell-type-specific gene networks targeted by a set of variants from a genome-wide association study (GWAS). We applied our approach to predict interactions in 55 Roadmap Epigenomics Mapping Consortium cell types, which we used to interpret regulatory single nucleotide polymorphisms (SNPs) in the NHGRI-EBI GWAS catalogue. Using our approach, we performed an in-depth characterization of fifteen different phenotypes including schizophrenia, coronary artery disease (CAD) and Crohn’s disease. We found differentially wired subnetworks consisting of known as well as novel gene targets of regulatory SNPs. Taken together, our compendium of interactions and the associated network-based analysis pipeline leverages long-range regulatory interactions to examine the context-specific impact of regulatory variation in complex phenotypes.
UR - http://www.scopus.com/inward/record.url?scp=85165521599&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85165521599&partnerID=8YFLogxK
U2 - 10.1371/journal.pcbi.1011286
DO - 10.1371/journal.pcbi.1011286
M3 - Article
C2 - 37428809
SN - 1553-734X
VL - 19
JO - PLoS computational biology
JF - PLoS computational biology
IS - 7
M1 - e1011286
ER -