Integrative genomic analysis predicts causative cis-regulatory mechanisms of the breast cancer–associated genetic variant rs4415084

Yi Zhang, Mohith Manjunath, Shilu Zhang, Deborah Chasman, Sushmita Roy, Jun S. Song

Research output: Contribution to journal › Article › peer-review


Previous genome-wide association studies (GWAS) have identified several common genetic variants that may significantly modulate cancer susceptibility. However, the precise molecular mechanisms behind these associations remain largely unknown; it is often not clear whether discovered variants are themselves functional or merely genetically linked to other functional variants. Here, we provide an integrated method for identifying functional regulatory variants associated with cancer and their target genes by combining analyses of expression quantitative trait loci, a modified version of allele-specific expression that systematically utilizes haplotype information, transcription factor (TF)–binding preference, and epigenetic information. Application of our method to a breast cancer susceptibility region in 5p12 demonstrates that the risk allele rs4415084-T correlates with higher expression levels of the protein-coding gene mitochondrial ribosomal protein S30 (MRPS30) and lncRNA RP11-53O19.1. We propose an intergenic SNP rs4321755, in linkage disequilibrium (LD) with the GWAS SNP rs4415084 (r² = 0.988), to be the predicted functional SNP. The risk allele rs4321755-T, in phase with the GWAS rs4415084-T, created a GATA3-binding motif within an enhancer, resulting in differential GATA3 binding and chromatin accessibility, thereby promoting transcription of MRPS30 and RP11-53O19.1. MRPS30 encodes a member of the mitochondrial ribosomal proteins, implicating the role of risk SNP in modulating mitochondrial activities in breast cancer. Our computational framework provides an effective means to integrate GWAS results with high-throughput genomic and epigenomic data and can be extended to facilitate rapid functional characterization of other genetic variants modulating cancer susceptibility.
Significance: Unification of GWAS results with information from high-throughput genomic and epigenomic profiles provides a direct link between common genetic variants and measurable molecular perturbations.
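
The abstract reports the LD between rs4321755 and rs4415084 as r² = 0.988. As a reading aid only, the following minimal sketch shows how the standard r² statistic can be computed from phased haplotypes, assuming each haplotype is given as a pair of 0/1 allele codes. The function name `ld_r_squared` and the toy haplotypes are hypothetical illustrations; they are not the authors' pipeline or data and do not reproduce the reported value.

```python
def ld_r_squared(haplotypes):
    """Return the LD statistic r^2 for two biallelic SNPs.

    `haplotypes` is a list of (allele_at_snp1, allele_at_snp2) tuples,
    with alleles coded 0 or 1 (hypothetical input format, assumed phased).
    """
    n = len(haplotypes)
    if n == 0:
        raise ValueError("no haplotypes supplied")

    # Frequencies of allele 1 at each SNP and of the 1-1 haplotype.
    p_a = sum(a for a, _ in haplotypes) / n
    p_b = sum(b for _, b in haplotypes) / n
    p_ab = sum(1 for a, b in haplotypes if a == 1 and b == 1) / n

    # D measures the deviation of the haplotype frequency from independence.
    d = p_ab - p_a * p_b
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("r^2 is undefined when a SNP is monomorphic")
    return d * d / denom


# Toy data (made up for illustration): alleles always co-occur.
perfect_ld = [(1, 1)] * 50 + [(0, 0)] * 50
# Toy data (made up for illustration): alleles are independent.
no_ld = [(1, 1), (1, 0), (0, 1), (0, 0)] * 25

print(ld_r_squared(perfect_ld))
print(ld_r_squared(no_ld))
```

An r² close to 1, as reported for this SNP pair, means the two alleles are almost always inherited together, which is why association data alone cannot separate the GWAS SNP from the proposed functional SNP.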

Original language: English (US)
Pages (from-to): 1579-1591
Number of pages: 13
Journal: Cancer Research
Issue number: 7
State: Published - Apr 1 2018

ASJC Scopus subject areas

  • Oncology
  • Cancer Research

