TY - GEN
T1 - A SSIM Guided cGAN Architecture for Clinically Driven Generative Image Synthesis of Multiplexed Spatial Proteomics Channels
AU - Saurav, Jillur Rahman
AU - Nasr, Mohammad Sadegh
AU - Shang, Helen H.
AU - Koomey, Paul
AU - Robben, Michael
AU - Huber, Manfred
AU - Weidanz, Jon
AU - Ryan, Brid
AU - Ruppin, Eytan
AU - Jiang, Peng
AU - Luber, Jacob M.
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - Histopathological work in clinical labs often relies on immunostaining of proteins, which can be time-consuming and costly. Multiplexed spatial proteomics imaging can increase interpretive power, but current methods cannot cost-effectively sample the entire proteomic retinue important to diagnostic medicine or drug development. To address this challenge, we developed a conditional generative adversarial network (cGAN) that performs image-To-image (i2i) synthesis to generate accurate biomarker channels in multiplexed spatial proteomics images1. We approached this problem as missing biomarker expression generation, where we assumed that a given n-channel multiplexed image has p channels (biomarkers) present and q channels (biomarkers) absent, with p+q=n, and we aimed to generate the missing q channels. To improve accuracy, we selected p and q channels based on their structural similarity, as measured by a structural similarity index measure (SSIM). We demonstrated the effectiveness of our approach using spatial proteomic data from the Human BioMolecular Atlas Program (HuBMAP)2, which we used to generate spatial representations of missing proteins through a U-Net based image synthesis pipeline. Channels were hierarchically clustered by SSIM to obtain the minimal set needed to recapitulate the underlying biology represented by the spatial landscape of proteins. We also assessed the scalability of our algorithm using regression slope analysis, which showed that it can generate increasing numbers of missing biomarkers in multiplexed spatial proteomics images. Furthermore, we validated our approach by generating a new spatial proteomics data set from human lung adenocarcinoma tissue sections and showed that our model could accurately synthesize the missing channels from this new data set. Overall, our approach provides a cost-effective and time-efficient alternative to traditional immunostaining methods for generating missing biomarker channels, while also increasing the amount of data that can be generated through experiments. This has important implications for the future of medical diagnostics and drug development, and raises important questions about the ethical implications of utilizing data produced by generative image synthesis in the clinical setting.1https://github.com/aauthors131/mu1tip1exed-image-synthesis2https://portal.hubmapconsortium.org
AB - Histopathological work in clinical labs often relies on immunostaining of proteins, which can be time-consuming and costly. Multiplexed spatial proteomics imaging can increase interpretive power, but current methods cannot cost-effectively sample the entire proteomic retinue important to diagnostic medicine or drug development. To address this challenge, we developed a conditional generative adversarial network (cGAN) that performs image-To-image (i2i) synthesis to generate accurate biomarker channels in multiplexed spatial proteomics images1. We approached this problem as missing biomarker expression generation, where we assumed that a given n-channel multiplexed image has p channels (biomarkers) present and q channels (biomarkers) absent, with p+q=n, and we aimed to generate the missing q channels. To improve accuracy, we selected p and q channels based on their structural similarity, as measured by a structural similarity index measure (SSIM). We demonstrated the effectiveness of our approach using spatial proteomic data from the Human BioMolecular Atlas Program (HuBMAP)2, which we used to generate spatial representations of missing proteins through a U-Net based image synthesis pipeline. Channels were hierarchically clustered by SSIM to obtain the minimal set needed to recapitulate the underlying biology represented by the spatial landscape of proteins. We also assessed the scalability of our algorithm using regression slope analysis, which showed that it can generate increasing numbers of missing biomarkers in multiplexed spatial proteomics images. Furthermore, we validated our approach by generating a new spatial proteomics data set from human lung adenocarcinoma tissue sections and showed that our model could accurately synthesize the missing channels from this new data set. Overall, our approach provides a cost-effective and time-efficient alternative to traditional immunostaining methods for generating missing biomarker channels, while also increasing the amount of data that can be generated through experiments. This has important implications for the future of medical diagnostics and drug development, and raises important questions about the ethical implications of utilizing data produced by generative image synthesis in the clinical setting.1https://github.com/aauthors131/mu1tip1exed-image-synthesis2https://portal.hubmapconsortium.org
KW - CODEX
KW - cancer imaging
KW - image synthesis
KW - multiplexed image synthesis
UR - http://www.scopus.com/inward/record.url?scp=85174890758&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85174890758&partnerID=8YFLogxK
U2 - 10.1109/CIBCB56990.2023.10264899
DO - 10.1109/CIBCB56990.2023.10264899
M3 - Conference contribution
AN - SCOPUS:85174890758
T3 - CIBCB 2023 - 20th IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology
BT - CIBCB 2023 - 20th IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 20th IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB 2023
Y2 - 29 August 2023 through 31 August 2023
ER -