TY - GEN
T1 - Supervised Adversarial Alignment of Single-Cell RNA-seq Data
AU - Ge, Songwei
AU - Wang, Haohan
AU - Alavi, Amir
AU - Xing, Eric
AU - Bar-Joseph, Ziv
N1 - Funding Information:
Acknowledgements. This work was partially supported by National Institute of Health grants 1R01GM122096 and OT2OD026682 to Z.B.J. and by a Scholars Award in Studying Complex Systems from the James S. McDonnell Foundation to Z.B.J. HW was supported by the National Institutes of Health grants R01-GM093156 and P30-DA035778.
Publisher Copyright:
© Springer Nature Switzerland AG 2020.
PY - 2020
Y1 - 2020
N2 - Dimensionality reduction is an important first step in the analysis of single cell RNA-seq (scRNA-seq) data. In addition to enabling the visualization of the profiled cells, such representations are used by many downstream analyses methods ranging from pseudo-time reconstruction to clustering to alignment of scRNA-seq data from different experiments, platforms, and labs. Both supervised and unsupervised methods have been proposed to reduce the dimension of scRNA-seq. However, all methods to date are sensitive to batch effects. When batches correlate with cell types, as is often the case, their impact can lead to representations that are batch rather than cell type specific. To overcome this we developed a domain adversarial neural network model for learning a reduced dimension representation of scRNA-seq data. The adversarial model tries to simultaneously optimize two objectives. The first is the accuracy of cell type assignment and the second is the inability to distinguish the batch (domain). We tested the method by using the resulting representation to align several different datasets. As we show, by overcoming batch effects our method was able to correctly separate cell types, improving on several prior methods suggested for this task. Analysis of the top features used by the network indicates that by taking the batch impact into account, the reduced representation is much better able to focus on key genes for each cell type.
AB - Dimensionality reduction is an important first step in the analysis of single cell RNA-seq (scRNA-seq) data. In addition to enabling the visualization of the profiled cells, such representations are used by many downstream analyses methods ranging from pseudo-time reconstruction to clustering to alignment of scRNA-seq data from different experiments, platforms, and labs. Both supervised and unsupervised methods have been proposed to reduce the dimension of scRNA-seq. However, all methods to date are sensitive to batch effects. When batches correlate with cell types, as is often the case, their impact can lead to representations that are batch rather than cell type specific. To overcome this we developed a domain adversarial neural network model for learning a reduced dimension representation of scRNA-seq data. The adversarial model tries to simultaneously optimize two objectives. The first is the accuracy of cell type assignment and the second is the inability to distinguish the batch (domain). We tested the method by using the resulting representation to align several different datasets. As we show, by overcoming batch effects our method was able to correctly separate cell types, improving on several prior methods suggested for this task. Analysis of the top features used by the network indicates that by taking the batch impact into account, the reduced representation is much better able to focus on key genes for each cell type.
KW - Batch effect removal
KW - Data integration
KW - Dimensionality reduction
KW - Domain adversarial training
KW - Single-cell RNA-seq
UR - http://www.scopus.com/inward/record.url?scp=85084268380&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85084268380&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-45257-5_5
DO - 10.1007/978-3-030-45257-5_5
M3 - Conference contribution
AN - SCOPUS:85084268380
SN - 9783030452568
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 72
EP - 87
BT - Research in Computational Molecular Biology - 24th Annual International Conference, RECOMB 2020, Proceedings
A2 - Schwartz, Russell
PB - Springer Netherlands
T2 - 24th Annual Conference on Research in Computational Molecular Biology, RECOMB 2020
Y2 - 10 May 2020 through 13 May 2020
ER -