TY - JOUR
T1 - Gene selection for optimal prediction of cell position in tissues from single-cell transcriptomics data
AU - DREAM SCTC Consortium
AU - Tanevski, Jovan
AU - Nguyen, Thin
AU - Truong, Buu
AU - Karaiskos, Nikos
AU - Ahsen, Mehmet Eren
AU - Zhang, Xinyu
AU - Shu, Chang
AU - Xu, Ke
AU - Liang, Xiaoyu
AU - Hu, Ying
AU - Pham, Hoang Vv
AU - Xiaomei, Li
AU - Le, Thuc D
AU - Tarca, Adi L
AU - Bhatti, Gaurav
AU - Romero, Roberto
AU - Karathanasis, Nestoras
AU - Loher, Phillipe
AU - Chen, Yang
AU - Ouyang, Zhengqing
AU - Mao, Disheng
AU - Zhang, Yuping
AU - Zand, Maryam
AU - Ruan, Jianhua
AU - Hafemeister, Christoph
AU - Qiu, Peng
AU - Tran, Duc
AU - Nguyen, Tin
AU - Gabor, Attila
AU - Yu, Thomas
AU - Guinney, Justin
AU - Glaab, Enrico
AU - Krause, Roland
AU - Banda, Peter
AU - Stolovitzky, Gustavo
AU - Rajewsky, Nikolaus
AU - Saez-Rodriguez, Julio
AU - Meyer, Pablo
N1 - Funding Information:
This research was funded in part by PROACTIVE 2017 “From Single-Cell to Multi-Cells Information Systems Analysis” (C92F17003530005 Department of Information Engineering, University of Padova) for BD Camillo; National Institutes of Health grant number U54CA21729 for J Ruan; Indian Council of Medical Research—Junior Research Fellowship for S Ahmad and X Wang was funded by the National Natural Science Foundation of China (No. 61702421 and No. 61772426). P Meyer thanks KV for help editing.
PY - 2020/9/24
Y1 - 2020/9/24
N2 - Single-cell RNA-sequencing (scRNAseq) technologies are rapidly evolving. Although very informative, in standard scRNAseq experiments, the spatial organization of the cells in the tissue of origin is lost. Conversely, spatial RNA-seq technologies designed to maintain cell localization have limited throughput and gene coverage. Mapping scRNAseq to genes with spatial information increases coverage while providing spatial location. However, methods to perform such mapping have not yet been benchmarked. To fill this gap, we organized the DREAM Single-Cell Transcriptomics challenge focused on the spatial reconstruction of cells from the Drosophila embryo from scRNAseq data, leveraging as silver standard, genes with in situ hybridization data from the Berkeley Drosophila Transcription Network Project reference atlas. The 34 participating teams used diverse algorithms for gene selection and location prediction, while being able to correctly localize clusters of cells. Selection of predictor genes was essential for this task. Predictor genes showed a relatively high expression entropy, high spatial clustering and included prominent developmental genes such as gap and pair-rule genes and tissue markers. Application of the top 10 methods to a zebra fish embryo dataset yielded similar performance and statistical properties of the selected genes than in the Drosophila data. This suggests that methods developed in this challenge are able to extract generalizable properties of genes that are useful to accurately reconstruct the spatial arrangement of cells in tissues.
AB - Single-cell RNA-sequencing (scRNAseq) technologies are rapidly evolving. Although very informative, in standard scRNAseq experiments, the spatial organization of the cells in the tissue of origin is lost. Conversely, spatial RNA-seq technologies designed to maintain cell localization have limited throughput and gene coverage. Mapping scRNAseq to genes with spatial information increases coverage while providing spatial location. However, methods to perform such mapping have not yet been benchmarked. To fill this gap, we organized the DREAM Single-Cell Transcriptomics challenge focused on the spatial reconstruction of cells from the Drosophila embryo from scRNAseq data, leveraging as silver standard, genes with in situ hybridization data from the Berkeley Drosophila Transcription Network Project reference atlas. The 34 participating teams used diverse algorithms for gene selection and location prediction, while being able to correctly localize clusters of cells. Selection of predictor genes was essential for this task. Predictor genes showed a relatively high expression entropy, high spatial clustering and included prominent developmental genes such as gap and pair-rule genes and tissue markers. Application of the top 10 methods to a zebra fish embryo dataset yielded similar performance and statistical properties of the selected genes than in the Drosophila data. This suggests that methods developed in this challenge are able to extract generalizable properties of genes that are useful to accurately reconstruct the spatial arrangement of cells in tissues.
UR - http://www.scopus.com/inward/record.url?scp=85091617226&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85091617226&partnerID=8YFLogxK
U2 - 10.26508/lsa.202000867
DO - 10.26508/lsa.202000867
M3 - Article
C2 - 32972997
SN - 2575-1077
VL - 3
JO - Life Science Alliance
JF - Life Science Alliance
IS - 11
M1 - e202000867
ER -