TY - JOUR
T1 - CNAViz
T2 - An interactive webtool for user-guided segmentation of tumor DNA sequencing data
AU - Lalani, Zubair
AU - Chu, Gillian
AU - Hsu, Silas
AU - Kagawa, Shaw
AU - Xiang, Michael
AU - Zaccaria, Simone
AU - El-Kebir, Mohammed
N1 - Publisher Copyright: © 2022 Lalani et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
PY - 2022/10
Y1 - 2022/10
AB - Copy-number aberrations (CNAs) are genetic alterations that amplify or delete the number of copies of large genomic segments. Although they are ubiquitous in cancer and, thus, a critical area of current cancer research, CNA identification from DNA sequencing data is challenging because it requires partitioning of the genome into complex segments with the same copy-number states that may not be contiguous. Existing segmentation algorithms address these challenges either by leveraging the local information among neighboring genomic regions, or by globally grouping genomic regions that are affected by similar CNAs across the entire genome. However, both approaches have limitations: overclustering in the case of local segmentation, or the omission of clusters corresponding to focal CNAs in the case of global segmentation. Importantly, inaccurate segmentation will lead to inaccurate identification of CNAs. For this reason, most pan-cancer research studies rely on manual procedures of quality control and anomaly correction. To improve copy-number segmentation, we introduce CNAViz, a web-based tool that enables the user to simultaneously perform local and global segmentation, thus overcoming the limitations of each approach. Using simulated data, we demonstrate that by several metrics, CNAViz allows the user to obtain more accurate segmentation relative to existing local and global segmentation methods. Moreover, we analyze six bulk DNA sequencing samples from three breast cancer patients. By validating with parallel single-cell DNA sequencing data from the same samples, we show that by using CNAViz, our user was able to obtain more accurate segmentation and improved accuracy in downstream copy-number calling.
UR - http://www.scopus.com/inward/record.url?scp=85140855908&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85140855908&partnerID=8YFLogxK
U2 - 10.1371/journal.pcbi.1010614
DO - 10.1371/journal.pcbi.1010614
M3 - Article
C2 - 36228003
AN - SCOPUS:85140855908
SN - 1553-734X
VL - 18
JO - PLoS computational biology
JF - PLoS computational biology
IS - 10
M1 - e1010614
ER -