VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing - PubMed
- ️Invalid Date
VarScan 2: somatic mutation and copy number alteration discovery in cancer by exome sequencing
Daniel C Koboldt et al. Genome Res. 2012 Mar.
Abstract
Cancer is a disease driven by genetic variation and mutation. Exome sequencing can be utilized for discovering these variants and mutations across hundreds of tumors. Here we present an analysis tool, VarScan 2, for the detection of somatic mutations and copy number alterations (CNAs) in exome data from tumor-normal pairs. Unlike most current approaches, our algorithm reads data from both samples simultaneously; a heuristic and statistical algorithm detects sequence variants and classifies them by somatic status (germline, somatic, or LOH); while a comparison of normalized read depth delineates relative copy number changes. We apply these methods to the analysis of exome sequence data from 151 high-grade ovarian tumors characterized as part of the Cancer Genome Atlas (TCGA). We validated some 7790 somatic coding mutations, achieving 93% sensitivity and 85% precision for single nucleotide variant (SNV) detection. Exome-based CNA analysis identified 29 large-scale alterations and 619 focal events per tumor on average. As in our previous analysis of these data, we observed frequent amplification of oncogenes (e.g., CCNE1, MYC) and deletion of tumor suppressors (NF1, PTEN, and CDKN2A). We searched for additional recurrent focal CNAs using the correlation matrix diagonal segmentation (CMDS) algorithm, which identified 424 significant events affecting 582 genes. Taken together, our results demonstrate the robust performance of VarScan 2 for somatic mutation and CNA detection and shed new light on the landscape of genetic alterations in ovarian cancer.
Figures

The VarScan 2 mutation and copy number alteration detection algorithms. Alignments in BAM format for a tumor–normal pair are read simultaneously to identify inherited (germline), loss-of-heterozygosity (LOH), and somatic mutation events. Variants in each category are further classified as high confidence (HC) or low confidence (LC). HC variants are filtered to remove false positives from common sequencing- and alignment-related artifacts (see Table 1). The resulting variants are annotated and organized by tier; the average number of “tier 1” coding variants per tumor is shown for each category. At positions with at least 20× coverage (default), copy number alterations are detected by comparison of Q20 read depths from matched tumor–normal pairs, normalized based on the amount of input data for each sample. Raw contiguous regions from VarScan 2 are processed by circular binary segmentation (CBS) and a subsequent merging procedure that joins adjacent segments yields a set of somatic copy number alterations, which are further classified as large-scale (>25% of chromosome arm) or focal (<25%) events. Shown are the average numbers of events detected in 142 ovarian exomes.

Detection of large-scale and focal copy number alterations by sequencing- and array-based approaches. (A) Deletions and focal amplifications of chromosome 4 in sample TCGA-24-1103. Copy number estimates from array (gray), WGS (light blue), and exome (dark blue) indicate two regions of deletion as well as a focal amplification (window). Red lines indicate segmented exome CBS calls. (Below) Variant allele frequencies in the normal (blue) and tumor (green) indicate regions of loss of heterozygosity (LOH) in deleted segments. (B) Intersection of large-scale copy number alterations detected by SNP array, whole-genome sequencing, and exome sequencing approaches for five HGS-OVCa cases. For details, see Supplemental Table 3. (C) Intersection of gene-level (focal) copy number alterations detected by SNP array, whole-genome sequencing, and exome sequencing approaches for five HGS-OVCa cases.

Recurrent chromosome-arm gains and losses in ovarian cancer. Eight significant gains and 22 significant losses of chromosome arms identified by TCGA in SNP array data for 489 cases were recapitulated using exome data for 142 cases. Observed frequencies were highly correlated between data sets for both gains (r2 = 0.84) and losses (r2 = 0.86).

Global copy number alteration profile of ovarian cancer. Average log2 of copy number difference is plotted for chromosomes 1–22 and X. Amplifications are shown in red, deletions in blue, and neutral regions in gray. Significant peaks associated with known oncogenes or tumor suppressor genes are indicated.

Frequent copy number alteration of ovarian cancer genes. Exome-based copy number estimates were used to compute the proportion of ovarian cancer tumors (n = 142) exhibiting amplification or deletion of key ovarian cancer genes. Asterisks (*) indicate significantly altered genes identified from SNP array data in our previous study.
Similar articles
-
Inferring copy number and genotype in tumour exome data.
Amarasinghe KC, Li J, Hunter SM, Ryland GL, Cowin PA, Campbell IG, Halgamuge SK. Amarasinghe KC, et al. BMC Genomics. 2014 Aug 28;15(1):732. doi: 10.1186/1471-2164-15-732. BMC Genomics. 2014. PMID: 25167919 Free PMC article.
-
Sun JX, He Y, Sanford E, Montesion M, Frampton GM, Vignot S, Soria JC, Ross JS, Miller VA, Stephens PJ, Lipson D, Yelensky R. Sun JX, et al. PLoS Comput Biol. 2018 Feb 7;14(2):e1005965. doi: 10.1371/journal.pcbi.1005965. eCollection 2018 Feb. PLoS Comput Biol. 2018. PMID: 29415044 Free PMC article.
-
Franch-Expósito S, Bassaganyas L, Vila-Casadesús M, Hernández-Illán E, Esteban-Fabró R, Díaz-Gay M, Lozano JJ, Castells A, Llovet JM, Castellví-Bel S, Camps J. Franch-Expósito S, et al. Elife. 2020 Jan 15;9:e50267. doi: 10.7554/eLife.50267. Elife. 2020. PMID: 31939734 Free PMC article.
-
Exome sequence read depth methods for identifying copy number changes.
Kadalayil L, Rafiq S, Rose-Zerilli MJ, Pengelly RJ, Parker H, Oscier D, Strefford JC, Tapper WJ, Gibson J, Ennis S, Collins A. Kadalayil L, et al. Brief Bioinform. 2015 May;16(3):380-92. doi: 10.1093/bib/bbu027. Epub 2014 Aug 28. Brief Bioinform. 2015. PMID: 25169955 Review.
-
Mehrad M, LaFramboise WA, Lyons MA, Trejo Bittar HE, Yousem SA. Mehrad M, et al. Hum Pathol. 2018 Aug;78:36-43. doi: 10.1016/j.humpath.2018.04.005. Epub 2018 Apr 22. Hum Pathol. 2018. PMID: 29689243 Review.
Cited by
-
Whole-Exome Sequencing Reveals Rare Germline Mutations in Patients With Hemifacial Microsomia.
Chen X, Liu F, Mar Aung Z, Zhang Y, Chai G. Chen X, et al. Front Genet. 2021 May 17;12:580761. doi: 10.3389/fgene.2021.580761. eCollection 2021. Front Genet. 2021. PMID: 34079577 Free PMC article.
-
Saal LH, Vallon-Christersson J, Häkkinen J, Hegardt C, Grabau D, Winter C, Brueffer C, Tang MH, Reuterswärd C, Schulz R, Karlsson A, Ehinger A, Malina J, Manjer J, Malmberg M, Larsson C, Rydén L, Loman N, Borg Å. Saal LH, et al. Genome Med. 2015 Feb 2;7(1):20. doi: 10.1186/s13073-015-0131-9. eCollection 2015. Genome Med. 2015. PMID: 25722745 Free PMC article.
-
Bhakta MS, Jones VA, Vallejos CE. Bhakta MS, et al. PLoS One. 2015 Jan 28;10(1):e0116822. doi: 10.1371/journal.pone.0116822. eCollection 2015. PLoS One. 2015. PMID: 25629314 Free PMC article.
-
Clinical integration of next-generation sequencing technology.
Gullapalli RR, Lyons-Weiler M, Petrosko P, Dhir R, Becich MJ, LaFramboise WA. Gullapalli RR, et al. Clin Lab Med. 2012 Dec;32(4):585-99. doi: 10.1016/j.cll.2012.07.005. Clin Lab Med. 2012. PMID: 23078661 Free PMC article. Review.
-
Enhanced whole exome sequencing by higher DNA insert lengths.
Pommerenke C, Geffers R, Bunk B, Bhuju S, Eberth S, Drexler HG, Quentmeier H. Pommerenke C, et al. BMC Genomics. 2016 May 25;17:399. doi: 10.1186/s12864-016-2698-y. BMC Genomics. 2016. PMID: 27225215 Free PMC article.
References
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Research Materials
Miscellaneous