pubmed.ncbi.nlm.nih.gov

SWAN: Subset-quantile within array normalization for illumina infinium HumanMethylation450 BeadChips - PubMed

  • ️Sun Jan 01 2012

SWAN: Subset-quantile within array normalization for illumina infinium HumanMethylation450 BeadChips

Jovana Maksimovic et al. Genome Biol. 2012.

Abstract

DNA methylation is the most widely studied epigenetic mark and is known to be essential to normal development and frequently disrupted in disease. The Illumina HumanMethylation450 BeadChip assays the methylation status of CpGs at 485,577 sites across the genome. Here we present Subset-quantile Within Array Normalization (SWAN), a new method that substantially improves the results from this platform by reducing technical variation within and between arrays. SWAN is available in the minfi Bioconductor package.

PubMed Disclaimer

Figures

Figure 1
Figure 1

Illumina Infinium HumanMethylation450 assay. (a) Infinium I assay. Each individual CpG is interrogated using two bead types: methylated (M) and unmethylated (U). The probe design assumes that all CpGs underlying the probe body have the same methylation status as the target CpG. Both bead types will incorporate the same labeled nucleotide for the same target CpG, thereby producing the same color fluorescence. The nucleotide that is added is determined by the base downstream of the 'C' of the target CpG. The proportion of methylation, β, can be calculated by comparing the intensities from the two different probes in the same color: β= M/(U + M). (b) Infinium II assay. Each target CpG is interrogated using a single bead type. A probe may have up to three underlying CpG sites, with a degenerate R base corresponding to the 'C' of each CpG. Methylation state is detected by single base extension at the position of the 'C' of the target CpG, which always results in the addition of a labeled 'G' or 'A' nucleotide, complementary to either the 'methylated' C or 'unmethylated' T, respectively. Each locus is detected in two colors, and methylation status is determined by comparing the two colors from the one position: β = Green (M)/(Red (U) + Green (M)). (c) The number of CpG dinucleotides in the body of the probe according to Infinium probe type. Infinium I probes have significantly more CpGs in the probe body.

Figure 2
Figure 2

Intensity distributions of the Infinium I and II probe types in the methylated and unmethylated channels for normal human kidney sample TCGA-B0-5092-11. The qualitative differences in the intensity distributions are probably driven by the biological differences between the regions that the two probe types are interrogating, which is reflected by the difference in density of CpGs in the body of the probes.

Figure 3
Figure 3

Intensity distributions of subsets of Infinium I and II probes with the same number of underlying CpGs for normal human kidney sample TCGA-B0-5092-11. (a) One CpG in the probe body. (b) Two CpGs in the probe body. (c) Three CpGs in the probe body.

Figure 4
Figure 4

Differences in β value distributions produced by Infinium I and II probes. (a) Density distributions of β values produced by Infinium I (solid line) and II (dashed line) probes for unmethylated (blue), hemi-methylated (purple) and methylated (red) reference standards (Std.). The difference in β value distribution between the Infinium I and II probes seen in the raw data can be adjusted for using the SWAN method. (b) The median and inter-quartile range of β value distributions for Infinium I and II probes is more similar when SWAN is applied to 450k data. (c) The differences in β value distributions produced by the different probe types can result in aberrant overall β value distributions, as seen in this normal human DNA sample. Applying the SWAN method results in an improvement in the overall β value distribution.

Figure 5
Figure 5

β value density distributions for four pairs of technical replicates before and after applying SWAN. (a-d) β value density distributions for each pair of technical replicates. The Kolmogorov-Smirnov (KS) test P-value reflects the similarity of the β value distributions between each pair of replicates; a larger P-value indicates that the distributions of the replicates are more similar.

Figure 6
Figure 6

Comparison of β value density distributions between HumanMethylation450 and HumanMethylation27 arrays before and after SWAN in an MCF7 cell line. This plot illustrates the β value density distributions for 25,978 CpGs that are present on both 450k and 27k platforms. The peaks of the 450k Infinium I and II probe types show better alignment with the 27k peaks when SWAN is used.

Figure 7
Figure 7

Results of differential methylation analysis of three kidney samples compared to three rectum samples, with and without using SWAN. (a) Percentage of RRBS true positives identified at various q-value significance thresholds. Using the SWAN method (magenta) consistently detects more RRBS true positives than analyzing raw data (black). (b) Receiver operating characteristic (ROC) curve of an analysis using SWAN compared to an analysis of the raw data. Using SWAN (magenta) prior to differential methylation analysis results in performance gains.

Similar articles

Cited by

References

    1. Rakyan VK, Down TA, Thorne NP, Flicek P, Kulesha E, Gräf S, Tomazou EM, Bäckdahl L, Johnson N, Herberth M, Howe KL, Jackson DK, Miretti MM, Fiegler H, Marioni JC, Birney E, Hubbard TJP, Carter NP, Tavaré S, Beck S. An integrated resource for genome-wide identification and analysis of human tissue-specific differentially methylated regions (tDMRs). Genome Res. 2008;18:1518–1529. doi: 10.1101/gr.077479.108. - DOI - PMC - PubMed
    1. Esteller M. Cancer epigenomics: DNA methylomes and histone-modification maps. Nat Rev Genet. 2007;8:286–298. doi: 10.1038/nrg2005. - DOI - PubMed
    1. Koga Y, Pelizzola M, Cheng E, Krauthammer M, Sznol M, Ariyan S, Narayan D, Molinaro AM, Halaban R, Weissman SM. Genome-wide screen of promoter methylation identifies novel markers in melanoma. Genome Res. 2009;19:1462–1470. doi: 10.1101/gr.091447.109. - DOI - PMC - PubMed
    1. Irizarry RA, Ladd-Acosta C, Wen B, Wu Z, Montano C, Onyango P, Cui H, Gabo K, Rongione M, Webster M, Ji H, Potash JB, Sabunciyan S, Feinberg AP. The human colon cancer methylome shows similar hypo- and hypermethylation at conserved tissue-specific CpG island shores. Nat Genet. 2009;41:178–186. doi: 10.1038/ng.298. - DOI - PMC - PubMed
    1. Schumacher A, Kapranov P, Kaminsky Z, Flanagan J, Assadzadeh A, Yau P, Virtanen C, Winegarden N, Cheng J, Gingeras T, Petronis A. Microarray-based DNA methylation profiling: technology and applications. Nucleic Acids Res. 2006;34:528–542. doi: 10.1093/nar/gkj461. - DOI - PMC - PubMed

Publication types

MeSH terms

Substances

LinkOut - more resources