Global landscape of recent inferred Darwinian selection for Homo sapiens
Comparative Study
Proc Natl Acad Sci U S A. 2006 Jan 3;103(1):135-40.
doi: 10.1073/pnas.0509691102. Epub 2005 Dec 21.
- PMID: 16371466
- PMCID: PMC1317879
- DOI: 10.1073/pnas.0509691102
Eric T Wang et al. Proc Natl Acad Sci U S A. 2006.
Abstract
By using the 1.6 million single-nucleotide polymorphism (SNP) genotype data set from Perlegen Sciences [Hinds, D. A., Stuve, L. L., Nilsen, G. B., Halperin, E., Eskin, E., Ballinger, D. G., Frazer, K. A. & Cox, D. R. (2005) Science 307, 1072-1079], a probabilistic search for the landscape exhibited by positive Darwinian selection was conducted. By sorting each high-frequency allele by homozygosity, we search for the expected decay of adjacent SNP linkage disequilibrium (LD) at recently selected alleles, eliminating the need for inferring haplotype. We designate this approach the LD decay (LDD) test. By these criteria, 1.6% of Perlegen SNPs were found to exhibit the genetic architecture of selection. These results were confirmed on an independently generated data set of 1.0 million SNP genotypes (International Human Haplotype Map Phase I freeze). Simulation studies indicate that the LDD test, at the megabase scale used, effectively distinguishes selection from other causes of extensive LD, such as inversions, population bottlenecks, and admixture. The approximately 1,800 genes identified by the LDD test were clustered according to Gene Ontology (GO) categories. Based on overrepresentation analysis, several predominant biological themes are common in these selected alleles, including host-pathogen interactions, reproduction, DNA metabolism/cell cycle, protein metabolism, and neuronal function.
Figures

LD patterns surrounding DRD4 7R and G6PD V202M. The observed FRC, associated with a minor allele under selection (DRD4 7R and G6PD V202M), are plotted vs. distance. FRC is calculated assuming the selected variant arose on a single chromosome (haplotype) (8). The indicated logistic function curves are approximated as sigmoidal, indicating the increasing decay of LD with distance with maximum assumed value of 0.5. Only sites in one direction from the selected allele are shown. The proximal region of the DRD4 7R data are shown at increased resolution in Inset. The approximate current Perlegen (1) data set detection limit (gray) is indicated.
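
The caption above describes the FRC rising sigmoidally with distance toward an assumed maximum of 0.5. A minimal sketch of such a curve follows. The parameter names `midpoint_bp` and `steepness`, the use of linear (rather than log-scaled) distance, and the example values are assumptions made for illustration; they are not taken from the paper.

```python
import math

FRC_MAX = 0.5  # plateau value stated in the caption


def logistic_frc(distance_bp: float, midpoint_bp: float, steepness: float) -> float:
    """Hypothetical logistic model of FRC as a function of distance.

    midpoint_bp: distance at which FRC reaches half of FRC_MAX (assumed parameter).
    steepness:   growth rate per base pair (assumed parameter).
    """
    return FRC_MAX / (1.0 + math.exp(-steepness * (distance_bp - midpoint_bp)))


if __name__ == "__main__":
    # Illustrative values only: a midpoint of 200 kb and a gentle slope.
    for d in (0, 100_000, 200_000, 400_000, 800_000):
        print(d, round(logistic_frc(d, midpoint_bp=200_000, steepness=1e-5), 4))
```

Under this parameterization, a slower approach to 0.5 corresponds to LD persisting over longer distances.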

Probabilistic method for finding unusual genetic architectures. (A) Binning on major/minor alleles. Each individual is sorted based on homozygosity at the major or minor allele at site S (arrowhead). (B) Compute fraction of adjacent recombinant chromosomes. The distance (d1–d3) and FRC for each neighboring SNP is then computed and stored. This list is then used to compute the ALnLH for each site (see text). Using only homozygous individuals for the computation eliminates the need to infer haplotypes.
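
The binning and FRC steps in the caption above can be sketched roughly as follows. This is not the authors' implementation: the genotype coding (0, 1, or 2 copies of one allele per SNP), the function name `frc_profile`, and in particular the approximation of FRC as the frequency of the less common neighboring allele among chromosomes in the homozygous bin are all assumptions made for illustration. The ALnLH step is not shown.

```python
from typing import List, Optional, Sequence, Tuple

Genotype = Optional[int]  # 0, 1, or 2 copies of an allele; None means missing


def frc_profile(
    genotypes: Sequence[Sequence[Genotype]],
    positions: Sequence[int],
    site: int,
    homozygous_code: int,
) -> List[Tuple[int, float]]:
    """Return (distance, FRC) pairs for every SNP neighboring `site`.

    Only individuals homozygous at `site` (genotype == homozygous_code, i.e. 0
    or 2) are used, so both of their chromosomes carry the same allele at the
    site and no haplotype inference is needed.
    """
    # Step A: bin individuals by homozygosity at the target site.
    bin_rows = [row for row in genotypes if row[site] == homozygous_code]

    profile = []
    for j, pos in enumerate(positions):
        if j == site:
            continue
        calls = [row[j] for row in bin_rows if row[j] is not None]
        if not calls:
            continue  # no usable genotypes at this neighbor
        # Step B (assumed definition): frequency of the less common neighbor
        # allele among the 2 * n chromosomes in the bin, which is at most 0.5.
        freq = sum(calls) / (2 * len(calls))
        profile.append((abs(pos - positions[site]), min(freq, 1.0 - freq)))
    return profile


if __name__ == "__main__":
    # Toy data: 5 individuals x 3 SNPs, invented for illustration.
    toy = [[2, 2, 0], [2, 2, 1], [2, 1, 0], [0, 0, 2], [1, 1, 1]]
    print(frc_profile(toy, positions=[1000, 1500, 4000], site=0, homozygous_code=2))
```

Running the function once with `homozygous_code=2` and once with `homozygous_code=0` gives the two profiles (one per allele at the site) that the method compares.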

Darwin's fingerprint. The global landscape (black lines) of recent inferred Darwinian selection for the Perlegen (PLG) and HapMap (CEU, CHB, JPT, and YRI) data sets is shown, aligned along chromosomes and genes (blue lines). A larger version of this figure is available as Fig. 9, and higher-resolution analysis can be obtained from the authors for display on the University of California at Santa Cruz Genome Browser (28).

Example of inferred selection at the Reticulon gene (RTN1), which encodes a neuroendocrine-specific protein thought to affect the formation of amyloid plaques in Alzheimer's disease (29, 30). (A) Inferred selected SNPs in the promoter region (red) are shown along with all annotated SNPs (black). (B–D) The randomness for neighboring recombinant chromosomes for the major RTN1 allele (blue) at this site exemplifies the genome average, with little long-range LD. In contrast, the minor RTN1 allele (yellow) at this site closely matches the LDD model for selection. The horizontal axis labels distance away from each centered SNP, and the vertical axis is FRC (Fig. 1). (B) Perlegen data set. (C) CEU HapMap data set. (D) African ancestry (YRI) HapMap data set. The Asian HapMap data sets resemble the CEU architecture (data not shown). Note the twofold horizontal axis scale change for the YRI display, reflecting the more rapid LDD at this site in this population.

Overrepresented GO categories are not random and represent six biological themes. A total of 407 HapMap CEU selected genes are classifiable under Biological Process GO categories. For these classified genes, 870 biological themes with positive ease values were identified, as indicated. Six functional categories constitute 82% of the –log(ease) scores of >0.65, indicated by colored flags. Each flag is color-coded for one of these specific categories, namely pathogen–host interaction, reproduction, DNA metabolism (including putative transcription factors), cell cycle, protein metabolism, and neuronal function.
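
To illustrate what an overrepresentation score measures, the sketch below computes a plain one-sided hypergeometric tail probability for a GO category. This is a stand-in chosen for simplicity, not the ease score used in the figure, and every number in the example is invented rather than taken from the paper. The function name `overrepresentation_p` is likewise hypothetical.

```python
from math import comb


def overrepresentation_p(hits: int, selected: int, category: int, background: int) -> float:
    """P(X >= hits) when `selected` genes are drawn without replacement from
    `background` genes, of which `category` belong to the GO category."""
    upper = min(selected, category)
    total = comb(background, selected)
    tail = sum(
        comb(category, i) * comb(background - category, selected - i)
        for i in range(hits, upper + 1)
    )
    return tail / total


if __name__ == "__main__":
    # Invented example: 8 of 40 selected genes fall in a category that
    # holds 50 of 1,000 background genes.
    print(overrepresentation_p(hits=8, selected=40, category=50, background=1000))
```

A small probability indicates that the category holds more selected genes than random sampling from the background would typically produce.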
Similar articles
- Genetic evidence for ongoing balanced selection at human DNA repair genes ERCC8, FANCC, and RAD51C. Wang ET, Moyzis RK. Mutat Res. 2007 Mar 1;616(1-2):165-74. doi: 10.1016/j.mrfmmm.2006.11.030. Epub 2007 Jan 25. PMID: 17257630
- Eberle MA, Rieder MJ, Kruglyak L, Nickerson DA. PLoS Genet. 2006 Sep 8;2(9):e142. doi: 10.1371/journal.pgen.0020142. Epub 2006 Jul 25. PMID: 16965180 Free PMC article.
- Scalable linkage-disequilibrium-based selective sweep detection: a performance guide. Alachiotis N, Pavlidis P. Gigascience. 2016 Feb 8;5:7. doi: 10.1186/s13742-016-0114-9. eCollection 2016. PMID: 26862394 Free PMC article.
- Barnes MR. Brief Bioinform. 2006 Sep;7(3):211-24. doi: 10.1093/bib/bbl021. Epub 2006 Jul 28. PMID: 16877472 Review.
- SNP and haplotype variation in the human genome. Salisbury BA, Pungliya M, Choi JY, Jiang R, Sun XJ, Stephens JC. Mutat Res. 2003 May 15;526(1-2):53-61. doi: 10.1016/s0027-5107(03)00014-9. PMID: 12714183 Review.
Cited by
- Localizing recent adaptive evolution in the human genome. Williamson SH, Hubisz MJ, Clark AG, Payseur BA, Bustamante CD, Nielsen R. PLoS Genet. 2007 Jun;3(6):e90. doi: 10.1371/journal.pgen.0030090. Epub 2007 Apr 20. PMID: 17542651 Free PMC article.
- Rapid detection of positive selection in genes and genomes through variation clusters. Wagner A. Genetics. 2007 Aug;176(4):2451-63. doi: 10.1534/genetics.107.074732. Epub 2007 Jul 1. PMID: 17603100 Free PMC article.
- Evans PD, Mekel-Bobrov N, Vallender EJ, Hudson RR, Lahn BT. Proc Natl Acad Sci U S A. 2006 Nov 28;103(48):18178-83. doi: 10.1073/pnas.0606966103. Epub 2006 Nov 7. PMID: 17090677 Free PMC article.
- Saccone SF, Bierut LJ, Chesler EJ, Kalivas PW, Lerman C, Saccone NL, Uhl GR, Li CY, Philip VM, Edenberg HJ, Sherry ST, Feolo M, Moyzis RK, Rutter JL. PLoS One. 2009;4(4):e5225. doi: 10.1371/journal.pone.0005225. Epub 2009 Apr 21. PMID: 19381300 Free PMC article.
- Scanning for genomic regions subject to selective sweeps using SNP-MaP strategy. Deng L, Tang X, Chen W, Lin J, Lai Z, Liu Z, Zhang D. Genomics Proteomics Bioinformatics. 2010 Dec;8(4):256-61. doi: 10.1016/S1672-0229(10)60027-7. PMID: 21382594 Free PMC article.
References
- Hinds, D. A., Stuve, L. L., Nilsen, G. B., Halperin, E., Eskin, E., Ballinger, D. G., Frazer, K. A. & Cox, D. R. (2005) Science 307, 1072–1079.
- Risch, N. & Merikangas, K. (1996) Science 273, 1516–1517.
- Zwick, M. E., Cutler, D. J. & Chakravarti, A. (2000) Annu. Rev. Genomics Hum. Genet. 1, 387–407.
- Reich, D. E. & Lander, E. S. (2001) Trends Genet. 17, 502–510.