Genomic variation in seven Khoe-San groups reveals adaptation and complex African history - PubMed
- ️Sun Jan 01 2012
. 2012 Oct 19;338(6105):374-9.
doi: 10.1126/science.1227721. Epub 2012 Sep 20.
Pontus Skoglund, Per Sjödin, Lucie M Gattepaille, Dena Hernandez, Flora Jay, Sen Li, Michael De Jongh, Andrew Singleton, Michael G B Blum, Himla Soodyall, Mattias Jakobsson
Affiliations
- PMID: 22997136
- PMCID: PMC8978294
- DOI: 10.1126/science.1227721
Genomic variation in seven Khoe-San groups reveals adaptation and complex African history
Carina M Schlebusch et al. Science. 2012.
Abstract
The history of click-speaking Khoe-San, and African populations in general, remains poorly understood. We genotyped ~2.3 million single-nucleotide polymorphisms in 220 southern Africans and found that the Khoe-San diverged from other populations ≥100,000 years ago, but population structure within the Khoe-San dated back to about 35,000 years ago. Genetic variation in various sub-Saharan populations did not localize the origin of modern humans to a single geographic region within Africa; instead, it indicated a history of admixture and stratification. We found evidence of adaptation targeting muscle function and immune response; potential adaptive introgression of protection from ultraviolet light; and selection predating modern human diversification, involving skeletal and neurological development. These new findings illustrate the importance of African genomic diversity in understanding human evolutionary history.
Figures

(A) Sampling locations. (B) PCA of African individuals showing PC1 and PC2 rotated to fit geography. (C) PCA for Khoe-San populations (~2.3 million SNPs). (D) Pairwise FST for sub-Saharan populations (excluding the Hadza; see fig. S24 for comparison). (E) Prediction of the genetic components from geographic, linguistic, and subsistence covariates. The predictive error relative to geography is given for each combination of covariates (values <1 show improved predictive capacity as compared to that of geography).

(A) Rooted population topology from a concordance-test approach (14). Nodes with boot-strap support <50% are collapsed (dashed lines); all other nodes have boot-strap support >85%. (B) Clustering of 403 sub-Saharan African individuals (~270,000 SNPs), assuming 2 to 11 clusters. (C) Clustering of 118 southern African individuals (~2.3 million SNPs), assuming 2 to 8 clusters. Compare with fig. S16, which includes recently admixed individuals.

(A) Expected heterozygosity of 5 SNP haplotypes as a function of haplotype length. (B) Haplotype richness for 5 SNP haplotypes as a function of haplotype length. (C) LD, represented as r2, as a function of distance. (D) Cumulative RoHs for each population (0.5- to 1-Mb runs and averaged across individuals). (E to H) Heat maps of the summary statistics indicated in (A) to (D). (E) and (F) show the results at 50-kb windows; (G) shows ρ = 4 Ne × c, where ρ is estimated from fitting r2-decay curves to simulated data from a constant-size model (14) and c is the unscaled recombination rate; and (H) shows the population cRoHs (0.5- to 1-Mb class) averaged across 50 replicates of subsampling. For (A) to (C), the colors of the African populations are as in Fig. 1, and gray lines represent various non-African groups. For (A) to (H), all populations were randomly downsampled to seven individuals (without replacement), and SNPs with minor allele frequency < 10% were excluded.

(A) iHS values for each SNP on chromosome 10 in Ju/’hoansi, surrounding the muscle gene MYPN, and (B) on chromosome 6 in #Khomani, surrounding the immune system genes PRSS16 and POM121L2. The empirical P values (14) for 200-kb regions centered on the peak are given for each population. Locations of genes are shown by blue rectangles. (C) The greatest FST values for particular SNPs and pairwise population comparisons versus genome-wide FST estimates for the same population comparison. The top pairwise comparisons involving the Nama and another Khoe-San population (yellow) are found in the same region, separated by less than 4000 bp. (D) Proportion of genome-local ancestry (14, 24) for chromosome 16 in the Nama assigned to Khoe-San, Herero, or Bantu-speakers (South Africa). The population-specific chromosome-wide means are shown as dashed horizontal lines. The 99 percentile for Bantu-speakers (South Africa) ancestry, and the 1 percentile for the Khoe-San ancestry are shown as dotted horizontal lines. The two top SNP FST values are highlighted in yellow in (C) and (D). (E) Illustration of the aPBS approach for detecting selective sweeps in early modern humans. AMH, anatomically modern humans. (F) Stretches of consecutive positive aPBS values, with the top aPBS value plotted against the size of the stretch.
Similar articles
-
Breton G, Schlebusch CM, Lombard M, Sjödin P, Soodyall H, Jakobsson M. Breton G, et al. Curr Biol. 2014 Apr 14;24(8):852-8. doi: 10.1016/j.cub.2014.02.041. Epub 2014 Apr 3. Curr Biol. 2014. PMID: 24704072
-
Fan S, Kelly DE, Beltrame MH, Hansen MEB, Mallick S, Ranciaro A, Hirbo J, Thompson S, Beggs W, Nyambo T, Omar SA, Meskel DW, Belay G, Froment A, Patterson N, Reich D, Tishkoff SA. Fan S, et al. Genome Biol. 2019 Apr 26;20(1):82. doi: 10.1186/s13059-019-1679-2. Genome Biol. 2019. PMID: 31023338 Free PMC article.
-
Schlebusch CM, Lombard M, Soodyall H. Schlebusch CM, et al. BMC Evol Biol. 2013 Feb 27;13:56. doi: 10.1186/1471-2148-13-56. BMC Evol Biol. 2013. PMID: 23445172 Free PMC article.
-
African genetic diversity provides novel insights into evolutionary history and local adaptations.
Choudhury A, Aron S, Sengupta D, Hazelhurst S, Ramsay M. Choudhury A, et al. Hum Mol Genet. 2018 Aug 1;27(R2):R209-R218. doi: 10.1093/hmg/ddy161. Hum Mol Genet. 2018. PMID: 29741686 Free PMC article. Review.
-
Bantu-speaker migration and admixture in southern Africa.
Choudhury A, Sengupta D, Ramsay M, Schlebusch C. Choudhury A, et al. Hum Mol Genet. 2021 Apr 26;30(R1):R56-R63. doi: 10.1093/hmg/ddaa274. Hum Mol Genet. 2021. PMID: 33367711 Free PMC article. Review.
Cited by
-
Harvati K, Ackermann RR. Harvati K, et al. Nat Ecol Evol. 2022 Oct;6(10):1573-1585. doi: 10.1038/s41559-022-01875-z. Epub 2022 Sep 5. Nat Ecol Evol. 2022. PMID: 36064759
-
Genetic Connections and Convergent Evolution of Tropical Indigenous Peoples in Asia.
Deng L, Pan Y, Wang Y, Chen H, Yuan K, Chen S, Lu D, Lu Y, Mokhtar SS, Rahman TA, Hoh BP, Xu S. Deng L, et al. Mol Biol Evol. 2022 Feb 3;39(2):msab361. doi: 10.1093/molbev/msab361. Mol Biol Evol. 2022. PMID: 34940850 Free PMC article.
-
Identification of putative target genes of the transcription factor RUNX2.
Kuhlwilm M, Davierwala A, Pääbo S. Kuhlwilm M, et al. PLoS One. 2013 Dec 12;8(12):e83218. doi: 10.1371/journal.pone.0083218. eCollection 2013. PLoS One. 2013. PMID: 24349465 Free PMC article.
-
The Extent and Impact of Variation in ADME Genes in Sub-Saharan African Populations.
da Rocha JEB, Othman H, Botha G, Cottino L, Twesigomwe D, Ahmed S, Drögemöller BI, Fadlelmola FM, Machanick P, Mbiyavanga M, Panji S, Wright GEB, Adebamowo C, Matshaba M, Ramsay M, Simo G, Simuunza MC, Tiemessen CT, Baldwin S, Chiano M, Cox C, Gross AS, Thomas P, Gamo FJ, Hazelhurst S. da Rocha JEB, et al. Front Pharmacol. 2021 Apr 28;12:634016. doi: 10.3389/fphar.2021.634016. eCollection 2021. Front Pharmacol. 2021. PMID: 34721006 Free PMC article.
-
Personalizing medicine in Africa: current state, progress and challenges.
Owolabi P, Adam Y, Adebiyi E. Owolabi P, et al. Front Genet. 2023 Sep 19;14:1233338. doi: 10.3389/fgene.2023.1233338. eCollection 2023. Front Genet. 2023. PMID: 37795248 Free PMC article. Review.
References
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Molecular Biology Databases