Demographic history of European populations of Arabidopsis thaliana
Tue Jan 01 2008
Olivier François et al. PLoS Genet. 2008.
Abstract
The model plant species Arabidopsis thaliana is successful at colonizing land that has recently undergone human-mediated disturbance. To investigate the prehistoric spread of A. thaliana, we applied approximate Bayesian computation and explicit spatial modeling to 76 European accessions sequenced at 876 nuclear loci. We find evidence that a major migration wave occurred from east to west, affecting most of the sampled individuals. The longitudinal gradient appears to result from the plant having spread in Europe from the east approximately 10,000 years ago, with a rate of westward spread of approximately 0.9 km/year. This wave-of-advance model is consistent with a natural colonization from an eastern glacial refugium that overwhelmed ancient western lineages. However, the speed and time frame of the model also suggest that the migration of A. thaliana into Europe may have accompanied the spread of agriculture during the Neolithic transition.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures

(A) Membership coefficients in K_max = 5 putative populations, computed using the average values over the 10 TESS runs with the smallest values of the deviance information criterion from a total of 100 runs. Similar results were obtained with other values of K_max from 4 to 10. (B) Interpolated membership coefficients in the three apparent subpopulations: western cluster, eastern cluster, and northern cluster.

Correlation (R) map for the linear regression of expected heterozygosity on great circle distance. We used 300×180 points on a two-dimensional lattice covering Europe, and we computed distances from each lattice point considered as a potential source. The dots represent the centers of the 7 population samples used in the regression analysis.
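The regression behind this map can be sketched roughly as follows. This is a minimal illustration, not the authors' code: it assumes the haversine formula on a spherical Earth of radius 6,371 km, and the function names (`great_circle_km`, `pearson_r`, `correlation_map`) are hypothetical. For each candidate source on the lattice, it computes the correlation R between distance to that source and expected heterozygosity across the population samples.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius; an assumption of this sketch


def great_circle_km(lat1, lon1, lat2, lon2):
    """Haversine great-circle distance (km) between two points given in degrees."""
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dphi = p2 - p1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dlmb / 2) ** 2
    # Clamp to 1.0 to guard against rounding slightly above the asin domain.
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(min(1.0, a)))


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def correlation_map(samples, lats, lons):
    """Map each lattice point (lat, lon) to R for heterozygosity vs. distance.

    samples: list of (lat, lon, expected_heterozygosity), one per population sample.
    """
    het = [h for _, _, h in samples]
    result = {}
    for lat in lats:
        for lon in lons:
            dist = [great_circle_km(lat, lon, s_lat, s_lon) for s_lat, s_lon, _ in samples]
            result[(lat, lon)] = pearson_r(dist, het)
    return result
```

Under a range expansion, heterozygosity is generally expected to decline with distance from the origin, so lattice points with strongly negative R are the more plausible candidate sources in this kind of analysis.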

The 4 demographic scenarios (Models A–D) and their associated Bayes factors. Model A is the model with constant population size, N_0. Model B is a model with an exponentially growing population size (present size, N_0; ancestral size, N_1; time since the onset of expansion, t_0). In Model C, the growth is exponential between two periods with constant size (present size, N_0; ancestral size, N_1; time since the onset of expansion, t_0; time since the end of expansion, t_1). Model D is similar to Model B, but it includes an ancient bottleneck before expansion. Variants of these 4 models, including variable mutation rates across loci, are considered here. The Bayes factors (top boxes) correspond to the ratio of the weight of evidence of each model to the weight of evidence of Model B. Two window sizes, δ_0.01 and δ_0.05, were used when computing the Bayes factors. These window sizes correspond to the 1% and 5% quantiles of the distance between the values of the summary statistics obtained under Model B and the observed values of the summary statistics. Rounded to one decimal place, the Bayes factors were identical for the 2 window sizes, except for Model C, for which a minor difference was observed (1.8 for δ_0.05 instead of 1.9).
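One common way to approximate such Bayes factors with rejection-based approximate Bayesian computation is sketched below. It is an illustration under stated assumptions, not the authors' procedure: it assumes equal prior weight on the models, uses the acceptance rate within the window δ as the weight of evidence, and takes δ as a nearest-rank quantile of the distances simulated under the reference model (Model B in the caption). All function names are hypothetical.

```python
import math


def nearest_rank_quantile(values, q):
    """Nearest-rank empirical quantile of `values` for 0 < q <= 1."""
    ordered = sorted(values)
    rank = max(1, math.ceil(q * len(ordered)))
    return ordered[rank - 1]


def acceptance_rate(distances, delta):
    """Fraction of simulations whose summary-statistic distance is within delta."""
    return sum(d <= delta for d in distances) / len(distances)


def abc_bayes_factor(dist_model, dist_reference, q):
    """Approximate Bayes factor of a model relative to a reference model.

    dist_model, dist_reference: distances between simulated and observed
    summary statistics under each model (assumed precomputed).
    q: quantile of the reference distances that defines the window size.
    """
    delta = nearest_rank_quantile(dist_reference, q)
    return acceptance_rate(dist_model, delta) / acceptance_rate(dist_reference, delta)
```

With this definition the reference model always has a Bayes factor of 1, and calling `abc_bayes_factor` with `q=0.01` and `q=0.05` mirrors the two window sizes described in the caption.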

Plot of the joint posterior distribution for the time of onset of the expansion, t_0, and the length of the expansion, t_0 − t_1. Computations were performed under demographic Model C, in which the population was initially constant, then grew exponentially until t_1, and then remained constant until the present. Percentages represent the cumulative probabilities under the density curve. The straight line indicates that the duration of expansion cannot be longer than the time elapsed since the onset of expansion.

The mean number of distinct haplotypes and the mean number of private haplotypes of the central European population and the northern European population as functions of sample size. Vertical bars show standard error.

The mean number of distinct haplotypes and the mean number of private haplotypes of two simulated populations, as functions of sample size. The dark orange lines show the simulation results for a population of size 135,000, and the dark green lines show the simulation results for a population of size 135,000×1/4. The top panel shows the case when the split time is 0. Below follow the results for increasing split times. No migration is assumed. The split time T is given in units of population size. The fit of the simulated data to the observed data was evaluated by the mean across the 100 simulations of the sum of squared differences (SSD) between each simulated data set and the observed data.
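The fit criterion in this caption, the mean over simulations of the sum of squared differences, can be written as a short sketch. The function names are hypothetical, and each data set is assumed to be a sequence of haplotype counts indexed by sample size, in the same order for simulated and observed data.

```python
def ssd(simulated, observed):
    """Sum of squared differences between one simulated curve and the observed curve."""
    if len(simulated) != len(observed):
        raise ValueError("curves must have the same length")
    return sum((s - o) ** 2 for s, o in zip(simulated, observed))


def mean_ssd(simulations, observed):
    """Mean SSD across all simulated data sets; lower values indicate a better fit."""
    return sum(ssd(sim, observed) for sim in simulations) / len(simulations)
```

Comparing `mean_ssd` across split times would then rank the scenarios, with the smallest value marking the closest match to the observed curves.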

The mean number of distinct haplotypes and the mean number of private haplotypes of two simulated populations as functions of sample size, shown for 100 replicates. The dark orange lines show the simulation results for a population of size N_CE = 135,000, and the dark green lines show the results for a population of size 135,000×1/4, when T = 13,500 years. The top panel shows the case when the migration rate, m, equals 0, followed by the cases with m = 3 and m = 6 (normalized by N_CE). The results from the observed populations are also plotted for comparison (lighter orange and green lines).

(A) χ² distances between the simulated and the empirical folded frequency spectra as a function of the time of onset of the expansion. The other parameters were fixed at m = 0.25, r = 0.6–1.2, and N_1 = 10,000. The origin was placed north of the Black Sea (48°N, 35°E). The horizontal line corresponds to the 95% rejection interval of the χ² test (df = 3, see Methods). (B) Interpolated map of χ² distances between simulated and empirical folded spectra for 24 potential origins (black dots). The time of onset was fixed at 9,000 years BP, and the other parameters were fixed as in (A).
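A χ² distance of this kind can be sketched as a Pearson goodness-of-fit statistic. The sketch assumes the folded spectrum is binned into four frequency classes, which would give df = 3; the actual binning is described in the paper's Methods and is not reproduced here. The function names are hypothetical, and 7.815 is the standard 95% critical value of the χ² distribution with 3 degrees of freedom.

```python
CHI2_CRITICAL_DF3_95 = 7.815  # 95% quantile of the chi-square distribution, df = 3


def chi2_distance(observed_counts, expected_proportions):
    """Pearson chi-square statistic between observed class counts and expected proportions."""
    total = sum(observed_counts)
    return sum(
        (obs - total * p) ** 2 / (total * p)
        for obs, p in zip(observed_counts, expected_proportions)
    )


def is_compatible(observed_counts, expected_proportions, critical=CHI2_CRITICAL_DF3_95):
    """True when the simulated spectrum is not rejected at the 5% level."""
    return chi2_distance(observed_counts, expected_proportions) <= critical
```

Here `observed_counts` would hold the empirical site counts per frequency class and `expected_proportions` the class proportions from one simulated scenario, so each candidate onset time or origin yields one distance.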

Minor allele frequency spectra of empirical data and data simulated under the best-fitting model of spatial range expansion. Population growth followed the logistic model within each deme (see text for the other parameter settings). The solid line (grey) corresponds to the neutral folded frequency spectrum. (A) The empirical folded spectrum was computed from the 648 inter-genic and non-coding sequences. (B) The simulated spectrum was computed using the same number of neutral nucleotides as in the data. In simulations, expansion started 9,000 years ago from a potential origin north of the Black Sea (48°N, 35°E). Other locations from a large region around this potential origin yielded very similar simulated spectra.
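For readers unfamiliar with folded spectra, the sketch below shows how a minor allele frequency spectrum can be tabulated from per-site allele counts, and how the neutral reference curve is obtained under the standard constant-size neutral model, in which the expected number of sites with minor allele count i among n sequences is proportional to 1/i + 1/(n − i), halved when i = n − i. The function names and input layout are assumptions made for illustration.

```python
from collections import Counter


def folded_spectrum(allele_counts, n):
    """Folded site frequency spectrum for a sample of n sequences.

    allele_counts: per-site count of one (arbitrary) allele among the n sequences.
    Returns a list whose entry i-1 is the number of sites with minor allele count i.
    Monomorphic sites (count 0 or n) are ignored.
    """
    minor = Counter(min(c, n - c) for c in allele_counts if 0 < c < n)
    return [minor.get(i, 0) for i in range(1, n // 2 + 1)]


def neutral_folded_proportions(n):
    """Expected folded spectrum proportions under the standard neutral model."""
    weights = []
    for i in range(1, n // 2 + 1):
        w = 1.0 / i + 1.0 / (n - i)
        if i == n - i:
            w /= 2.0  # classes i and n - i coincide, so avoid counting twice
        weights.append(w)
    total = sum(weights)
    return [w / total for w in weights]
```

An excess of low-frequency classes relative to `neutral_folded_proportions` is the usual signature of population growth, which is the kind of departure such a comparison is designed to reveal.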