Identification of a locus under complex positive selection in Drosophila simulans by haplotype mapping and composite-likelihood estimation - PubMed
Comparative Study
Colin D Meiklejohn et al. Genetics. 2004 Sep.
Abstract
The recent action of positive selection is expected to influence patterns of intraspecific DNA sequence variation in chromosomal regions linked to the selected locus. These effects include decreased polymorphism, increased linkage disequilibrium, and an increased frequency of derived variants. These effects are all expected to dissipate with distance from the selected locus due to recombination. Therefore, in regions of high recombination, it should be possible to localize a target of selection to a relatively small interval. Previously described patterns of intraspecific variation in three tandemly arranged, testes-expressed genes (janusA, janusB, and ocnus) in Drosophila simulans included all three of these features. Here we expand the original sample and also survey nucleotide polymorphism at three neighboring loci. On the basis of recombination events between derived and ancestral alleles, we localize the target of selection to a 1.5-kb region surrounding janusB. A composite-likelihood-ratio test based on the spatial distribution and frequency of derived polymorphic variants corroborates this result and provides an estimate of the strength of selection. However, the data are difficult to reconcile with the simplest model of positive selection, whereas a new composite-likelihood method suggests that the data are better described by a model in which the selected allele has not yet gone to fixation.
Figures

Diagram of the portion of chromosomal band 99D studied here. Solid bars represent exons and intervening lines represent introns. Bars above represent sequenced regions.

Sequence data for six genes in region 99D. The D. melanogaster (mel) sequence is from Adams et al. (2000); the D. yakuba (yak) sequences are from Parsch et al. (2001b) (janA-ocn) and Caccone et al. (1996) (sryα). Asterisks below the sequences indicate nonsynonymous polymorphisms; vertical lines indicate noncoding polymorphisms. Boxed sites indicate that the rare D. simulans allele matches D. melanogaster and the common D. simulans allele matches D. yakuba; shaded sites indicate that the rare D. simulans allele matches D. yakuba and the common D. simulans allele matches D. melanogaster. Abbreviations for the location of origin of the D. simulans lines are: SA, South Africa; SM, St. Martin; JA, Japan; FR, France; TU, Tunisia; AU, Australia; HA, Haiti; US, United States; SE, Seychelles; PE, Peru; KE, Kenya; CO, Congo; PO, Polynesia; and ZI, Zimbabwe.

Low polymorphism and excess of singletons at janB. Graphs were generated using DnaSP 3.99 (Rozas et al. 2003) with a sliding window of 400 nucleotides and a step size of 25 nucleotides. (A) Average pairwise differences (Tajima 1983) divided by divergence. (B) Fu and Li's D (Fu and Li 1993). The horizontal line indicates values of D that are significantly different from 0 at P < 0.05.
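
The sliding-window statistic in panel A can be illustrated with a minimal Python sketch. This is not DnaSP's implementation: the function names `pairwise_diversity` and `sliding_pi` are hypothetical, the input is assumed to be a list of equal-length aligned sequences without gaps, and the division by divergence used in the figure is omitted. Only the window of 400 nucleotides and the step of 25 nucleotides are taken from the caption.

```python
from itertools import combinations


def pairwise_diversity(seqs):
    """Average number of pairwise differences among aligned sequences."""
    n = len(seqs)
    if n < 2:
        return 0.0
    # Count mismatched positions for every unordered pair of sequences.
    total = sum(
        sum(a != b for a, b in zip(s, t))
        for s, t in combinations(seqs, 2)
    )
    return total / (n * (n - 1) / 2)


def sliding_pi(seqs, window=400, step=25):
    """Return (window midpoint, pairwise diversity) for each window.

    Assumes all sequences have the same length and contain no gaps;
    windows that would run past the end of the alignment are skipped.
    """
    length = len(seqs[0])
    results = []
    for start in range(0, max(length - window, 0) + 1, step):
        chunk = [s[start:start + window] for s in seqs]
        results.append((start + window // 2, pairwise_diversity(chunk)))
    return results
```

A dip in the returned values across adjacent windows corresponds to the kind of localized reduction in polymorphism that the caption describes at janB.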

Parameter estimation for data sets simulated under incomplete sweep with α = 500, X = 6300, β = 0.7. (A) Joint distribution of X̂ and β̂. (B) Joint distribution of β̂ and α̂.

Average number of pairwise differences (π, solid line) and number of segregating sites (S, dashed line) for subsets of chromosomes that minimize π (πm(i)), graphed against the number of chromosomes in each subset. See text for details.

The composite-likelihood ratio (CLR) as a function of the position of the putative beneficial mutation. Sequenced segments corresponding to six genes in this region are indicated by horizontal lines above the x-axis. The CLR was obtained from 26 chromosomes corresponding to haplotype group I. The dashed line represents the 95th percentile of CLR (4.52) determined by neutral simulations.
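
The significance cutoff in this caption is an empirical percentile of a null distribution. The following Python sketch shows one way such a cutoff could be computed, assuming a list of CLR values from neutral simulations is already available. The function name `empirical_threshold` and the nearest-rank percentile rule are assumptions for illustration; the record does not describe how the authors computed the 95th percentile.

```python
def empirical_threshold(null_stats, percentile=95):
    """Nearest-rank percentile of statistics simulated under the null.

    `null_stats` is assumed to hold one CLR value per neutral simulation;
    `percentile` is an integer between 1 and 100.
    """
    if not null_stats:
        raise ValueError("null_stats must not be empty")
    ordered = sorted(null_stats)
    # Ceiling of percentile * n / 100, done in integer arithmetic.
    rank = -(-percentile * len(ordered) // 100)
    return ordered[max(rank, 1) - 1]


def exceeds_threshold(observed, null_stats, percentile=95):
    """True when the observed statistic lies above the null percentile."""
    return observed > empirical_threshold(null_stats, percentile)
```

Under this rule, an observed CLR is treated as significant at the 5% level when it exceeds the cutoff, which the caption reports as 4.52 for these data.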