Maternal History of Oceania from Complete mtDNA Genomes: Contrasting Ancient Diversity with Recent Homogenization Due to the Austronesian Expansion
Abstract
Archaeology, linguistics, and existing genetic studies indicate that Oceania was settled by two major waves of migration. The first migration took place approximately 40 thousand years ago and these migrants, Papuans, colonized much of Near Oceania. Approximately 3.5 thousand years ago, a second expansion of Austronesian-speakers arrived in Near Oceania and the descendants of these people spread to the far corners of the Pacific, colonizing Remote Oceania. To assess the female contribution of these two human expansions to modern populations and to investigate the potential impact of other migrations, we obtained 1,331 whole mitochondrial genome sequences from 34 populations spanning both Near and Remote Oceania. Our results quantify the magnitude of the Austronesian expansion and demonstrate the homogenizing effect of this expansion on almost all studied populations. With regards to Papuan influence, autochthonous haplogroups support the hypothesis of a long history in Near Oceania, with some lineages suggesting a time depth of 60 thousand years, and offer insight into historical interpopulation dynamics. Santa Cruz, a population located in Remote Oceania, is an anomaly with extreme frequencies of autochthonous haplogroups of Near Oceanian origin; simulations to investigate whether this might reflect a pre-Austronesian versus Austronesian settlement of the island failed to provide unequivocal support for either scenario.
Main Text
Within the boundaries of Oceania, one of the first and one of the last major colonization events by anatomically modern humans occurred. Settlement of New Guinea and Australia, which were then joined as a single landmass known as Sahul, occurred at least 44 kiloannum (ka) ago1,2 and humans spread essentially instantaneously across the Vitiaz Straight and to the islands of the Bismarck Archipelago.3,4 Descendants of this initial expansion voyaged and settled at least as far as Buka at the northernmost tip of the Solomons archipelago by 28 ka ago,5 and Manus in the Admiralty Islands by 12 ka ago,6 indicating that they possessed sufficient watercraft and sailing skills to voyage at least 200 km. However, there are no indications that they made regular long-distance voyages beyond Near Oceania,7 which comprises New Guinea, the Bismarck Archipelago, and the Solomon Islands as far southeast as Makira. Instead, it seems that the settlement of Remote Oceania (comprising the Reef Islands, Santa Cruz, Vanuatu, New Caledonia, Fiji, and Polynesia) and possibly much of the Solomon Islands8 was achieved by the descendants of an expansion that began in Taiwan (or possibly elsewhere in Southeast Asia) about 5–6 ka ago,9–11 reached Near Oceania between 3.5 and 3.3 ka ago,12–14 and were the first to colonize areas in Remote Oceania beginning around 3.1 ka ago,15 culminating in the settlement of the Hawai’ian Islands,16 Easter Island,17 and New Zealand18 within the last 800–1,200 years.
Each of these two major expansion events is associated by anthropologists, archaeologists, and linguists with peoples, phenotypes, cultures, and languages. People believed to be descended from the first expansion within Near Oceania are often referred to as Papuan, generally practice patrilocality, tend to have darker skin pigmentation reflecting the root of the region’s historical name—Melanesia—as the “dark islands,”7 and speak languages that appear to be so old and deep rooting that linguists are still unsure about their true relationships.19,20 By contrast, the second migration into Near Oceania is indicated by the presence of a particular material culture, including, but not limited to, a distinctive style of pottery (Lapita) and is associated with speakers of Austronesian languages that are clearly related and descended from a single language called Proto Oceanic (see Lynch et al.21 and references therein). Reconstructions suggest they lived in small, highly mobile matrilocal groups22 and were phenotypically more similar to Asian populations (e.g., with generally lighter skin pigmentation) than are Papuans.
We can also make genetic ties to both of these founding populations. The Austronesian expansion is associated with the spread of mtDNA haplogroup B, particularly haplogroup B4a1 and its descendent lineages, throughout Island South East Asia, Oceania, and even to Madagascar in the west.10,23–28 In particular, haplogroup B4a1a1a, defined by the “Polynesian motif,”29 an A-G transition at position 16247, and its descendants are associated with the spread of Austronesians throughout Oceania. This haplogroup reaches near fixation in Remote Oceania,24,29,30 though position 16247 has been found to back-mutate repeatedly on independent lineages and as such must be examined carefully.31 Meanwhile, Papuan ancestry is associated with haplogroups Q, P, M27, M28, and M29, which appear to be autochthonous to Near Oceania.24,32,33 Studies of Y chromosome SNPs and short tandem repeats (STRs) also support a dual-parental population model for Oceanians, particularly Remote Oceanians.24,34–36 Interestingly, no single Y chromosome haplogroup is dominant in the same way that mtDNA haplogroup B is predominant in Remote Oceanians and the frequency of Y chromosomes of Near Oceanian origin (∼66%) is much greater than that of Y chromosomes of Asian origin (∼28%).24 Moreover, genome-wide data37–39 support the dual ancestry model with more Asian ancestry than Near Oceanian ancestry in remote Oceania (approximately 80% Asian, 20% Near Oceanian), meaning that the genome-wide average is intermediate between the mtDNA and Y chromosome results.
Apart from these two well-attested population expansions, there is additional evidence to suggest interactions with Southeast Asia around 5 ka ago. Archaeologically this is evidenced in part by the introduction of domesticated pigs to New Guinea,7,40 which is the approximate time frame that previous studies have suggested for the entry of haplogroup E to Near Oceania.25 However, whether these two events are connected remains to be resolved.
In this study, we examined the maternal population structure and the history of admixture across Oceania. Previous studies exploring the maternal histories of the area have been limited because they largely make use of only a small portion of the genome known as the hypervariable region (HVR) (e.g., see Kayser et al.,24 Friedlaender et al.,25 Delfin et al.41); other studies making use of whole mtDNA sequences from Oceania have been limited in terms of sample sizes and number of populations analyzed (e.g., see Soares et al.,42 Benton et al.43). Our data set is, to our knowledge, the most comprehensive to date, and consists of 1,331 whole mitochondrial genome sequences from 34 populations spread from the Bismarck Archipelago to Polynesia (Figure 1). To facilitate comprehension and visualization of the patterns of variation, these 34 populations have been amalgamated into six groups on the basis of geographic proximity and/or shared cultural features (Figure 1). The first group consists of populations from New Britain in the Bismarck Archipelago (Anem, Ata, Nakanai) and the second group consists of populations from Bougainville and Buka (Buin, Buka, Nagovisi, Nasioi, Siwai, Torau), two islands at the northern tip of the Solomons Archipelago that politically are part of Papua New Guinea, and the easternmost site of known settlement in Pleistocene Oceania.5 The only other confirmed pre-Austronesian settlements in the Solomons Archipelago are sites from Guadalcanal with indications of human occupation approximately 6 ka ago.44,45 The third group consists of populations found in the islands of the rest of the Solomons Archipelago (Choiseul, Gela, Guadalcanal, Isabel, Kolombangara, Makira, Malaita, Ranongga, Russell, Savo, Shortlands, Simbo, Vella Lavella). Santa Cruz is the only population to be grouped alone; it is treated separately because it has been shown to be a genetic outlier in previous studies of mtDNA and Y chromosome variation41,46 as well as being linguistically distinct. Originally classified as a Papuan language, currently the Santa Cruz language and closely related languages of the Temotu group are thought to represent a very deep branch of the Oceanic family of Austronesian languages.47 The final two groupings concern populations believed to be predominantly of Austronesian heritage: these include populations from across Remote Oceania ranging geographically from Fiji to the Cook Islands (Cook Islands, Fiji, Futuna, Niue, Samoa, Tonga, Tuvalu), as well as four populations classified as Polynesian Outliers (Bellona, Ontong Java, Rennell, Tikopia). The latter are populations in Near Oceania that, based on linguistic and cultural evidence, are thought to be descended primarily from back migrations from Polynesia.48,49
Figure 1.
Map of Study Area
Populations are colored as to their group assignment: New Britain (black), Bougainville (blue), Solomon Islands main chain (green), Santa Cruz (red), Remote Oceania (gray), Polynesian Outliers (orange). Italicized labels indicate Papuan speaking populations. Additional geographic points mentioned in text are identified in the map inset.
Samples from New Britain were collected, in the form of whole blood, as described previously.25 Plasma from these samples was then shipped to Leipzig where they were extracted in 2011 with the QIAGEN DNeasy Blood and Tissue kit, as per the manufacturer’s protocol. After extraction samples were quantified for human mtDNA content by qPCR as previously described50 and samples that were found to have mtDNA concentrations of less than 5 ng/μl were subjected to whole-genome amplification with the QIAGEN REPLI-G minikit as per the manufacturer’s protocol and then re-extracted with the DNeasy Blood and Tissue kit but replacing the proteinase K incubation with an incubation in QIAGEN Buffer AL. Bougainvillean samples were collected with written informed consent in 2011 as 2 ml of saliva and stored in 2 ml of lysis buffer (50 mM Tris, 50 mM EDTA, 50 mM sucrose, 100 mM NaCl, 1% SDS) as described previously.51 Extraction was completed with the QIAGEN DNA midi-kit. DNA samples from all other populations have been described previously.24,34,41,52,53 This study was approved by the ethics commission of the University of Leipzig Medical Faculty, the National Research Institute of Papua New Guinea, and the Papua New Guinea Medical Research Advisory Committee.
Multiplex sequencing libraries were constructed and enriched for mtDNA sequences according to previous protocols54,55 with additional modifications described previously56 and also below. Various Illumina platforms and lengths of sequencing reads were used during this study; the conditions for each sample are provided in Table S1 available online, along with details concerning the coverage and amount of missing data. The 1,331 samples were aligned with MUSCLE v.3.857 and visualized in BioEdit.58 Heteroplasmies and indels were confirmed with SAMtools.59 There are no significant differences in the number of variant sites called per sequence with respect to coverage or sequencing platform. 536 sequences from the Solomon Islands, Santa Cruz, and Polynesian Outlier populations that belong to haplogroup B4a1a1 and descendent lineages were previously published as part of a study on the instability of the 16247G allele.31
Haplogroups were assigned to consensus sequences for each sample with the Haplogrep webtool and Phylotree Build 15.60,61 The haplogroup assigned to each individual is provided in Table S1, and haplogroup frequencies for each population are in Table S2. The relative frequencies of haplogroups of putative Near Oceanian, Austronesian (haplogroup B and sublineages), or other origin are depicted in Figure 2. In general, the frequencies of haplogroups of putative Near Oceanic origin are greater in New Britain and Bougainville and then decrease in frequency the further out a population is in the Pacific (Figure 2). Two populations that contradict this trend are the Nagovisi of Bougainville, with 97% frequency of haplogroup B, and Santa Cruz, with 85% frequency of Near Oceanian haplogroups (Figure 2).
Figure 2.
Relative Frequency of Near Oceanian, Austronesian, and Other Haplogroups by Population
For designation of haplogroup origin, see Table S2.
Haplogroup B is by far the predominant lineage in this study, accounting for 76% of the sequences. Haplogroup B4a1a1a and sublineages thereof account for 69% alone, confirming previous HVR studies24,25,41 that found that these haplogroups form a gradient of increasing frequency across the Pacific. Building on our previous results,31 we find additional evidence for back-mutations at position 16247 (Figure S1), on the background of both haplogroups B4a1a1a and B4a1a1a1. The general frequency of the back-mutation in haplogroup B4a1a1a1 remains constant at approximately 20% and was found in all groups except Santa Cruz and the three populations from New Britain (Table S2); the latter result probably reflects the low frequency of haplogroup B4a1a1a1 in these populations. In addition, haplogroup B4a1a1a3, identified previously as a Maori-specific haplogroup,43 is present in additional samples from Remote Oceania, reaching a frequency of 14% in the Cook Islands (Table S2).
An analysis of haplotype sharing provides a sense of how ubiquitous haplogroup B is in the Pacific and further supports the recent entry and rapid spread of haplogroup B across Oceania. Haplotype sharing was calculated by an in-house Perl script that compares all sequence pairs. The resultant identity matrix was visualized with the R package ggplot2.62 For all analyses except haplogroup assignment, the poly-C regions (positions 303–315 and 16182–16193) were removed from all sequences. Most pairs of populations share sequences belonging to haplogroup B (Figure 3A), whereas in a similar analysis of all other haplogroups, interpopulation sharing virtually disappears and almost all cases of haplotype sharing are restricted to members of the same population (Figure 3B). Because recent female movement between populations should affect all haplotypes to the same degree, this pattern indicates a comparative lack of recent interpopulation contact. Instead, the high proportion of sharing in haplogroup B is a clear signature of a spread via a not-too-ancient population expansion that involved mostly, if not exclusively, haplogroup B lineages. Although previous HVR studies have also shown high degrees of similarity within haplogroup B samples across Oceania,24,41,63 their conclusions about haplotype sharing are constricted by the small portion of the mtDNA genome analyzed. Here we assess the complete diversity of whole mitochondrial genome and still find high levels of haplotype sharing. That these haplotypes are sometimes shared from New Britain to Remote Oceania (a distance of more than 5,500 km) indicates that this expansion happened over such a short period of time that new mutations did not occur within these haplotypes, nor have they mutated extensively since the initial expansion.
Figure 3.
Haplotype Sharing between Populations
(A) Exclusively haplogroup B.
(B) Excluding haplogroup B.
The scope of the Austronesian expansion and the spread of haplogroup B4a1a1 and descendent lineages is also evidenced in a Bayesian Skyline Plot (BSP)64,65 of population size change through time. The BSP for haplogroup B4a1a1∗ coalesces just after 5 ka ago, or at the approximate time the Austronesian expansion is thought to have begun, and shows a steady increase in effective population size of approximately three orders of magnitude continuing to the present day (Figure 4A). In the middle of this population increase, there is a slight levelling out, or decrease in the rate of size increase, around 3 ka ago which could be a reflection of the “long pause” in exploration that is believed to have occurred after the colonizations of western Polynesia (Fiji, Tonga, Samoa) and before other waves of voyaging and settlement began in eastern Polynesia around 1.7 ka ago.7,66,67
Figure 4.
Bayesian Skyline Plots
(A) Haplogroup B4a1a1∗.
(B) Haplogroup E.
BSP calculations were conducted with the data partitioned between coding and noncoding regions with respective mutation rates of 1.708 × 10−8 ± 0.15 × 10−8 and 9.883 × 10−8 ± 0.15 × 10−8 after Soares et al.88 and the TN93 substitution model allowing for both invariant sites and gamma distribution.
Previous studies have suggested that, based on the reconstructed age of the B4a1a1 haplogroup in Near Oceania, it must have been present in the region before the Austronesian expansion reached the area.42 Although our reconstructions would also suggest a time depth for the lineage that is greater than 3.5 ka (Figure 4A), the reconstructed age of a haplogroup does not necessarily indicate when the haplogroup was introduced into a particular region; it could have been introduced much more recently. Thus, a pre-Lapita age of haplogroup B4a1a1 is not evidence for a pre-Lapita origin in Near Oceania. Moreover, even if haplogroup B4a1a1 was introduced to New Guinea by a pre-Austronesian migration, the subsequent spread of this haplogroup throughout the rest of Near Oceania and across Remote Oceania is clearly associated with the Austronesian dispersal.
Older haplogroups that are thought to be autochthonous to Near Oceania include M27, M28, M29, Q, and P.24,32,33 Within these older lineages, we find not only greater geographic specificity and much less sequence sharing than for haplogroup B lineages (Figures 3B and 5, Table S2) but also several new lineages; these haplogroups are underrepresented in the literature. We find three distinct lineages of haplogroup M27 that diverged from each other about 60,000 years ago (Figure 6) and are notably found only in populations from Bougainville and the main Solomon Islands, with the exception of a single individual from Santa Cruz with M27b (Table S2). All samples identified as M27b in this study are missing 8 of the 25 diagnostic mutations for M27b (Phylotree Build 15), and all except one of the samples lack a ninth diagnostic position, and hence are most likely a sister lineage to those M27b lineages already identified in the area33 (Figure S2). Also widespread are haplogroups M28a, identified in populations from all groups save the Polynesian Outliers, and Q1, which was found in 40% of the samples from Santa Cruz (Table S2). Some of the sublineages of Q1 appear to have regional specificity (such as Q1 + 16223!, found in New Britain only) (Table S2), but on closer examination many of these sequences did not meet the full requirements for the sublineage to which they were assigned by Haplogrep. These probably represent new lineages that help refine the phylogeny of haplogroup Q, as some of the existing diagnostic mutations may not be suitable markers because they appear on multiple independent branches (Figure S3). With our larger sample size and wide geographic coverage, we have been able to identify greater levels of diversity for the autochthonous lineages of Oceania than have been previously available based on HVR data or a small sample of whole genomes selected for sequencing on the basis of their HVR sequence.32,33
Figure 5.
Network of All Haplogroups of Putative Near Oceanian Origin
Networks were generated with the program Network and calculated first with a reduced median algorithm89 and then a median joining algorithm90 and with transversions weighted three times greater than transitions. Networks were postprocessed with the program Network Publisher.
Figure 6.
Schematic of Haplogroups and Coalescence Times for Haplogroups Found in Oceania, Based on Bayesian Phylogenetic Analyses and Mutation Rates
B4a1a1/B4a1a1a/B4a1a1a3/B4a1a1a4 are represented as a single clade. B4a1a1a3 and B4a1a1a4 branch from diversity within B4a1a1a and because of the recurrent back-mutations of the 16247G allele, the Bayesian tree does not accurately resolve the relationship between B4a1a1, B4a1a1a, and B4a1a1a1. The age estimates for haplogroups in the B4a1a1 lineage are therefore only approximate. Individual schematics for haplogroups B, E, M27/M28/M29/M∗, M7c3c, Q, and P were constructed from Bayesian maximum clade credibility trees generated by BEAST65 and TreeAnnotator91 with parameters as described in the legend of Figure 4. These six individual schematics were then arranged as per the current accepted mtDNA phylogeny (Phylotree Build 15); connections assigned by the authors are indicated by dashed lines.
Finally, a previously undocumented lineage that is here termed M∗ (Table S2) is likely to be another autochthonous haplogroup of Near Oceania. Found in Bougainville, the main Solomon Islands, and Santa Cruz, the divergence of this lineage is estimated at about 60 ka ago from an ancestral M28-M29 lineage (Figure 6). No close relatives to this lineage have been reported based on either complete mtDNA genome sequences or HVR sequences, save one HVR sequence from the Torau population of Bougainville25 and one HVR sequence from Vanuatu68 that share the diagnostic 16290A transversion. The M∗ lineages coalesce about 5.9 ka ago, which is much more recent than other autochthonous Near Oceanian lineages (Figure 6) and could therefore suggest an origin for M∗ outside of Near Oceania that then spread to Near Oceania before 5.9 ka ago. However, given that M∗ has not been found outside Near Oceania, despite the wealth of mtDNA sequence information available for Southeast Asia, its recent coalescent time probably reflects a bottleneck event sometime within the last 6 ka.
Beyond the lineages that are associated with either the original settlement of Sahul or the Austronesian expansion, the presence of other lineages in Near Oceania may suggest other population influences. The frequency distribution of haplogroup E is notably different from that of haplogroup B (Table S2), and its extreme frequency in the Ata, a Papuan speaking population from New Britain, has led to the suggestion that haplogroup E entered Near Oceania separately from the Austronesian expansion.25 Haplogroup E1b1 is found in more than 45% of samples from the Ata in New Britain, a population notable as the only one in this study with no observed haplogroup B, making it quite different from its neighbors in New Britain (Figure 2). Overall, haplogroup E1 lineages are restricted to New Britain, Bougainville, and the Solomons, while the only haplogroup E2 lineage, E2a, was found in just three samples from Isabel in the Solomon Islands. Haplogroup E is found in reasonably high frequency across much of Island Southeast Asia (ISEA) and also in Taiwan but is almost completely absent from mainland Asia.10,69–75 The absence of two major clades of haplogroup E in Taiwan (E1b and E2a), as well as the greater overall sequence diversity found in ISEA, has led some to posit that the lineage must have originated in ISEA during the last glacial maximum while the region was still part of the larger continent of Sunda.70 E1b is the most frequent lineage of haplogroup E found in this study (Table S2) and it is conspicuously absent in Taiwan and the Philippines,71,72,74 which would suggest a different source for entry into Near Oceania. In the reconstructed history of haplogroup E through the BSP, we note that despite having only 28 samples from haplogroup E, the coalescence time is approximately 14 ka ago, far earlier than that of haplogroup B4a1a1∗ (Figure 4B). Although this cannot be taken as evidence that this haplogroup has been present in Near Oceania for 14 ka, we do believe that this further supports the hypothesis that this haplogroup arrived in the area via a population expansion separate from haplogroup B and the Austronesian expansion. Still, this putative migration had a minimal impact on the maternal genetic structure of Oceania, as indicated by the fact that haplogroup E accounts for only about 2% of the mtDNA sequences in this study.
Haplogroup M7c3c was found in elevated frequency (∼25%) only in Ontong Java (Table S2); it also occurred in one individual each from Vella Lavella and Tuvalu. Although M7c3c (“M7c1c” in some earlier publications10,24,69,76) is found in Taiwanese aborigines,10,69 the purported source populations of the Austronesian expansion, it is also widespread across ISEA69,71–77 but is virtually absent in Near Oceania (Table S2). Because this haplogroup distribution does not seem to reflect the same history as that of haplogroup B (or haplogroup E), M7c3c could potentially represent a different migration other than the Austronesian expansion, possibly from ISEA and Indonesia because these are believed to have sustained contact with Near Oceania in the period before the Austronesian expansion.7 However, Micronesia is another potential source, because M7c3c has been documented there,78 and there are historical accounts of drift migration from Micronesia to Ontong Java.79
The elevated proportion of Near Oceanian haplogroups in Santa Cruz has been reported before41,46 and remains perplexing. Santa Cruz is the first landmass encountered once crossing the border into Remote Oceania. It is known to have been settled approximately 3.2 ka ago,80 possibly directly from the Bismarck Archipelago, with evidence of contact between Santa Cruz and New Britain continuing for hundreds of years.81 Whether the peculiar haplogroup composition for this population is evidence of this trade and contact remains to be satisfactorily answered. It is the population sample with the greatest number of polymorphic sites (despite a smaller sample size than many populations) and has the highest mean pairwise difference and the greatest nucleotide diversity values (Table 1). Moreover, there are several Near Oceanian haplogroups in Santa Cruz, and each is represented by several distinct sequence types (Table S2, Figure 5). Thus, it is unlikely that the high frequency of Near Oceanian haplogroups reflects a recent bottleneck or founder event in Santa Cruz. The people of Santa Cruz speak an Austronesian language, recognized now as probably a deep branch within the Oceanic family of Austronesian,47 yet they are starkly different genetically from all other Remote Oceanian populations and from Austronesian-speaking populations in Near Oceania.
Table 1.
Diversity Statistics Based on Complete mtDNA Genome Sequences
Population | N | Hts | Ht Diversity | Polymorphic Sites | π | MPD |
---|---|---|---|---|---|---|
Anem | 32 | 18 | 0.93 ± 0.03 | 99 | 0.0011 ± 0.0006 | 18.30 ± 8.33 |
Ata | 28 | 8 | 0.75 ± 0.07 | 65 | 0.0014 ± 0.0007 | 22.29 ± 10.12 |
Nakanai | 42 | 19 | 0.94 ± 0.02 | 100 | 0.0015 ± 0.0007 | 24.41 ± 10.94 |
Buin | 29 | 23 | 0.98 ± 0.02 | 104 | 0.0018 ± 0.0009 | 29.61 ± 13.33 |
Buka | 11 | 10 | 0.98 ± 0.05 | 82 | 0.0017 ± 0.0009 | 28.15 ± 13.36 |
Siwai | 26 | 17 | 0.95 ± 0.02 | 93 | 0.0014 ± 0.0007 | 22.51 ± 10.24 |
Nagovisi | 38 | 14 | 0.88 ± 0.03 | 53 | 0.0003 ± 0.0001 | 4.23 ± 2.15 |
Nasioi | 41 | 28 | 0.97 ± 0.01 | 142 | 0.0017 ± 0.0008 | 28.04 ± 12.53 |
Torau | 34 | 20 | 0.96 ± 0.02 | 124 | 0.0017 ± 0.0009 | 28.37 ± 12.72 |
Choiseul | 33 | 23 | 0.95 ± 0.03 | 149 | 0.0012 ± 0.0006 | 19.86 ± 9.01 |
Gela | 40 | 23 | 0.94 ± 0.02 | 68 | 0.0010 ± 0.0005 | 17.12 ± 7.78 |
Guadalcanal | 50 | 42 | 0.99 ± 0.01 | 155 | 0.0011 ± 0.0006 | 18.15 ± 8.19 |
Isabel | 52 | 33 | 0.97 ± 0.01 | 114 | 0.0010 ± 0.0005 | 16.97 ± 7.67 |
Kolombangara | 18 | 10 | 0.91 ± 0.04 | 69 | 0.0007 ± 0.0004 | 11.18 ± 5.33 |
Makira | 17 | 14 | 0.97 ± 0.03 | 58 | 0.0005 ± 0.0003 | 8.46 ± 4.12 |
Malaita | 89 | 50 | 0.96 ± 0.01 | 165 | 0.0013 ± 0.0006 | 21.05 ± 9.38 |
Ranongga | 47 | 23 | 0.94 ± 0.02 | 117 | 0.0013 ± 0.0007 | 22.22 ± 9.97 |
Russell | 39 | 12 | 0.86 ± 0.04 | 59 | 0.0003 ± 0.0002 | 5.44 ± 2.68 |
Savo | 40 | 30 | 0.98 ± 0.01 | 92 | 0.0010 ± 0.0005 | 16.36 ± 7.44 |
Shortlands | 14 | 12 | 0.98 ± 0.03 | 81 | 0.0014 ± 0.0007 | 23.43 ± 10.97 |
Simbo | 22 | 15 | 0.96 ± 0.03 | 48 | 0.0004 ± 0.0002 | 5.97 ± 2.96 |
Vella Lavella | 51 | 25 | 0.96 ± 0.01 | 143 | 0.0010 ± 0.0005 | 16.85 ± 7.62 |
Santa Cruz | 47 | 27 | 0.95 ± 0.02 | 175 | 0.0019 ± 0.0009 | 30.93 ± 13.75 |
Cook Islands | 65 | 26 | 0.88 ± 0.03 | 77 | 0.0003 ± 0.0002 | 5.59 ± 2.72 |
Fiji | 49 | 32 | 0.96 ± 0.02 | 134 | 0.0007 ± 0.0004 | 12.04 ± 5.54 |
Futuna | 48 | 22 | 0.94 ± 0.02 | 84 | 0.0004 ± 0.0002 | 6.35 ± 3.06 |
Niue | 21 | 10 | 0.87 ± 0.06 | 12 | 0.0002 ± 0.0001 | 3.50 ± 1.85 |
Samoa | 47 | 34 | 0.95 ± 0.02 | 109 | 0.0005 ± 0.0003 | 8.50 ± 4.00 |
Tonga | 52 | 42 | 0.98 ± 0.01 | 116 | 0.0005 ± 0.0002 | 7.82 ± 3.70 |
Tuvalu | 50 | 29 | 0.96 ± 0.02 | 94 | 0.0005 ± 0.0002 | 7.64 ± 3.62 |
Bellona | 38 | 10 | 0.85 ± 0.03 | 15 | 0.0002 ± 0.0001 | 3.89 ± 2.00 |
Ontong Java | 32 | 12 | 0.89 ± 0.03 | 46 | 0.0009 ± 0.0004 | 14.12 ± 6.50 |
Rennell | 43 | 8 | 0.76 ± 0.05 | 13 | 0.0002 ± 0.0001 | 3.62 ± 1.87 |
Tikopia | 46 | 15 | 0.85 ± 0.04 | 53 | 0.0003 ± 0.0002 | 4.56 ± 2.28 |
To further explore these patterns on a population level, we calculated p-distance matrices between populations for some autochthonous Near Oceanian haplogroups. The heatmap contains pairwise distances for all sequences that belong to haplogroups of Near Oceanian origin and shows that, in general, sequences are most closely related to others originating from the same group (Figure 7). However, there is also a relatively high similarity of sequences from the Ata and Nakanai of New Britain to those of Santa Cruz and Remote Oceania (Figure 7). Thus, despite the lack of haplotype sharing, the lower p-distances between Santa Cruz and New Britain suggest some sort of older contact or migration (Figure 7). In addition, there is a tendency for some populations from the western Solomons (in particular Ranongga and the Shortlands) to have lower p-distances with populations from Bougainville than with populations from elsewhere in the Solomons, suggesting an east-west distinction in the Solomons in keeping with some views of the culture history of the region,45 as well as a linguistic division between Oceanic languages in the southeast versus northwest Solomon Islands.82,83 The genetic affinities between these populations are apparent here because our data set covers a wide range of Oceanian populations and also because these analyses have been restricted to the autochthonous haplogroups. When studies of population relationships were carried out averaged across all sequences from all populations (i.e., for multidimensional scaling analyses, data not shown), these signals appeared diminished in large part due to the overwhelming signal of similarity produced by the elevated frequency of haplogroup B in most populations.
Figure 7.
P-Distances between Samples Belonging to Haplogroups
(A) M27, M28, and M29.
(B) Haplogroup Q.
(C) Haplogroup P.
Distances were calculated with MEGA v.592 with the Tamura Nei substitution model with a gamma distribution and omitting any sites with greater than 5% missing data.
Because of the high proportion of Near Oceanian haplogroups in Santa Cruz (Figure 2) and the dearth of haplotype sharing (Figures 3 and 5) but the apparent sequence affinities (Figure 7), we considered the possibility of a pre-Austronesian settlement of Santa Cruz from New Britain. Archaeological evidence has shown that people were sailing regularly between the Bismarck Archipelago and Santa Cruz by 3.2 ka ago, “leapfrogging” over the main Solomon Islands where there is no similar indication of large-scale obsidian movement.81 To test the possibility of a pre-Austronesian settlement of Santa Cruz, we used the coalescent simulator SIMCOAL84 to simulate various scenarios in which a source population splits to form two daughter populations of equal effective population sizes followed by no additional migration events. We simulated demographic scenarios with combinations of six time points for the population split (representing the founding of the Santa Cruz population), a generation time of 28 years,85 and six effective population sizes and then, with the Arlequin package,86,87 calculated the number of segregating sites, mean pairwise distances, and ΦST and compared them to our empirically observed values. A summary of results is in Figure 8. The number of segregating sites and nucleotide diversity values for each population are informative as to the effective population size for New Britain and Santa Cruz, and the ΦST value (together with the effective population size estimate) is informative as to the time of population divergence (Figure 8). Although some of the findings hint at the possibility of an older settlement time for Santa Cruz, based on the best-fitting effective population sizes of between 1,500 and 2,500 for both New Britain and Santa Cruz, the 95% confidence intervals do not allow us to discount a settlement time of approximately 3 ka ago. Although a pre-Austronesian settlement for Santa Cruz is not ruled out by these results, neither is a pre-Austronesian settlement convincingly supported. The reason for the unusual genetic composition of Santa Cruz remains an enigma. Some studies of Y chromosome STRs24,34,41 and autosomal STRs and SNPs have already been undertaken in Oceania;37–39 however, additional studies of Y chromosome variation and whole-genome SNPs will be an important direction for future work and may help to clarify the history of Santa Cruz.
Figure 8.
Simulation Results for the Founding of Santa Cruz
(A and B) Segregating sites within the New Britain (A) and Santa Cruz (B) populations.
(C and D) Mean pairwise distance within the New Britain (C) and Santa Cruz (D) populations.
(E) ΦST between the New Britain and Santa Cruz populations.
In all cases the empirically observed value is indicated by the horizontal red line. Simulated demographic scenarios are grouped by the time since population split and then by simulated effective population size. Mean values for each simulation are indicated by black circles and the corresponding 95% confidence intervals by grey bars.
To conclude, we have investigated the maternal population history of 34 Oceanic populations, ranging from the Bismarck Archipelago to eastern Polynesia, with whole mitochondrial genome sequencing. Our results demonstrate the huge impact of the Austronesian expansion and its genetic legacy: haplogroup B lineages are found all across Oceania, account for 76% of all lineages, and in a very short time period have spread across the entire region. To consider this within the terms of population admixture, it seems that the Austronesians were very successfully integrated into existing Papuan societies; indeed, some populations that speak Papuan languages are among those with the highest frequencies of haplogroup B. However, by studying the non-Austronesian genetic signature, we can find evidence of population substructure and affinities between particular populations, perhaps indicative of relationships that existed before the entry of Austronesians to Near Oceania. The haplogroups of putative Near Oceanian origin found in this study appear to be quite ancient with coalescence times exceeding 60 ka ago, and we have identified new lineages that refine the current mtDNA phylogeny. Furthermore, some of these Near Oceanian haplogroups have highly regionally specific distributions, suggestive of population structure and minimal contact over a period of tens of thousands of years. We also find signals that suggest that beyond the two accepted population expansions, Near Oceania has potentially had other lesser migration events, each carrying with it specific haplogroups that appear to be localized to specific populations within Near Oceania. In addition, we do not find evidence to support a pre-Austronesian settlement of Santa Cruz, which remains a strong outlier in Remote Oceania because of its extraordinarily high frequency of autochthonous Near Oceanian haplogroups.
Acknowledgments
We thank all sample donors for contributing to this study and the numerous individuals in Bougainville who assisted with the sample collection. We thank the Sequencing Group within the Evolutionary Genetics Department for expert production of the sequences, and Gabriel Renaud and Udo Stenzel for their assistance with the HiSeq data. We thank Chiara Barbieri, Sebastian Lippold, and Sebastian Zöllner for helpful discussions and suggestions. We thank Laura Bañuelos, Torsten Blass, Alexander Hübner, and Roland Schröder for technical assistance. We thank Peter Sheppard and David Roe for sharing their knowledge of Near Oceanian archaeology. We thank Brigitte Pakendorf for helpful comments on the manuscript. This research was supported by the Max Planck Society, the Wenner-Gren Foundation for Anthropological Research, the National Science Foundation, and the National Geographic Society.
Supplemental Data
Document S1. Figures S1–S3
Table S1. Summary of Sequencing Conditions per Sample and Assigned Haplogroup per Sample
Table S2. Haplogroup Frequency and Putative Origin per Population
Document S2. Article plus Supplemental Data
Web Resources
The URLs for data presented herein are as follows:
GenBank, http://www.ncbi.nlm.nih.gov/genbank/
HaploGrep, http://haplogrep.uibk.ac.at/
Phylotree, http://www.phylotree.org/
R statistical software, http://www.r-project.org/
Accession Numbers
The GenBank accession numbers for the 795 novel sequences reported in this paper are KJ154155–KJ154949.
References
- 1.Groube L., Chappell J., Muke J., Price D. A 40,000 year-old human occupation site at Huon Peninsula, Papua New Guinea. Nature. 1986;324:453–455. doi: 10.1038/324453a0. [DOI] [PubMed] [Google Scholar]
- 2.Summerhayes G.R., Leavesley M., Fairbairn A., Mandui H., Field J., Ford A., Fullagar R. Human adaptation and plant use in highland New Guinea 49,000 to 44,000 years ago. Science. 2010;330:78–81. doi: 10.1126/science.1193130. [DOI] [PubMed] [Google Scholar]
- 3.Leavesley M.G., Bird M.I., Fifield L.K., Hausladen P., Santos G., Di Tada M. Buang Merabak: early evidence for human occupation in the Bismarck Archipelago, Papua New Guinea. Aust. Archaeol. 2002;54:55–57. [Google Scholar]
- 4.Leavesley M.G., Chappell J. Buang Merabak: additional early radiocarbon evidence of the colonisation of the Bismarck Archipelago, Papua New Guinea. Antiquity. 2004;78:318. [Google Scholar]
- 5.Wickler S., Spriggs M. Pleistocene human occupation of the Solomon Islands, Melanesia. Antiquity. 1988;62:703–706. [Google Scholar]
- 6.Fredericksen C., Spriggs M., Ambrose W. Pamwak rockshelter: a pleistocene site on Manus Island, Papua New Guinea. In: Smith M.A., Spriggs M., Frankhauser B., editors. Sahul in Review: Pleistocene Archaeology in Australia, New Guinea and Island Melanesia. Australian National University; Canberra: 1993. pp. 144–154. [Google Scholar]
- 7.Kirch P.V. University of California Press; Berkeley: 2000. On the Road of the Winds-An Archaeological History of the Pacific Islands Before European Contact. [Google Scholar]
- 8.Kirch P. Peopling of the Pacific: a holistic anthropological perspective. Annu. Rev. Anthropol. 2010;39:131–148. [Google Scholar]
- 9.Blust R. The prehistory of the Austronesian-speaking peoples: a view from language. J. World Prehist. 1995;9:453–510. [Google Scholar]
- 10.Trejaut J.A., Kivisild T., Loo J.H., Lee C.L., He C.L., Hsu C.J., Lee Z.Y., Lin M. Traces of archaic mitochondrial lineages persist in Austronesian-speaking Formosan populations. PLoS Biol. 2005;3:e247. doi: 10.1371/journal.pbio.0030247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Moodley Y., Linz B., Yamaoka Y., Windsor H.M., Breurec S., Wu J.-Y., Maady A., Bernhöft S., Thiberge J.-M., Phuanukoonnon S. The peopling of the Pacific from a bacterial perspective. Science. 2009;323:527–530. doi: 10.1126/science.1166083. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Kirch P., Hunt T. Radiocarbon dates from the Mussau Islands and the Lapita colonization of the southwestern Pacific. Radiocarbon. 1988;30:161–169. [Google Scholar]
- 13.Specht J., Gosden C. Dating Lapita pottery in the Bismark Archipelago, Papua New Guinea. Asian Perspect. 1997;36:175–199. [Google Scholar]
- 14.Summerhayes G.R. Lapita in the far west: recent developments. Archaeol. Oceania. 2001;36:53–63. [Google Scholar]
- 15.Summerhayes G.R. Island Melanesian pasts: a view from archaeology. In: Friedlaender J.S., editor. Genes, Language and Culture History in the Southwest Pacific. Oxford University Press; New York: 2007. pp. 10–35. [Google Scholar]
- 16.Athens J.S., Tuggle H.D., Ward J.V., Welch D.J. Avifaunal extinctions, vegetation change, and Polynesian impacts in prehistoric Hawai’i. Archaeol. Oceania. 2002;37:57–78. [Google Scholar]
- 17.Hunt T.L., Lipo C.P. Late colonization of Easter Island. Science. 2006;311:1603–1606. doi: 10.1126/science.1121879. [DOI] [PubMed] [Google Scholar]
- 18.Hogg A.G., Higham T.F.G., Lowe D.J., Palmer J.G., Reimer P.J., Newnham R.M. A wiggle-match date for Polynesian settlement of New Zealand. Antiquity. 2002;77:116–125. [Google Scholar]
- 19.Pawley A. Recent research on the historical relationships of the Papuan languages, or, what does linguistics say about the prehistory of Melanesia. In: Friedlaender J.S., editor. Genes, Language, and Culture History in the Southwest Pacific. Oxford University Press; New York: 2007. pp. 36–58. [Google Scholar]
- 20.Hunley K., Dunn M., Lindström E., Reesink G., Terrill A., Norton H., Scheinfeldt L., Friedlaender F.R., Merriwether D.A., Koki G., Friedlaender J.S. Inferring prehistory from genetic, linguistic, and geographic variation. In: Friedlaender J.S., editor. Genes, Language and Culture History in the Southwest Pacific. Oxford University Press; New York: 2007. pp. 141–154. [Google Scholar]
- 21.Lynch J., Ross M., Crowley T. Curzon Press; Richmond, Surrey: 2002. The Oceanic Languages. [Google Scholar]
- 22.Jordan F.M., Gray R.D., Greenhill S.J., Mace R. Matrilocal residence is ancestral in Austronesian societies. Proc. Biol. Sci. 2009;276:1957–1964. doi: 10.1098/rspb.2009.0088. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Soodyall H., Jenkins T., Stoneking M. ‘Polynesian’ mtDNA in the Malagasy. Nat. Genet. 1995;10:377–378. doi: 10.1038/ng0895-377. [DOI] [PubMed] [Google Scholar]
- 24.Kayser M., Brauer S., Cordaux R., Casto A., Lao O., Zhivotovsky L.A., Moyse-Faurie C., Rutledge R.B., Schiefenhoevel W., Gil D. Melanesian and Asian origins of Polynesians: mtDNA and Y chromosome gradients across the Pacific. Mol. Biol. Evol. 2006;23:2234–2244. doi: 10.1093/molbev/msl093. [DOI] [PubMed] [Google Scholar]
- 25.Friedlaender J.S., Friedlaender F.R., Hodgson J.A., Stoltz M., Koki G., Horvat G., Zhadanov S., Schurr T.G., Merriwether D.A. Melanesian mtDNA complexity. PLoS ONE. 2007;2:e248. doi: 10.1371/journal.pone.0000248. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Kayser M., Choi Y., van Oven M., Mona S., Brauer S., Trent R.J., Suarkia D., Schiefenhövel W., Stoneking M. The impact of the Austronesian expansion: evidence from mtDNA and Y chromosome diversity in the Admiralty Islands of Melanesia. Mol. Biol. Evol. 2008;25:1362–1374. doi: 10.1093/molbev/msn078. [DOI] [PubMed] [Google Scholar]
- 27.Razafindrazaka H., Ricaut F.-X., Cox M.P., Mormina M., Dugoujon J.-M., Randriamarolaza L.P., Guitard E., Tonasso L., Ludes B., Crubézy E. Complete mitochondrial DNA sequences provide new insights into the Polynesian motif and the peopling of Madagascar. Eur. J. Hum. Genet. 2010;18:575–581. doi: 10.1038/ejhg.2009.222. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Cox M.P., Nelson M.G., Tumonggor M.K., Ricaut F.-X., Sudoyo H. A small cohort of Island Southeast Asian women founded Madagascar. Proc. Biol. Sci. 2012;279:2761–2768. doi: 10.1098/rspb.2012.0012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Melton T., Peterson R., Redd A.J., Saha N., Sofro A.S., Martinson J., Stoneking M. Polynesian genetic affinities with Southeast Asian populations as identified by mtDNA analysis. Am. J. Hum. Genet. 1995;57:403–414. [PMC free article] [PubMed] [Google Scholar]
- 30.Redd A.J., Takezaki N., Sherry S.T., McGarvey S.T., Sofro A.S., Stoneking M. Evolutionary history of the COII/tRNALys intergenic 9 base pair deletion in human mitochondrial DNAs from the Pacific. Mol. Biol. Evol. 1995;12:604–615. doi: 10.1093/oxfordjournals.molbev.a040240. [DOI] [PubMed] [Google Scholar]
- 31.Duggan A.T., Stoneking M. A highly unstable recent mutation in human mtDNA. Am. J. Hum. Genet. 2013;92:279–284. doi: 10.1016/j.ajhg.2012.12.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Friedlaender J., Schurr T., Gentz F., Koki G., Friedlaender F., Horvat G., Babb P., Cerchio S., Kaestle F., Schanfield M. Expanding Southwest Pacific mitochondrial haplogroups P and Q. Mol. Biol. Evol. 2005;22:1506–1517. doi: 10.1093/molbev/msi142. [DOI] [PubMed] [Google Scholar]
- 33.Merriwether D.A., Hodgson J.A., Friedlaender F.R., Allaby R., Cerchio S., Koki G., Friedlaender J.S. Ancient mitochondrial M haplogroups identified in the Southwest Pacific. Proc. Natl. Acad. Sci. USA. 2005;102:13034–13039. doi: 10.1073/pnas.0506195102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Kayser M., Brauer S., Weiss G., Underhill P.A., Roewer L., Schiefenhövel W., Stoneking M. Melanesian origin of Polynesian Y chromosomes. Curr. Biol. 2000;10:1237–1246. doi: 10.1016/s0960-9822(00)00734-x. [DOI] [PubMed] [Google Scholar]
- 35.Hurles M.E., Nicholson J., Bosch E., Renfrew C., Sykes B.C., Jobling M.A. Y chromosomal evidence for the origins of oceanic-speaking peoples. Genetics. 2002;160:289–303. doi: 10.1093/genetics/160.1.289. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Scheinfeldt L., Friedlaender F., Friedlaender J., Latham K., Koki G., Karafet T., Hammer M., Lorenz J. Unexpected NRY chromosome variation in Northern Island Melanesia. Mol. Biol. Evol. 2006;23:1628–1641. doi: 10.1093/molbev/msl028. [DOI] [PubMed] [Google Scholar]
- 37.Kayser M., Lao O., Saar K., Brauer S., Wang X., Nürnberg P., Trent R.J., Stoneking M. Genome-wide analysis indicates more Asian than Melanesian ancestry of Polynesians. Am. J. Hum. Genet. 2008;82:194–198. doi: 10.1016/j.ajhg.2007.09.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Friedlaender J.S., Friedlaender F.R., Reed F.A., Kidd K.K., Kidd J.R., Chambers G.K., Lea R.A., Loo J.-H., Koki G., Hodgson J.A. The genetic structure of Pacific Islanders. PLoS Genet. 2008;4:e19. doi: 10.1371/journal.pgen.0040019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Wollstein A., Lao O., Becker C., Brauer S., Trent R.J., Nürnberg P., Stoneking M., Kayser M. Demographic history of Oceania inferred from genome-wide data. Curr. Biol. 2010;20:1983–1992. doi: 10.1016/j.cub.2010.10.040. [DOI] [PubMed] [Google Scholar]
- 40.Bellwood P. William Collins Publishers Ltd; Hong Kong: 1978. Man’s Conquest of the Pacific. [Google Scholar]
- 41.Delfin F., Myles S., Choi Y., Hughes D., Illek R., van Oven M., Pakendorf B., Kayser M., Stoneking M. Bridging near and remote Oceania: mtDNA and NRY variation in the Solomon Islands. Mol. Biol. Evol. 2012;29:545–564. doi: 10.1093/molbev/msr186. [DOI] [PubMed] [Google Scholar]
- 42.Soares P., Rito T., Trejaut J., Mormina M., Hill C., Tinkler-Hundal E., Braid M., Clarke D.J., Loo J.-H., Thomson N. Ancient voyaging and Polynesian origins. Am. J. Hum. Genet. 2011;88:239–247. doi: 10.1016/j.ajhg.2011.01.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Benton M., Macartney-Coxson D., Eccles D., Griffiths L., Chambers G., Lea R. Complete mitochondrial genome sequencing reveals novel haplotypes in a Polynesian population. PLoS ONE. 2012;7:e35026. doi: 10.1371/journal.pone.0035026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Roe D. Australian National University; Canberra: 1993. Prehistory without pots: prehistoric settlement and economy of North-west Guadalcanal, Solomon Islands. PhD thesis. [Google Scholar]
- 45.Walter R., Sheppard P. A review of Solomon Islands archaeology. In: Sheppard P., Thomas T., Summerhayes G., editors. Lapita: Ancestors and Descendants. New Zealand Archaeological Association; Auckland: 2009. pp. 35–72. [Google Scholar]
- 46.Friedlaender J.S., Gentz F., Green K., Merriwether D.A. A cautionary tale on ancient migration detection: mitochondrial DNA variation in Santa Cruz Islands, Solomon Islands. Hum. Biol. 2002;74:453–471. doi: 10.1353/hub.2002.0029. [DOI] [PubMed] [Google Scholar]
- 47.Ross M., Næss Å. An Oceanic origin for Äiwoo, the language of the Reef Islands? Ocean. Linguist. 2007;46:456–498. [Google Scholar]
- 48.Kirch P.V. The polynesian outliers: Continuity, change, and replacement. J. Pac. Hist. 1984;19:224–238. [Google Scholar]
- 49.Green R.C. Linguistic, biological and cultural origins of the initial inhabitants of Remote Oceania. NZ. J. Archaeol. 1995;17:5–27. [Google Scholar]
- 50.Walker J.A., Hedges D.J., Perodeau B.P., Landry K.E., Stoilova N., Laborde M.E., Shewale J., Sinha S.K., Batzer M.A. Multiplex polymerase chain reaction for simultaneous quantitation of human nuclear, mitochondrial, and male Y-chromosome DNA: application in human identification. Anal. Biochem. 2005;337:89–97. doi: 10.1016/j.ab.2004.09.036. [DOI] [PubMed] [Google Scholar]
- 51.Quinque D., Kittler R., Kayser M., Stoneking M., Nasidze I. Evaluation of saliva as a source of human DNA for population and association studies. Anal. Biochem. 2006;353:272–277. doi: 10.1016/j.ab.2006.03.021. [DOI] [PubMed] [Google Scholar]
- 52.Trent R.J., Mickleson K.N., Wilkinson T., Yakas J., Dixon M.W., Hill P.J., Kronenberg H. Globin genes in Polynesians have many rearrangements including a recently described gamma gamma gamma gamma/ Am. J. Hum. Genet. 1986;39:350–360. [PMC free article] [PubMed] [Google Scholar]
- 53.Trent R.J., Buchanan J.G., Webb A., Goundar R.P., Seruvatu L.M., Mickleson K.N. Globin genes are useful markers to identify genetic similarities between Fijians and Pacific Islanders from Polynesia and Melanesia. Am. J. Hum. Genet. 1988;42:601–607. [PMC free article] [PubMed] [Google Scholar]
- 54.Meyer M., Kircher M. Cold Spring Harb. Protoc; Cold Spring Harbor: 2010. Illumina Sequencing Library Preparation for Highly Multiplexed Target Capture and Sequencing. [DOI] [PubMed] [Google Scholar]
- 55.Maricic T., Whitten M., Pääbo S. Multiplexed DNA sequence capture of mitochondrial genomes using PCR products. PLoS ONE. 2010;5:e14004. doi: 10.1371/journal.pone.0014004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Barbieri C., Whitten M., Beyer K., Schreiber H., Li M., Pakendorf B. Contrasting maternal and paternal histories in the linguistic context of Burkina Faso. Mol. Biol. Evol. 2012;29:1213–1223. doi: 10.1093/molbev/msr291. [DOI] [PubMed] [Google Scholar]
- 57.Edgar R.C. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32:1792–1797. doi: 10.1093/nar/gkh340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Hall T.A. BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp. Ser. 1999;41:95–98. [Google Scholar]
- 59.Li H., Handsaker B., Wysoker A., Fennell T., Ruan J., Homer N., Marth G., Abecasis G., Durbin R., 1000 Genome Project Data Processing Subgroup The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–2079. doi: 10.1093/bioinformatics/btp352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.van Oven M., Kayser M. Updated comprehensive phylogenetic tree of global human mitochondrial DNA variation. Hum. Mutat. 2009;30:E386–E394. doi: 10.1002/humu.20921. [DOI] [PubMed] [Google Scholar]
- 61.Kloss-Brandstätter A., Pacher D., Schönherr S., Weissensteiner H., Binna R., Specht G., Kronenberg F. HaploGrep: a fast and reliable algorithm for automatic classification of mitochondrial DNA haplogroups. Hum. Mutat. 2011;32:25–32. doi: 10.1002/humu.21382. [DOI] [PubMed] [Google Scholar]
- 62.Wickham H. Springer New York; New York: 2009. ggplot2: Elegant Graphics for Data Analysis. [Google Scholar]
- 63.Whyte A.L., Marshall S.J., Chambers G.K. Human evolution in Polynesia. Hum. Biol. 2005;77:157–177. doi: 10.1353/hub.2005.0045. [DOI] [PubMed] [Google Scholar]
- 64.Drummond A.J., Rambaut A., Shapiro B., Pybus O.G. Bayesian coalescent inference of past population dynamics from molecular sequences. Mol. Biol. Evol. 2005;22:1185–1192. doi: 10.1093/molbev/msi103. [DOI] [PubMed] [Google Scholar]
- 65.Drummond A.J., Suchard M.A., Xie D., Rambaut A. Bayesian phylogenetics with BEAUti and the BEAST 1.7. Mol. Biol. Evol. 2012;29:1969–1973. doi: 10.1093/molbev/mss075. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Spriggs M., Anderson A. Late colonization of east Polynesia. Antiquity. 1993;67:200–217. [Google Scholar]
- 67.Gray R.D., Drummond A.J., Greenhill S.J. Language phylogenies reveal expansion pulses and pauses in Pacific settlement. Science. 2009;323:479–483. doi: 10.1126/science.1166858. [DOI] [PubMed] [Google Scholar]
- 68.Sykes B., Leiboff A., Low-Beer J., Tetzner S., Richards M. The origins of the Polynesians: an interpretation from mitochondrial lineage analysis. Am. J. Hum. Genet. 1995;57:1463–1475. [PMC free article] [PubMed] [Google Scholar]
- 69.Hill C., Soares P., Mormina M., Macaulay V., Clarke D., Blumbach P.B., Vizuete-Forster M., Forster P., Bulbeck D., Oppenheimer S., Richards M. A mitochondrial stratigraphy for island southeast Asia. Am. J. Hum. Genet. 2007;80:29–43. doi: 10.1086/510412. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Soares P., Trejaut J.A., Loo J.-H., Hill C., Mormina M., Lee C.-L., Chen Y.-M., Hudjashov G., Forster P., Macaulay V. Climate change and postglacial human dispersals in southeast Asia. Mol. Biol. Evol. 2008;25:1209–1218. doi: 10.1093/molbev/msn068. [DOI] [PubMed] [Google Scholar]
- 71.Tabbada K.A., Trejaut J., Loo J.-H., Chen Y.-M., Lin M., Mirazón-Lahr M., Kivisild T., De Ungria M.C. Philippine mitochondrial DNA diversity: a populated viaduct between Taiwan and Indonesia? Mol. Biol. Evol. 2010;27:21–31. doi: 10.1093/molbev/msp215. [DOI] [PubMed] [Google Scholar]
- 72.Gunnarsdóttir E.D., Li M., Bauchet M., Finstermeier K., Stoneking M. High-throughput sequencing of complete human mtDNA genomes from the Philippines. Genome Res. 2011;21:1–11. doi: 10.1101/gr.107615.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Gunnarsdóttir E.D., Nandineni M.R., Li M., Myles S., Gil D., Pakendorf B., Stoneking M. Larger mitochondrial DNA than Y-chromosome differences between matrilocal and patrilocal groups from Sumatra. Nat. Commun. 2011;2:228. doi: 10.1038/ncomms1235. [DOI] [PubMed] [Google Scholar]
- 74.Delfin F., Min-Shan Ko A., Li M., Gunnarsdóttir E.D., Tabbada K.A., Salvador J.M., Calacal G.C., Sagum M.S., Datar F.A., Padilla S.G. Complete mtDNA genomes of Filipino ethnolinguistic groups: a melting pot of recent and ancient lineages in the Asia-Pacific region. Eur. J. Hum. Genet. 2014;22:228–237. doi: 10.1038/ejhg.2013.122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Tumonggor M.K., Karafet T.M., Hallmark B., Lansing J.S., Sudoyo H., Hammer M.F., Cox M.P. The Indonesian archipelago: an ancient genetic highway linking Asia and the Pacific. J. Hum. Genet. 2013;58:165–173. doi: 10.1038/jhg.2012.154. [DOI] [PubMed] [Google Scholar]
- 76.Kivisild T., Tolk H.-V., Parik J., Wang Y., Papiha S.S., Bandelt H.-J., Villems R. The emerging limbs and twigs of the East Asian mtDNA tree. Mol. Biol. Evol. 2002;19:1737–1751. doi: 10.1093/oxfordjournals.molbev.a003996. [DOI] [PubMed] [Google Scholar]
- 77.Jinam T.A., Hong L.-C., Phipps M.E., Stoneking M., Ameen M., Edo J., Saitou N., HUGO Pan-Asian SNP Consortium Evolutionary history of continental southeast Asians: “early train” hypothesis based on genetic analysis of mitochondrial and autosomal DNA data. Mol. Biol. Evol. 2012;29:3513–3527. doi: 10.1093/molbev/mss169. [DOI] [PubMed] [Google Scholar]
- 78.Pierson M.J., Martinez-Arias R., Holland B.R., Gemmell N.J., Hurles M.E., Penny D. Deciphering past human population movements in Oceania: provably optimal trees of 127 mtDNA genomes. Mol. Biol. Evol. 2006;23:1966–1975. doi: 10.1093/molbev/msl063. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Hogbin H.I. “Polynesian” colonies in Melanesia. J. Polynesian So. 1940;49:199–220. [Google Scholar]
- 80.Green R.C., Jones M., Sheppard P. The reconstructed environment and absolute dating of SE-SZ-8 Lapita site on Nendö, Santa Cruz, Solomon Islands. Archaeol. Oceania. 2008;43:49–61. [Google Scholar]
- 81.Sheppard P., Walter R. A revised model of Solomon Islands culture history. J. Polynesian Soc. 2006;115:47–76. [Google Scholar]
- 82.Tryon D.T., Hackman B.D. Pacific Linguistics; Canberra: 1983. Solomon Islands Languages: An Internal Classification. [Google Scholar]
- 83.Ross M. Pacific Linguistics; Canberra: 1988. Proto Oceanic and the Austronesian Languages of Western Melanesia. [Google Scholar]
- 84.Excoffier L., Novembre J., Schneider S. SIMCOAL: a general coalescent program for the simulation of molecular data in interconnected populations with arbitrary demography. J. Hered. 2000;91:506–509. doi: 10.1093/jhered/91.6.506. [DOI] [PubMed] [Google Scholar]
- 85.Fenner J.N. Cross-cultural estimation of the human generation interval for use in genetics-based population divergence studies. Am. J. Phys. Anthropol. 2005;128:415–423. doi: 10.1002/ajpa.20188. [DOI] [PubMed] [Google Scholar]
- 86.Excoffier L., Laval G., Schneider S. Arlequin (version 3.0): an integrated software package for population genetics data analysis. Evol. Bioinform. Online. 2005;1:47–50. [PMC free article] [PubMed] [Google Scholar]
- 87.Excoffier L., Lischer H.E. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and Windows. Mol Ecol Resour. 2010;10:564–567. doi: 10.1111/j.1755-0998.2010.02847.x. [DOI] [PubMed] [Google Scholar]
- 88.Soares P., Ermini L., Thomson N., Mormina M., Rito T., Röhl A., Salas A., Oppenheimer S., Macaulay V., Richards M.B. Correcting for purifying selection: an improved human mitochondrial molecular clock. Am. J. Hum. Genet. 2009;84:740–759. doi: 10.1016/j.ajhg.2009.05.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Bandelt H.J., Forster P., Sykes B.C., Richards M.B. Mitochondrial portraits of human populations using median networks. Genetics. 1995;141:743–753. doi: 10.1093/genetics/141.2.743. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Bandelt H.J., Forster P., Röhl A. Median-joining networks for inferring intraspecific phylogenies. Mol. Biol. Evol. 1999;16:37–48. doi: 10.1093/oxfordjournals.molbev.a026036. [DOI] [PubMed] [Google Scholar]
- 91.Drummond A.J., Rambaut A. BEAST: Bayesian evolutionary analysis by sampling trees. BMC Evol. Biol. 2007;7:214. doi: 10.1186/1471-2148-7-214. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Tamura K., Peterson D., Peterson N., Stecher G., Nei M., Kumar S. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol. Biol. Evol. 2011;28:2731–2739. doi: 10.1093/molbev/msr121. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Document S1. Figures S1–S3
Table S1. Summary of Sequencing Conditions per Sample and Assigned Haplogroup per Sample
Table S2. Haplogroup Frequency and Putative Origin per Population
Document S2. Article plus Supplemental Data