Inactivation of thermogenic UCP1 as a historical contingency in multiple placental mammal clades
(2017)
Historically, the giant panda was widely distributed from northern China to southwestern Asia [1]. As a result of range contraction and fragmentation, extant individuals are currently restricted to fragmented mountain ranges on the eastern margin of the Qinghai-Tibet plateau, where they are distributed among three major population clusters [2]. However, little is known about the genetic consequences of this dramatic range contraction. For example, were regions where giant pandas previously existed occupied by ancestors of present-day populations, or were these regions occupied by genetically distinct populations that are now extinct? If so, is there any contribution of these extinct populations to the genomes of giant pandas living today? To investigate these questions, we sequenced the nuclear genome of a ~5,000-year-old giant panda from Jiangdongshan, Tengchong County in Yunnan Province, China. We find that this individual represents a genetically distinct population that diverged prior to the diversification of modern giant panda populations. We find evidence of differential admixture with this ancient population among modern individuals originating from different populations as well as within the same population. We also find evidence for directional gene flow, which transferred alleles from the ancient population into the modern giant panda lineages. A variable proportion of the genomes of extant individuals is therefore likely derived from the ancient population represented by our sequenced individual. Although extant giant panda populations retain reasonable genetic diversity, our results suggest that this represents only part of the genetic diversity this species harbored prior to its recent range contractions.
The invention and development of next or second generation sequencing methods has resulted in a dramatic transformation of ancient DNA research and allowed shotgun sequencing of entire genomes from fossil specimens. However, although there are exceptions, most fossil specimens contain only low (~1% or less) percentages of endogenous DNA. The only skeletal element for which a systematically higher endogenous DNA content compared to other skeletal elements has been shown is the petrous part of the temporal bone. In this study we investigate whether (a) different parts of the petrous bone of archaeological human specimens give different percentages of endogenous DNA yields, (b) there are significant differences in average DNA read lengths, damage patterns and total DNA concentration, and (c) it is possible to obtain endogenous ancient DNA from petrous bones from hot environments. We carried out intra-petrous comparisons for ten petrous bones from specimens from Holocene archaeological contexts across Eurasia dated between 10,000 and 1,800 calibrated years before present (cal. BP). We obtained shotgun DNA sequences from three distinct areas within the petrous: a spongy part of trabecular bone (part A), the dense part of cortical bone encircling the osseous inner ear, or otic capsule (part B), and the dense part within the otic capsule (part C). Our results confirm that dense bone parts of the petrous bone can provide high endogenous aDNA yields and indicate that endogenous DNA fractions for part C can exceed those obtained for part B by up to 65-fold and those from part A by up to 177-fold, while total endogenous DNA concentrations are up to 126-fold and 109-fold higher for these comparisons.
Our results also show that while endogenous yields from part C were lower than 1% for samples from hot (both arid and humid) parts, the DNA damage patterns indicate that at least some of the reads originate from ancient DNA molecules, potentially enabling ancient DNA analyses of samples from hot regions that are otherwise not amenable to ancient DNA analyses.
Leopard complex spotting is inherited by the incompletely dominant locus, LP, which also causes congenital stationary night blindness in homozygous horses. We investigated an associated single nucleotide polymorphism in the TRPM1 gene in 96 archaeological bones from 31 localities from Late Pleistocene (approx. 17 000 YBP) to medieval times. The first genetic evidence of LP spotting in Europe dates back to the Pleistocene. We tested for temporal changes in the LP associated allele frequency and estimated coefficients of selection by means of approximate Bayesian computation analyses. Our results show that at least some of the observed frequency changes are congruent with shifts in artificial selection pressure for the leopard complex spotting phenotype. In early domestic horses from Kirklareli-Kanligecit (Turkey) dating to 2700-2200 BC, a remarkably high number of leopard spotted horses (six of 10 individuals) was detected including one adult homozygote. However, LP seems to have largely disappeared during the late Bronze Age, suggesting selection against this phenotype in early domestic horses. During the Iron Age, LP reappeared, probably by reintroduction into the domestic gene pool from wild animals. This picture of alternating selective regimes might explain how genetic diversity was maintained in domestic animals despite selection for specific traits at different times.
For a long time, the analysis of ancient human DNA represented one of the most controversial disciplines in an already controversial field of research. Scepticism in this field was only matched by the long-lasting controversy over the authenticity of ancient pathogen DNA. This ambiguous view on ancient human DNA had a dichotomous root. On the one hand, the interest in ancient human DNA is great because such studies touch on the history and evolution of our own species. On the other hand, because these studies are dealing with samples from our own species, results are easily compromised by contamination of the experiments with modern human DNA, which is ubiquitous in the environment. Consequently, some of the most disputed studies published - apart maybe from early reports on million year old dinosaur or amber DNA - reported DNA analyses from human subfossil remains. However, the development of so-called next- or second-generation sequencing (SGS) in 2005 and the technological advances associated with it have generated new confidence in the genetic study of ancient human remains. The ability to sequence shorter DNA fragments than with PCR amplification coupled to traditional Sanger sequencing, along with very high sequencing throughput, have both reduced the risk of sequencing modern contamination and provided tools to evaluate the authenticity of DNA sequence data. The field is now rapidly developing, providing unprecedented insights into the evolution of our own species and past human population dynamics as well as the evolution and history of human pathogens and epidemics. Here, we review how recent technological improvements have rapidly transformed ancient human DNA research from a highly controversial subject to a central component of modern anthropological research. We also discuss potential future directions of ancient human DNA research.
Hyenas (family Hyaenidae), as the sister group to cats (family Felidae), represent a deeply diverging branch within the cat-like carnivores (Feliformia). With an estimated population size of <10,000 individuals worldwide, the brown hyena (Parahyaena brunnea) represents the rarest of the four extant hyena species and has been listed as Near Threatened by the IUCN. Here, we report a high-coverage genome from a captive bred brown hyena and both mitochondrial and low-coverage nuclear genomes of 14 wild-caught brown hyena individuals from across southern Africa. We find that brown hyenas harbor extremely low genetic diversity on both the mitochondrial and nuclear level, most likely resulting from a continuous and ongoing decline in effective population size that started ~1 Ma and dramatically accelerated towards the end of the Pleistocene. Despite the strikingly low genetic diversity, we find no evidence of inbreeding within the captive bred individual and reveal phylogeographic structure, suggesting the existence of several potential subpopulations within the species.
Cave hyenas (genus Crocuta) are extinct bone-cracking carnivores from the family Hyaenidae and are generally split into two taxa that correspond to a European/Eurasian and an (East) Asian lineage. They are close relatives of the extant African spotted hyenas, the only extant member of the genus Crocuta. Cave hyenas inhabited a wide range across Eurasia during the Pleistocene, but became extinct at the end of the Late Pleistocene. Using genetic and genomic datasets, previous studies have proposed different scenarios about the evolutionary history of Crocuta. However, causes of the extinction of cave hyenas are widely speculative and samples from China are severely understudied. In this study, we assembled near-complete mitochondrial genomes from two cave hyenas from northeastern China dating to 20 240 and 20 253 calBP, representing the youngest directly dated fossils of Crocuta in Asia. Phylogenetic analyses suggest a monophyletic clade of these two samples within a deeply diverging mitochondrial haplogroup of Crocuta. Bayesian analyses suggest that the split of this Asian cave hyena mitochondrial lineage from their European and African relatives occurred approximately 1.85 Ma (95% CI 1.62-2.09 Ma), which is broadly concordant with the earliest Eurasian Crocuta fossil dating to approximately 2 Ma. Comparisons of mean genetic distance indicate that cave hyenas harboured higher genetic diversity than extant spotted hyenas, brown hyenas and aardwolves, but this is probably at least partially due to the fact that their mitochondrial lineages do not represent a monophyletic group, although this is also true for extant spotted hyenas. Moreover, the joint female effective population size of Crocuta (both cave hyenas and extant spotted hyenas) has sustained two declines during the Late Pleistocene. 
Combining this mitochondrial phylogeny, previous nuclear findings and fossil records, we discuss the possible relationship of fossil Crocuta in China and the extinction of cave hyenas.
The complete mitochondrial genome of the common vole, Microtus arvalis (Rodentia: Arvicolinae)
(2018)
The common vole, Microtus arvalis belongs to the genus Microtus in the subfamily Arvicolinae. In this study, the complete mitochondrial genome of M. arvalis was recovered using shotgun sequencing and an iterative mapping approach using three related species. Phylogenetic analyses using the sequence of 21 arvicoline species place the common vole as a sister species to the East European vole (Microtus levis), but as opposed to previous results we find no support for the recognition of the genus Neodon within the subfamily Arvicolinae, as this is, as well as the genus Lasiopodomys, found within the Microtus genus.
The domestic Bactrian camel (Camelus bactrianus) used to be one of the most important livestock species in Chinese history, as well as the major transport carrier on the ancient Silk Road. However, archeological studies on Chinese C. bactrianus are still limited, and molecular biology research on this species is mainly focused on modern specimens. In this study, we retrieved the complete mitochondrial genome from a C. bactrianus specimen, which was excavated from northwestern China and dated at 1290-1180 cal. Phylogenetic analyses using 18 mitochondrial genomes indicated that the C. bactrianus clade was divided into two maternal lineages. The majority of samples originating from Iran to Japan and Mongolia belong to subclade A1, while our sample together with two Mongolian individuals formed the much smaller subclade A2. Furthermore, the divergence time of these two maternal lineages was estimated as 165 Kya (95% credibility interval 117-222 Kya); this might indicate that several different evolutionary lineages were incorporated into the domestic gene pool during the initial domestication process. Bayesian skyline plot (BSP) analysis indicated a slow increase in female effective population size of C. bactrianus from 5000 years ago, which corresponds to the beginning of domestication of C. bactrianus. The present study also revealed that there were extensive exchanges of genetic information among C. bactrianus populations in regions along the Silk Road.
It is widely accepted that modern pigs were domesticated independently at least twice, and Chinese native pigs are deemed as direct descendants of the first domesticated pigs in the corresponding domestication centers. By analyzing mitochondrial DNA sequences of an extensive sample set spanning 10,000 years, we find that the earliest pigs from the middle Yellow River region already carried the maternal lineages that are dominant in both younger archaeological populations and modern Chinese pigs. Our data set also supports early Neolithic pig utilization and a long-term in situ origin for northeastern Chinese pigs during 8,000-3,500 BP, suggesting a possibly independent domestication in northeast China. Additionally, we observe a genetic replacement in ancient northeast Chinese pigs since 3,500 BP. The results not only provide increasing evidence for pig origin in the middle Yellow River region but also depict an outline for the process of early pig domestication in northeast China.
Present-day domestic horses are immensely diverse in their maternally inherited mitochondrial DNA, yet they show very little variation on their paternally inherited Y chromosome. Although it has recently been shown that Y chromosomal diversity in domestic horses was higher at least until the Iron Age, when and why this diversity disappeared remain controversial questions. We genotyped 16 recently discovered Y chromosomal single-nucleotide polymorphisms in 96 ancient Eurasian stallions spanning the early domestication stages (Copper and Bronze Age) to the Middle Ages. Using this Y chromosomal time series, which covers nearly the entire history of horse domestication, we reveal how Y chromosomal diversity changed over time. Our results also show that the lack of multiple stallion lineages in the extant domestic population is caused by neither a founder effect nor random demographic effects but instead is the result of artificial selection, initially during the Iron Age by nomadic people from the Eurasian steppes and later during the Roman period. Moreover, the modern domestic haplotype probably derived from another, already advantageous, haplotype, most likely after the beginning of the domestication. In line with recent findings indicating that the Przewalski and domestic horse lineages remained connected by gene flow after they diverged about 45,000 years ago, we present evidence for Y chromosomal introgression of Przewalski horses into the gene pool of European domestic horses at least until medieval times.
Ancient genomes have revolutionized our understanding of Holocene prehistory and, particularly, the Neolithic transition in western Eurasia. In contrast, East Asia has so far received little attention, despite representing a core region at which the Neolithic transition took place independently ~3 millennia after its onset in the Near East. We report genome-wide data from two hunter-gatherers from Devil's Gate, an early Neolithic cave site (dated to ~7.7 thousand years ago) located in East Asia, on the border between Russia and Korea. Both of these individuals are genetically most similar to geographically close modern populations from the Amur Basin, all speaking Tungusic languages, and, in particular, to the Ulchi. The similarity to nearby modern populations and the low levels of additional genetic material in the Ulchi imply a high level of genetic continuity in this region during the Holocene, a pattern that markedly contrasts with that reported for Europe.
The performance of hybridization capture combined with next-generation sequencing (NGS) has seen limited investigation with samples from hot and arid regions until now. We applied hybridization capture and shotgun sequencing to recover DNA sequences from bone specimens of ancient-domestic dromedary (Camelus dromedarius) and its extinct ancestor, the wild dromedary from Jordan, Syria, Turkey and the Arabian Peninsula, respectively. Our results show that hybridization capture increased the percentage of mitochondrial DNA (mtDNA) recovery by an average 187-fold and in some cases yielded virtually complete mitochondrial (mt) genomes at multifold coverage in a single capture experiment. Furthermore, we tested the effect of hybridization temperature and time by using a touchdown approach on a limited number of samples. We observed no significant difference in the number of unique dromedary mtDNA reads retrieved with the standard capture compared to the touchdown method. In total, we obtained 14 partial mitochondrial genomes from ancient-domestic dromedaries with 17-95% length coverage and 1.27-47.1-fold read depths for the covered regions. Using whole-genome shotgun sequencing, we successfully recovered endogenous dromedary nuclear DNA (nuDNA) from domestic and wild dromedary specimens with 1-1.06-fold read depths for covered regions. Our results highlight that despite recent methodological advances, obtaining ancient DNA (aDNA) from specimens recovered from hot, arid environments is still problematic. Hybridization protocols require specific optimization, and samples at the limit of DNA preservation need multiple replications of DNA extraction and hybridization capture as has been shown previously for Middle Pleistocene specimens.
Eastern Africa has been a prime target for scientific drilling because it is rich in key paleoanthropological sites as well as in paleolakes, containing valuable paleoclimatic information on evolutionary time scales. The Hominin Sites and Paleolakes Drilling Project (HSPDP) explores these paleolakes with the aim of reconstructing environmental conditions around critical episodes of hominin evolution. Identification of biological taxa based on their sedimentary ancient DNA (sedaDNA) traces can contribute to understanding past ecological and climatological conditions of the living environment of our ancestors. However, sedaDNA recovery from tropical environments is challenging because high temperatures, UV irradiation, and desiccation result in highly degraded DNA. Consequently, most of the DNA fragments in tropical sediments are too short for PCR amplification. We analyzed sedaDNA in the upper 70 m of the composite sediment core of the HSPDP drill site at Chew Bahir for eukaryotic remnants. We first tested shotgun high-throughput sequencing, which led to metagenomes dominated by bacterial DNA of the deep biosphere, while only a small fraction was derived from eukaryotic, and thus probably ancient, DNA. Subsequently, we performed cross-species hybridization capture of sedaDNA to enrich ancient DNA (aDNA) from eukaryotic remnants for paleoenvironmental analysis, using established barcoding genes (cox1 and rbcL for animals and plants, respectively) from 199 species that may have had relatives in the past biosphere at Chew Bahir. Metagenomes yielded after hybridization capture are richer in reads with similarity to cox1 and rbcL in comparison to metagenomes without prior hybridization capture. Taxonomic assignments of the reads from these hybridization capture metagenomes also yielded larger fractions of the eukaryotic domain.
For reads assigned to cox1, inferred wet periods were associated with high inferred relative abundances of putative limnic organisms (gastropods, green algae), while inferred dry periods showed increased relative abundances for insects. These findings indicate that cross-species hybridization capture can be an effective approach to enhance the information content of sedaDNA in order to explore biosphere changes associated with past environmental conditions, enabling such analyses even under tropical conditions.
Horse domestication revolutionized warfare and accelerated travel, trade, and the geographic expansion of languages. Here, we present the largest DNA time series for a non-human organism to date, including genome-scale data from 149 ancient animals and 129 ancient genomes (>= 1-fold coverage), 87 of which are new. This extensive dataset allows us to assess the modern legacy of past equestrian civilisations. We find that two extinct horse lineages existed during early domestication, one at the far western (Iberia) and the other at the far eastern range (Siberia) of Eurasia. Neither of these contributed significantly to modern diversity. We show that the influence of Persian-related horse lineages increased following the Islamic conquests in Europe and Asia. Multiple alleles associated with elite-racing, including at the MSTN "speed gene," only rose in popularity within the last millennium. Finally, the development of modern breeding impacted genetic diversity more dramatically than the previous millennia of human management.
Comparing mitogenomic timetrees for two African savannah primate genera (Chlorocebus and Papio)
(2017)
Complete mitochondrial (mtDNA) genomes have proved to be useful in reconstructing primate phylogenies with higher resolution and confidence compared to reconstructions based on partial mtDNA sequences. Here, we analyse complete mtDNA genomes of African green monkeys (genus Chlorocebus), a widely distributed primate genus in Africa representing an interesting phylogeographical model for the evolution of savannah species. Previous studies on partial mtDNA sequences revealed nine major clades, suggesting several cases of para- and polyphyly among Chlorocebus species. However, in these studies, phylogenetic relationships among several clades were not resolved, and divergence times were not estimated. We analysed complete mtDNA genomes for ten Chlorocebus samples representing major mtDNA clades to find stronger statistical support in the phylogenetic reconstruction than in the previous studies and to estimate divergence times. Our results confirmed para- and polyphyletic relationships of most Chlorocebus species, while the support for the phylogenetic relationships between the mtDNA clades increased compared to the previous studies. Our results indicate an initial west-east division in the northern part of the Chlorocebus range with subsequent divergence into north-eastern and southern clades. This phylogeographic scenario contrasts with that for another widespread African savannah primate genus, the baboons (Papio), for which a dispersal from southern Africa into East and West Africa was suggested.
Background
The forelimb-specific gene tbx5 is highly conserved and essential for the development of forelimbs in zebrafish, mice, and humans. Amongst birds, a single order, Dinornithiformes, comprising the extinct wingless moa of New Zealand, are unique in having no skeletal evidence of forelimb-like structures.
Results
To determine the sequence of tbx5 in moa, we used a range of PCR-based techniques on ancient DNA to retrieve all nine tbx5 exons and splice sites from the giant moa, Dinornis. Moa Tbx5 is identical to chicken Tbx5 in being able to activate the downstream promoters of fgf10 and ANF. In addition, we show that misexpression of moa tbx5 in the hindlimb of chicken embryos results in the formation of forelimb features, suggesting that Tbx5 was fully functional in wingless moa. An alternatively spliced exon 1 for tbx5 that is expressed specifically in the forelimb region was shown to be almost identical between moa and ostrich, suggesting that, as well as being fully functional, tbx5 is likely to have been expressed normally in moa since divergence from their flighted ancestors, approximately 60 mya.
Conclusions
The results suggest that, as in mice, moa tbx5 is necessary for the induction of forelimbs, but is not sufficient for their outgrowth. Moa Tbx5 may have played an important role in the development of moa’s remnant forelimb girdle, and may be required for the formation of this structure. Our results further show that genetic changes affecting genes other than tbx5 must be responsible for the complete loss of forelimbs in moa.
Climate impacts on transocean dispersal and habitat in gray whales from the Pleistocene to 2100
(2015)
Arctic animals face dramatic habitat alteration due to ongoing climate change. Understanding how such species have responded to past glacial cycles can help us forecast their response to today's changing climate. Gray whales are among those marine species likely to be strongly affected by Arctic climate change, but a thorough analysis of past climate impacts on this species has been complicated by lack of information about an extinct population in the Atlantic. While little is known about the history of Atlantic gray whales or their relationship to the extant Pacific population, the extirpation of the Atlantic population during historical times has been attributed to whaling. We used a combination of ancient and modern DNA, radiocarbon dating and predictive habitat modelling to better understand the distribution of gray whales during the Pleistocene and Holocene. Our results reveal that dispersal between the Pacific and Atlantic was climate dependent and occurred both during the Pleistocene prior to the last glacial period and the early Holocene immediately following the opening of the Bering Strait. Genetic diversity in the Atlantic declined over an extended interval that predates the period of intensive commercial whaling, indicating this decline may have been precipitated by Holocene climate or other ecological causes. These first genetic data for Atlantic gray whales, particularly when combined with predictive habitat models for the year 2100, suggest that two recent sightings of gray whales in the Atlantic may represent the beginning of the expansion of this species' habitat beyond its currently realized range.
The agricultural transition profoundly changed human societies. We sequenced and analysed the first genome (1.39x) of an early Neolithic woman from Ganj Dareh, in the Zagros Mountains of Iran, a site with early evidence for an economy based on goat herding, ca. 10,000 BP. We show that Western Iran was inhabited by a population genetically most similar to hunter-gatherers from the Caucasus, but distinct from the Neolithic Anatolian people who later brought food production into Europe. The inhabitants of Ganj Dareh made little direct genetic contribution to modern European populations, suggesting those of the Central Zagros were somewhat isolated from other populations of the Fertile Crescent. Runs of homozygosity are of a similar length to those from Neolithic farmers, and shorter than those of Caucasus and Western Hunter-Gatherers, suggesting that the inhabitants of Ganj Dareh did not undergo the large population bottleneck suffered by their northern neighbours. While some degree of cultural diffusion between Anatolia, Western Iran and other neighbouring regions is possible, the genetic dissimilarity between early Anatolian farmers and the inhabitants of Ganj Dareh supports a model in which Neolithic societies in these areas were distinct.
The transition from hunting and gathering to farming involved profound cultural and technological changes. In Western and Central Europe, these changes occurred rapidly and synchronously after the arrival of early farmers of Anatolian origin [1-3], who largely replaced the local Mesolithic hunter-gatherers [1, 4-6]. Further east, in the Baltic region, the transition was gradual, with little or no genetic input from incoming farmers [7]. Here we use ancient DNA to investigate the relationship between hunter-gatherers and farmers in the Lower Danube basin, a geographically intermediate area that is characterized by a rapid Neolithic transition but also by the presence of archaeological evidence that points to cultural exchange, and thus possible admixture, between hunter-gatherers and farmers. We recovered four human paleogenomes (1.1× to 4.1× coverage) from Romania spanning a time transect between 8.8 thousand years ago (kya) and 5.4 kya and supplemented them with two Mesolithic genomes (1.7× and 5.3×) from Spain to provide further context on the genetic background of Mesolithic Europe. Our results show major Western hunter-gatherer (WHG) ancestry in a Romanian Eneolithic sample with a minor, but sizeable, contribution from Anatolian farmers, suggesting multiple admixture events between hunter-gatherers and farmers. Dietary stable isotope analysis of this sample suggests a mixed terrestrial/aquatic diet. Our results provide support for complex interactions among hunter-gatherers and farmers in the Danube basin, demonstrating that in some regions, demic and cultural diffusion were not mutually exclusive, but merely the ends of a continuum for the process of Neolithization.
Palaeogenomes of Eurasian straight-tusked elephants challenge the current view of elephant evolution
(2017)
The straight-tusked elephants Palaeoloxodon spp. were widespread across Eurasia during the Pleistocene. Phylogenetic reconstructions using morphological traits have grouped them with Asian elephants (Elephas maximus), and many paleontologists place Palaeoloxodon within Elephas. Here, we report the recovery of full mitochondrial genomes from four and partial nuclear genomes from two P. antiquus fossils. These fossils were collected at two sites in Germany, Neumark-Nord and Weimar-Ehringsdorf, and likely date to interglacial periods ~120 and ~244 thousand years ago, respectively. Unexpectedly, nuclear and mitochondrial DNA analyses suggest that P. antiquus was a close relative of extant African forest elephants (Loxodonta cyclotis). Species previously referred to Palaeoloxodon are thus most parsimoniously explained as having diverged from the lineage of Loxodonta, indicating that Loxodonta has not been constrained to Africa. Our results demonstrate that the current picture of elephant evolution is in need of substantial revision.
Eastern Africa has been a prime target for scientific drilling because it is rich in key paleoanthropological sites as well as in paleolakes, containing valuable paleoclimatic information on evolutionary time scales. The Hominin Sites and Paleolakes Drilling Project (HSPDP) explores these paleolakes with the aim of reconstructing environmental conditions around critical episodes of hominin evolution. Identification of biological taxa based on their sedimentary ancient DNA (sedaDNA) traces can contribute to understanding past ecological and climatological conditions of the living environment of our ancestors. However, sedaDNA recovery from tropical environments is challenging because high temperatures, UV irradiation, and desiccation result in highly degraded DNA. Consequently, most of the DNA fragments in tropical sediments are too short for PCR amplification. We analyzed sedaDNA in the upper 70 m of the composite sediment core of the HSPDP drill site at Chew Bahir for eukaryotic remnants. We first tested shotgun high-throughput sequencing, which led to metagenomes dominated by bacterial DNA of the deep biosphere, while only a small fraction was derived from eukaryotic, and thus probably ancient, DNA. Subsequently, we performed cross-species hybridization capture of sedaDNA to enrich ancient DNA (aDNA) from eukaryotic remnants for paleoenvironmental analysis, using established barcoding genes (cox1 and rbcL for animals and plants, respectively) from 199 species that may have had relatives in the past biosphere at Chew Bahir. Metagenomes yielded after hybridization capture are richer in reads with similarity to cox1 and rbcL in comparison to metagenomes without prior hybridization capture. Taxonomic assignments of the reads from these hybridization capture metagenomes also yielded larger fractions of the eukaryotic domain.
For reads assigned to cox1, inferred wet periods were associated with high inferred relative abundances of putative limnic organisms (gastropods, green algae), while inferred dry periods showed increased relative abundances for insects. These findings indicate that cross-species hybridization capture can be an effective approach to enhance the information content of sedaDNA in order to explore biosphere changes associated with past environmental conditions, enabling such analyses even under tropical conditions.
Objective
Plant carnivory is distributed across the tree of life and has evolved at least six times independently, but sequenced and annotated nuclear genomes of carnivorous plants are currently lacking. We have sequenced and structurally annotated the nuclear genome of the carnivorous Roridula gorgonias and that of a non-carnivorous relative, Madeira’s lily-of-the-valley-tree, Clethra arborea, both within the Ericales. This data adds an important resource to study the evolutionary genetics of plant carnivory across angiosperm lineages and also for functional and systematic aspects of plants within the Ericales.
Results
Our assemblies have total lengths of 284 Mbp (R. gorgonias) and 511 Mbp (C. arborea) and show high BUSCO scores of 84.2% and 89.5%, respectively. We used their predicted genes together with publicly available data from other Ericales’ genomes and transcriptomes to assemble a phylogenomic data set for the inference of a species tree. However, groups of orthologs showed a marked absence of species represented by a transcriptome. We discuss possible reasons and caution against combining predicted genes from genome- and transcriptome-based assemblies.
Simultaneous Barcode Sequencing of Diverse Museum Collection Specimens Using a Mixed RNA Bait Set
(2022)
A growing number of publications presenting results from sequencing natural history collection specimens reflects the importance of DNA sequence information from such samples. Ancient DNA extraction and library preparation methods in combination with target gene capture are a way of unlocking archival DNA, including from formalin-fixed wet-collection material. Here we report on an experiment in which we used an RNA bait set containing baits from a wide taxonomic range of species for DNA hybridisation capture of nuclear and mitochondrial targets for analysing natural history collection specimens. The bait set used consists of 2,492 mitochondrial and 530 nuclear RNA baits and comprises specific barcode loci of diverse animal groups including both invertebrates and vertebrates. The baits allowed the capture of DNA sequence information of target barcode loci from 84% of the 37 samples tested, with nuclear markers being captured more frequently and consensus sequences of these being more complete compared to mitochondrial markers. Samples from dry material had a higher rate of success than wet-collection specimens, although target sequence information could be captured from 50% of formalin-fixed samples. Our study illustrates how efforts to obtain barcode sequence information from natural history collection specimens may be combined and are a way of implementing barcoding inventories of scientific collection material.
The radula is the central foraging organ and apomorphy of the Mollusca. However, in contrast to other innovations, including the mollusk shell, genetic underpinnings of radula formation remain virtually unknown. Here, we present the first radula formative tissue transcriptome using the viviparous freshwater snail Tylomelania sarasinorum and compare it to foot tissue and the shell-building mantle of the same species. We combine differential expression, functional enrichment, and phylostratigraphic analyses to identify both specific and shared genetic underpinnings of the three tissues as well as their dominant functions and evolutionary origins. Gene expression of radula formative tissue is very distinct, but nevertheless more similar to mantle than to foot. Generally, the genetic bases of both radula and shell formation were shaped by novel orchestration of preexisting genes and continuous evolution of novel genes. A significantly increased proportion of radula-specific genes originated since the origin of stem-mollusks, indicating that novel genes were especially important for radula evolution. Genes with radula-specific expression in our study are frequently also expressed during the formation of other lophotrochozoan hard structures, like chaetae (hes1, arx), spicules (gbx), and shells of mollusks (gbx, heph) and brachiopods (heph), suggesting gene co-option for hard structure formation. Finally, a Lophotrochozoa-specific chitin synthase with a myosin motor domain (CS-MD), which is expressed during mollusk and brachiopod shell formation, had radula-specific expression in our study. CS-MD potentially facilitated the construction of complex chitinous structures and points at the potential of molecular novelties to promote the evolution of different morphological innovations.
Obtaining information about functional details of proteins of extinct species is of critical importance for a better understanding of the real-life appearance, behavior and ecology of these lost entries in the book of life. In this chapter, we discuss the possibilities to retrieve the necessary DNA sequence information from paleogenomic data obtained from fossil specimens, which can then be used to express and subsequently analyze the protein of interest. We discuss the problems specific to ancient DNA, including mis-coding lesions, short read length and incomplete paleogenome assemblies. Finally, we discuss an alternative, but currently rarely used approach, direct PCR amplification, which is especially useful for comparatively short proteins.
Xenikoudakis et al. report a partial mitochondrial genome of the extinct giant beaver Castoroides and estimate that aquatic behavior in beavers originated approximately 20 million years ago. This time estimate coincides with the extinction of terrestrial beavers and raises the question of whether the two events had a common cause.
Historical biogeography of the leopard (Panthera pardus) and its extinct Eurasian populations
(2019)
Background
Resolving the historical biogeography of the leopard (Panthera pardus) is a complex issue, because patterns inferred from fossils and from molecular data lack congruence. Fossil evidence supports an African origin, and suggests that leopards were already present in Eurasia during the Early Pleistocene. Analysis of DNA sequences however, suggests a more recent, Middle Pleistocene shared ancestry of Asian and African leopards. These contrasting patterns led researchers to propose a two-stage hypothesis of leopard dispersal out of Africa: an initial Early Pleistocene colonisation of Asia and a subsequent replacement by a second colonisation wave during the Middle Pleistocene. The status of Late Pleistocene European leopards within this scenario is unclear: were these populations remnants of the first dispersal, or do the last surviving European leopards share more recent ancestry with their African counterparts?
Results
In this study, we generate and analyse mitogenome sequences from historical samples that span the entire modern leopard distribution, as well as from Late Pleistocene remains. We find a deep bifurcation between African and Eurasian mitochondrial lineages (~ 710 Ka), with the European ancient samples as sister to all Asian lineages (~ 483 Ka). The modern and historical mainland Asian lineages share a relatively recent common ancestor (~ 122 Ka), and we find one Javan sample nested within these.
Conclusions
The phylogenetic placement of the ancient European leopard as sister group to Asian leopards suggests that these populations originate from the same out-of-Africa dispersal which founded the Asian lineages. The coalescence time found for the mitochondrial lineages aligns well with the earliest undisputed fossils in Eurasia, and thus encourages a re-evaluation of the identification of the much older putative leopard fossils from the region. The relatively recent ancestry of all mainland Asian leopard lineages suggests that these populations underwent a severe population bottleneck during the Pleistocene. Finally, although only based on a single sample, the unexpected phylogenetic placement of the Javan leopard could be interpreted as evidence for exchange of mitochondrial lineages between Java and mainland Asia, calling for further investigation into the evolutionary history of this subspecies.
The future of ancient DNA
(2015)
Technological innovations such as next generation sequencing and DNA hybridisation enrichment have resulted in multi-fold increases in both the quantity of ancient DNA sequence data and the time depth for DNA retrieval. To date, over 30 ancient genomes have been sequenced, moving from 0.7x coverage (mammoth) in 2008 to more than 50x coverage (Neanderthal) in 2014. Studies of rapid evolutionary changes, such as the evolution and spread of pathogens and the genetic responses of hosts, or the genetics of domestication and climatic adaptation, are developing swiftly and the importance of palaeogenomics for investigating evolutionary processes during the last million years is likely to increase considerably. However, these new datasets require new methods of data processing and analysis, as well as conceptual changes in interpreting the results. In this review we highlight important areas of future technical and conceptual progress and discuss research topics in the rapidly growing field of palaeogenomics.
Historical biogeography of the leopard (Panthera pardus) and its extinct Eurasian populations
(2018)
Background
Resolving the historical biogeography of the leopard (Panthera pardus) is a complex issue, because patterns inferred from fossils and from molecular data lack congruence. Fossil evidence supports an African origin, and suggests that leopards were already present in Eurasia during the Early Pleistocene. Analysis of DNA sequences however, suggests a more recent, Middle Pleistocene shared ancestry of Asian and African leopards. These contrasting patterns led researchers to propose a two-stage hypothesis of leopard dispersal out of Africa: an initial Early Pleistocene colonisation of Asia and a subsequent replacement by a second colonisation wave during the Middle Pleistocene. The status of Late Pleistocene European leopards within this scenario is unclear: were these populations remnants of the first dispersal, or do the last surviving European leopards share more recent ancestry with their African counterparts?
Results
In this study, we generate and analyse mitogenome sequences from historical samples that span the entire modern leopard distribution, as well as from Late Pleistocene remains. We find a deep bifurcation between African and Eurasian mitochondrial lineages (~ 710 Ka), with the European ancient samples as sister to all Asian lineages (~ 483 Ka). The modern and historical mainland Asian lineages share a relatively recent common ancestor (~ 122 Ka), and we find one Javan sample nested within these.
Conclusions
The phylogenetic placement of the ancient European leopard as sister group to Asian leopards suggests that these populations originate from the same out-of-Africa dispersal which founded the Asian lineages. The coalescence time found for the mitochondrial lineages aligns well with the earliest undisputed fossils in Eurasia, and thus encourages a re-evaluation of the identification of the much older putative leopard fossils from the region. The relatively recent ancestry of all mainland Asian leopard lineages suggests that these populations underwent a severe population bottleneck during the Pleistocene. Finally, although only based on a single sample, the unexpected phylogenetic placement of the Javan leopard could be interpreted as evidence for exchange of mitochondrial lineages between Java and mainland Asia, calling for further investigation into the evolutionary history of this subspecies.
Consensify
(2020)
A standard practice in palaeogenome analysis is the conversion of mapped short read data into pseudohaploid sequences, frequently by selecting a single high-quality nucleotide at random from the stack of mapped reads. This controls for biases due to differential sequencing coverage, but it does not control for differential rates and types of sequencing error, which are frequently large and variable in datasets obtained from ancient samples. These errors have the potential to distort phylogenetic and population clustering analyses, and to mislead tests of admixture using D statistics. We introduce Consensify, a method for generating pseudohaploid sequences, which controls for biases resulting from differential sequencing coverage while greatly reducing error rates. The error correction is derived directly from the data itself, without the requirement for additional genomic resources or simplifying assumptions such as contemporaneous sampling. For phylogenetic and population clustering analysis, we find that Consensify is less affected by artefacts than methods based on single read sampling. For D statistics, Consensify is more resistant to false positives and appears to be less affected by biases resulting from different laboratory protocols than other frequently used methods. Although Consensify was developed with palaeogenomic data in mind, it is applicable to any low to medium coverage short read dataset. We predict that Consensify will be a useful tool for future studies of palaeogenomes.
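The contrast between single read sampling and a consensus-based pseudohaploid call can be illustrated with a minimal Python sketch. It is not the Consensify implementation: the abstract does not spell out the consensus rule, so the rule used here (draw three bases at random from the read stack and keep the base on which at least two agree, otherwise emit N) is an assumption, and the function names, the toy pileup and the parameters `n_sample` and `min_agree` are hypothetical.

```python
import random
from collections import Counter


def sample_single_read(stack, rng):
    """Pseudohaploid call by picking one base at random from the read stack."""
    return rng.choice(stack) if stack else "N"


def consensus_of_subsample(stack, rng, n_sample=3, min_agree=2):
    """Assumed consensus rule (hypothetical parameters, not from the abstract).

    Draw n_sample bases without replacement, so every usable position is
    judged on the same number of reads regardless of its total depth, and
    keep the most frequent base only if at least min_agree draws support it.
    """
    if len(stack) < n_sample:
        return "N"  # too little coverage to form a consensus
    drawn = rng.sample(stack, n_sample)
    base, count = Counter(drawn).most_common(1)[0]
    return base if count >= min_agree else "N"


def pseudohaploid(pileup, caller, seed=0):
    """Apply a per-position caller to a list of read stacks."""
    rng = random.Random(seed)
    return "".join(caller(stack, rng) for stack in pileup)


# Toy pileup: one list of observed bases per reference position.
pileup = [list("AAAA"), list("CCCT"), list("GG"), list("ACG"), []]

single = pseudohaploid(pileup, sample_single_read)
consensus = pseudohaploid(pileup, consensus_of_subsample)
print(single)
print(consensus)
```

In this toy pileup, the lone T at the second position can be picked by single read sampling but is always outvoted under the assumed consensus rule, while positions with fewer than three reads or no majority are masked as N. Fixing the number of sampled bases is what keeps the call independent of sequencing depth in this sketch.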
Ancient DNA studies have revolutionized the study of extinct species and populations, providing insights on phylogeny, phylogeography, admixture and demographic history. However, inferences on behaviour and sociality have been far less frequent. Here, we investigate the complete mitochondrial genomes of extinct Late Pleistocene cave bears and middle Holocene brown bears that each inhabited multiple geographically proximate caves in northern Spain. In cave bears, we find that, although most caves were occupied simultaneously, each cave almost exclusively contains a unique lineage of closely related haplotypes. This remarkable pattern suggests extreme fidelity to their birth site in cave bears, best described as homing behaviour, and that cave bears formed stable maternal social groups at least for hibernation. In contrast, brown bears do not show any strong association of mitochondrial lineage and cave, suggesting that these two closely related species differed in aspects of their behaviour and sociality. This difference is likely to have contributed to cave bear extinction, which occurred at a time in which competition for caves between bears and humans was likely intense and the ability to rapidly colonize new hibernation sites would have been crucial for the survival of a species so dependent on caves for hibernation as cave bears. Our study demonstrates the potential of ancient DNA to uncover patterns of behaviour and sociality in ancient species and populations, even those that went extinct many tens of thousands of years ago.
Ancient DNA of extinct species from the Pleistocene and Holocene has provided valuable evolutionary insights. However, these are largely restricted to mammals and high latitudes because DNA preservation in warm climates is typically poor. In the tropics and subtropics, non-avian reptiles constitute a significant part of the fauna and little is known about the genetics of the many extinct reptiles from tropical islands. We have reconstructed the near-complete mitochondrial genome of an extinct giant tortoise from the Bahamas (Chelonoidis alburyorum) using an approximately 1000-year-old humerus from a water-filled sinkhole (blue hole) on Great Abaco Island. Phylogenetic and molecular clock analyses place this extinct species as closely related to Galapagos (C. niger complex) and Chaco tortoises (C. chilensis), and provide evidence for repeated overseas dispersal in this tortoise group. The ancestors of extant Chelonoidis species arrived in South America from Africa only after the opening of the Atlantic Ocean and dispersed from there to the Caribbean and the Galapagos Islands. Our results also suggest that the anoxic, thermally buffered environment of blue holes may enhance DNA preservation, and thus are opening a window for better understanding evolution and population history of extinct tropical species, which would likely still exist without human impact.
The Great Hungarian Plain was a crossroads of cultural transformations that have shaped European prehistory. Here we analyse a 5,000-year transect of human genomes, sampled from petrous bones giving consistently excellent endogenous DNA yields, from 13 Hungarian Neolithic, Copper, Bronze and Iron Age burials including two to high (~22x) and seven to ~1x coverage, to investigate the impact of these on Europe's genetic landscape. These data suggest genomic shifts with the advent of the Neolithic, Bronze and Iron Ages, with interleaved periods of genome stability. The earliest Neolithic context genome shows a European hunter-gatherer genetic signature and a restricted ancestral population size, suggesting direct contact between cultures after the arrival of the first farmers into Europe. The latest, Iron Age, sample reveals an eastern genomic influence concordant with introduced Steppe burial rites. We observe transition towards lighter pigmentation and, surprisingly, no Neolithic presence of lactase persistence.
We extend the scope of European palaeogenomics by sequencing the genomes of Late Upper Palaeolithic (13,300 years old, 1.4-fold coverage) and Mesolithic (9,700 years old, 15.4-fold) males from western Georgia in the Caucasus and a Late Upper Palaeolithic (13,700 years old, 9.5-fold) male from Switzerland. While we detect Late Palaeolithic–Mesolithic genomic continuity in both regions, we find that Caucasus hunter-gatherers (CHG) belong to a distinct ancient clade that split from western hunter-gatherers ∼45 kya, shortly after the expansion of anatomically modern humans into Europe and from the ancestors of Neolithic farmers ∼25 kya, around the Last Glacial Maximum. CHG genomes significantly contributed to the Yamnaya steppe herders who migrated into Europe ∼3,000 BC, supporting a formative Caucasus influence on this important Early Bronze age culture. CHG left their imprint on modern populations from the Caucasus and also central and south Asia possibly marking the arrival of Indo-Aryan languages.
The recently extinct (ca. 1768) Steller's sea cow (Hydrodamalis gigas) was a large, edentulous North Pacific sirenian. The phylogenetic affinities of this taxon to other members of this clade, living and extinct, are uncertain based on previous morphological and molecular studies. We employed hybridization capture methods and second generation sequencing technology to obtain >30 kb of exon sequences from 26 nuclear genes for both H. gigas and Dugong dugon. We also obtained complete coding sequences for the tooth-related enamelin (ENAM) gene. Hybridization probes designed using dugong and manatee sequences were both highly effective in retrieving sequences from H. gigas (mean = 98.8% coverage), as were more divergent probes for regions of ENAM (99.0% coverage) that were designed exclusively from a proboscidean (African elephant) and a hyracoid (Cape hyrax). New sequences were combined with available sequences for representatives of all other afrotherian orders. We also expanded a previously published morphological matrix for living and fossil Sirenia by adding both new taxa and nine new postcranial characters. Maximum likelihood and parsimony analyses of the molecular data provide robust support for an association of H. gigas and D. dugon to the exclusion of living trichechids (manatees). Parsimony analyses of the morphological data also support the inclusion of H. gigas in Dugongidae with D. dugon and fossil dugongids. Timetree analyses based on calibration density approaches with hard- and soft-bounded constraints suggest that H. gigas and D. dugon diverged in the Oligocene and that crown sirenians last shared a common ancestor in the Eocene. The coding sequence for the ENAM gene in H. gigas does not contain frameshift mutations or stop codons, but there is a transversion mutation (AG to CG) in the acceptor splice site of intron 2. 
This disruption in the edentulous Steller's sea cow is consistent with previous studies that have documented inactivating mutations in tooth-specific loci of a variety of edentulous and enamelless vertebrates including birds, turtles, aardvarks, pangolins, xenarthrans, and baleen whales. Further, branch-site dN/dS analyses provide evidence for positive selection in ENAM on the stem dugongid branch where extensive tooth reduction occurred, followed by neutral evolution on the Hydrodamalis branch. Finally, we present a synthetic evolutionary tree for living and fossil sirenians showing several key innovations in the history of this clade including character state changes that parallel those that occurred in the evolutionary history of cetaceans.
Technological innovations such as next generation sequencing and DNA hybridisation enrichment have resulted in multi-fold increases in both the quantity of ancient DNA sequence data and the time depth for DNA retrieval. To date, over 30 ancient genomes have been sequenced, moving from 0.7x coverage (mammoth) in 2008 to more than 50x coverage (Neanderthal) in 2014. Studies of rapid evolutionary changes, such as the evolution and spread of pathogens and the genetic responses of hosts, or the genetics of domestication and climatic adaptation, are developing swiftly and the importance of palaeogenomics for investigating evolutionary processes during the last million years is likely to increase considerably. However, these new datasets require new methods of data processing and analysis, as well as conceptual changes in interpreting the results. In this review we highlight important areas of future technical and conceptual progress and discuss research topics in the rapidly growing field of palaeogenomics.
Morphological and genetic evidence for early Holocene cattle management in northeastern China
(2013)
The domestication of cattle is generally accepted to have taken place in two independent centres: around 10,500 years ago in the Near East, giving rise to modern taurine cattle, and two millennia later in southern Asia, giving rise to zebu cattle. Here we provide firmly dated morphological and genetic evidence for early Holocene management of taurine cattle in northeastern China. We describe conjoining mandibles from this region that show evidence of oral stereotypy, dated to the early Holocene by two independent C-14 dates. Using Illumina high-throughput sequencing coupled with DNA hybridization capture, we characterize 15,406 bp of the mitogenome with on average 16.7-fold coverage. Phylogenetic analyses reveal a hitherto unknown mitochondrial haplogroup that falls outside the known taurine diversity. Our data suggest that the first attempts to manage cattle in northern China predate the introduction of domestic cattle that gave rise to the current stock by several thousand years.
As limits on O2 availability during submergence impose severe constraints on aerobic respiration, the oxygen binding globin proteins of marine mammals are expected to have evolved under strong evolutionary pressures during their land-to-sea transition. Here, we address this question for the order Sirenia by retrieving, annotating, and performing detailed selection analyses on the globin repertoire of the extinct Steller’s sea cow (Hydrodamalis gigas), dugong (Dugong dugon), and Florida manatee (Trichechus manatus latirostris) in relation to their closest living terrestrial relatives (elephants and hyraxes). These analyses indicate most loci experienced elevated nucleotide substitution rates during their transition to a fully aquatic lifestyle. While most of these genes evolved under neutrality or strong purifying selection, the rate of nonsynonymous/synonymous replacements increased in two genes (Hbz-T1 and Hba-T1) that encode the α-type chains of hemoglobin (Hb) during each stage of life. Notably, the relaxed evolution of Hba-T1 is temporally coupled with the emergence of a chimeric pseudogene (Hba-T2/Hbq-ps) that contributed to the tandemly linked Hba-T1 of stem sirenians via interparalog gene conversion. Functional tests on recombinant Hb proteins from extant and ancestral sirenians further revealed that the molecular remodeling of Hba-T1 coincided with increased Hb–O2 affinity in early sirenians. Available evidence suggests that this trait evolved to maximize O2 extraction from finite lung stores and suppress tissue O2 offloading, thereby facilitating the low metabolic intensities of extant sirenians. In contrast, the derived reduction in Hb–O2 affinity in (sub)Arctic Steller’s sea cows is consistent with fueling increased thermogenesis by these once colossal marine herbivores.
Being at the western fringe of Europe, Iberia had a peculiar prehistory and a complex pattern of Neolithization. A few studies, all based on modern populations, reported the presence of DNA of likely African origin in this region, generally concluding it was the result of recent gene flow, probably during the Islamic period. Here, we provide evidence of much older gene flow from Africa to Iberia by sequencing whole genomes from four human remains from northern Portugal and southern Spain dated around 4000 years BP (from the Middle Neolithic to the Bronze Age). We found one of them to carry an unequivocal sub-Saharan mitogenome of most probably West or West-Central African origin, to our knowledge never reported before in prehistoric remains outside Africa. Our analyses of ancient nuclear genomes show small but significant levels of sub-Saharan African affinity in several ancient Iberian samples, which indicates that what we detected was not an occasional individual phenomenon, but an admixture event recognizable at the population level. We interpret this result as evidence of an early migration process from Africa into the Iberian Peninsula through a western route, possibly across the Strait of Gibraltar.
Molecular identification of late and terminal Pleistocene Equus ovodovi from northeastern China
(2019)
The extant diversity of horses (family Equidae) represents a small fraction of that occurring over their evolutionary history. One such lost lineage is the subgenus Sussemionus, which is thought to have become extinct during the Middle Pleistocene. However, recent molecular studies and morphological analysis have revealed that one of their representatives, E. ovodovi, did exist in Siberia during the Late Pleistocene. Fossil materials of E. ovodovi have thus far only been found in Russia. In this study, we extracted DNA from three equid fossil specimens excavated from northeastern China dated at 12,770-12,596, 29,525-28,887 and 40,201-38,848 cal. yBP, respectively, and retrieved three near-complete mitochondrial genomes from the specimens. Phylogenetic analyses cluster the Chinese haplotypes together with previously published Russian E. ovodovi, strongly supporting the assignment of these samples to this taxon. The molecular identification of E. ovodovi in northeastern China extends the known geographical range of this fossil species by several thousand kilometers to the east. The estimated coalescence time of all E. ovodovi haplotypes is approximately 199 Kya, with the Chinese haplotypes coalescing approximately 130 Kya. With a radiocarbon age of 12,770-12,596 cal. yBP, the youngest sample in this study represents the first E. ovodovi sample dating to the terminal Pleistocene, moving the extinction date of this species forwards considerably compared to previously documented fossils. Overall, comparison of our three mitochondrial genomes with the two published ones suggests a genetic diversity similar to several extant species of the genus Equus.
The prevalence of contaminant microbial DNA in ancient bone samples represents the principal limiting factor for palaeogenomic studies, as it may comprise more than 99% of DNA molecules obtained. Efforts to exclude or reduce this contaminant fraction have been numerous but also variable in their success. Here, we present a simple but highly effective method to increase the relative proportion of endogenous molecules obtained from ancient bones. Using computed tomography (CT) scanning, we identify the densest region of a bone as optimal for sampling. This approach accurately identifies the densest internal regions of petrous bones, which are known to be a source of high-purity ancient DNA. For ancient long bones, CT scans reveal a high-density outermost layer, which has been routinely removed and discarded prior to DNA extraction. For almost all long bones investigated, we find that targeted sampling of this outermost layer provides an increase in endogenous DNA content over that obtained from softer, trabecular bone. This targeted sampling can produce as much as a 50-fold increase in the proportion of endogenous DNA, providing a directly proportional reduction in sequencing costs for shotgun sequencing experiments. The observed increases in endogenous DNA proportion are not associated with any reduction in absolute endogenous molecule recovery. Although sampling the outermost layer can result in higher levels of human contamination, some bones were found to have more contamination associated with the internal bone structures. Our method is highly consistent, reproducible and applicable across a wide range of bone types, ages and species. We predict that this discovery will greatly extend the potential to study ancient populations and species in the genomics era.
Although many large mammal species went extinct at the end of the Pleistocene epoch, their DNA may persist due to past episodes of interspecies admixture. However, direct empirical evidence of the persistence of ancient alleles remains scarce. Here, we present multifold coverage genomic data from four Late Pleistocene cave bears (Ursus spelaeus complex) and show that cave bears hybridized with brown bears (Ursus arctos) during the Pleistocene. We develop an approach to assess both the directionality and relative timing of gene flow. We find that segments of cave bear DNA still persist in the genomes of living brown bears, with cave bears contributing 0.9 to 2.4% of the genomes of all brown bears investigated. Our results show that even though extinction is typically considered as absolute, following admixture, fragments of the gene pool of extinct species can survive for tens of thousands of years in the genomes of extant recipient species.
Reply to Peng et al.: Archaeological contexts should not be ignored for early chicken domestication
(2015)
Insects and their six-legged relatives (Hexapoda) comprise more than half of all described species and dominate terrestrial and freshwater ecosystems. Understanding the macroevolutionary processes generating this richness requires a historical perspective, but the fossil record of hexapods is patchy and incomplete. Dated molecular phylogenies provide an alternative perspective on divergence times and have been combined with birth-death models to infer patterns of diversification across a range of taxonomic groups. Here we generate a dated phylogeny of hexapod families, based on previously published sequence data and literature derived constraints, in order to identify the broad pattern of macroevolutionary changes responsible for the composition of the extant hexapod fauna. The most prominent increase in diversification identified is associated with the origin of complete metamorphosis, confirming this as a key innovation in promoting insect diversity. Subsequent reductions are recovered for several groups previously identified as having a higher fossil diversity during the Mesozoic. In addition, a number of recently derived taxa are found to have radiated following the development of flowering plant (angiosperm) floras during the mid-Cretaceous. These results reveal that the composition of the modern hexapod fauna is a product of a key developmental innovation, combined with multiple and varied evolutionary responses to environmental changes from the mid Cretaceous floral transition onward.
The publication of partial and complete paleogenomes within the last few years has reinvigorated research in ancient DNA. No longer limited to short fragments of mitochondrial DNA, inference of evolutionary processes through time can now be investigated from genome-wide data sampled as far back as 700,000 years. Tremendous insights have been made, in particular regarding the hominin lineage. With rare exception, however, a paleogenomic perspective has been mired by the quality and quantity of recoverable DNA. Though conceptually simple, extracting ancient DNA remains challenging, and sequencing ancient genomes to high coverage remains prohibitively expensive for most laboratories. Still, with improvements in DNA isolation and declining sequencing costs, the taxonomic and geographic purview of paleogenomics is expanding at a rapid pace. With improved capacity to screen large numbers of samples for those with high proportions of endogenous ancient DNA, paleogenomics is poised to become a key technology to better understand recent evolutionary events.
Anatomical changes in extinct mammalian lineages over evolutionary time, such as the loss of fingers and teeth and the rapid increase in body size that accompanied the late Miocene dispersal of the progenitors of Steller's sea cows (Hydrodamalis gigas (Zimmermann, 1780)) into North Pacific waters and the convergent development of a thick pelage and accompanying reductions in ear and tail surface area of woolly mammoths (Mammuthus primigenius (Blumenbach, 1799)) and woolly rhinoceros (Coelodonta antiquitatis (Blumenbach, 1799)), are prime examples of adaptive evolution underlying the exploitation of new habitats. It is likely, however, that biochemical specializations adopted during these evolutionary transitions were of similar or even greater biological importance. As these "living" processes do not fossilize, direct information regarding the physiological attributes of extinct species has largely remained beyond the range of scientific inquiry. However, the ability to retrieve genomic sequences from ancient DNA samples, combined with ectopic expression systems, now permits the evolutionary origins and structural and functional properties of authentic prehistoric proteins to be examined in great detail. Exponential technical advances in ancient DNA retrieval, enrichment, and sequencing will soon permit targeted generation of complete genomes from hundreds of extinct species across the last one million years that, in combination with emerging in vitro expression, genome engineering, and cell differentiation techniques, promises to herald an exciting new trajectory of evolutionary research at the interface of biochemistry, genomics, palaeontology, and cell biology.
Ancient mitochondrial DNA and the genetic history of Eurasian beaver (Castor fiber) in Europe
(2014)
After centuries of human hunting, the Eurasian beaver Castor fiber had disappeared from most of its original range by the end of the 19th century. The surviving relict populations are characterized by both low genetic diversity and strong phylogeographical structure. However, it remains unclear whether these attributes are the result of a human-induced, late Holocene bottleneck or already existed prior to this reduction in range. To investigate genetic diversity in Eurasian beaver populations during the Holocene, we obtained mitochondrial control region DNA sequences from 48 ancient beaver samples and added 152 modern sequences from GenBank. Phylogeographical analyses of the data indicate a differentiation of European beaver populations into three mitochondrial clades. The two main clades occur in western and eastern Europe, respectively, with an early Holocene contact zone in eastern Europe near a present-day contact zone. A divergent and previously unknown clade of beavers from the Danube Basin survived until at least 6000 years ago, but went extinct during the transition to modern times. Finally, we identify a recent decline in effective population size of Eurasian beavers, with a stronger bottleneck signal in the western than in the eastern clade. Our results suggest that the low genetic diversity and the strong phylogeographical structure in recent beavers are artefacts of human hunting-associated population reductions. While beaver populations have been growing rapidly since the late 19th century, genetic diversity within modern beaver populations remains considerably reduced compared to what was present prior to the period of human hunting and habitat reduction.
For over a hundred years, the "river sharks" of the genus Glyphis were only known from the type specimens of species that had been collected in the 19th century. They were widely considered extinct until populations of Glyphis-like sharks were rediscovered in remote regions of Borneo and Northern Australia at the end of the 20th century. However, the genetic affinities between the newly discovered Glyphis-like populations and the poorly preserved, original museum-type specimens have never been established. Here, we present the first (to our knowledge) fully resolved, complete phylogeny of Glyphis that includes both archival-type specimens and modern material. We used a sensitive DNA hybridization capture method to obtain complete mitochondrial genomes from all of our samples and show that three of the five described river shark species are probably conspecific and widely distributed in Southeast Asia. Furthermore we show that there has been recent gene flow between locations that are separated by large oceanic expanses. Our data strongly suggest marine dispersal in these species, overturning the widely held notion that river sharks are restricted to freshwater. It seems that species in the genus Glyphis are euryhaline with an ecology similar to the bull shark, in which adult individuals live in the ocean while the young grow up in river habitats with reduced predation pressure. Finally, we discovered a previously unidentified species within the genus Glyphis that is deeply divergent from all other lineages, underscoring the current lack of knowledge about the biodiversity and ecology of these mysterious sharks.
Background: The forelimb-specific gene tbx5 is highly conserved and essential for the development of forelimbs in zebrafish, mice, and humans. Amongst birds, a single order, Dinornithiformes, comprising the extinct wingless moa of New Zealand, are unique in having no skeletal evidence of forelimb-like structures.
Results: To determine the sequence of tbx5 in moa, we used a range of PCR-based techniques on ancient DNA to retrieve all nine tbx5 exons and splice sites from the giant moa, Dinornis. Moa Tbx5 is identical to chicken Tbx5 in being able to activate the downstream promoters of fgf10 and ANF. In addition we show that misexpression of moa tbx5 in the hindlimb of chicken embryos results in the formation of forelimb features, suggesting that Tbx5 was fully functional in wingless moa. An alternatively spliced exon 1 for tbx5 that is expressed specifically in the forelimb region was shown to be almost identical between moa and ostrich, suggesting that, as well as being fully functional, tbx5 is likely to have been expressed normally in moa since divergence from their flighted ancestors, approximately 60 mya.
Background: Kiwi, comprising five species from the genus Apteryx, are endangered, ground-dwelling bird species endemic to New Zealand. They are the smallest and only nocturnal representatives of the ratites. The timing of kiwi adaptation to a nocturnal niche and the genomic innovations, which shaped sensory systems and morphology to allow this adaptation, are not yet fully understood.
Results: We sequenced and assembled the brown kiwi genome to 150-fold coverage and annotated the genome using kiwi transcript data and non-redundant protein information from multiple bird species. We identified evolutionary sequence changes that underlie adaptation to nocturnality and estimated the onset time of these adaptations. Several opsin genes involved in color vision are inactivated in the kiwi. We date this inactivation to the Oligocene epoch, likely after the arrival of the ancestor of modern kiwi in New Zealand. Genome comparisons between kiwi and representatives of ratites, Galloanserae, and Neoaves, including nocturnal and song birds, show diversification of kiwi's odorant receptors repertoire, which may reflect an increased reliance on olfaction rather than sight during foraging. Further, there is an enrichment of genes influencing mitochondrial function and energy expenditure among genes that are rapidly evolving specifically on the kiwi branch, which may also be linked to its nocturnal lifestyle.
Conclusions: The genomic changes in kiwi vision and olfaction are consistent with changes that are hypothesized to occur during adaptation to nocturnal lifestyle in mammals. The kiwi genome provides a valuable genomic resource for future genome-wide comparative analyses to other extinct and extant diurnal ratites.
Chickens represent by far the most important poultry species, yet the number, locations, and timings of their domestication have remained controversial for more than a century. Here we report ancient mitochondrial DNA sequences from the earliest archaeological chicken bones from China, dating back to ~10,000 B.P. The results clearly show that all investigated bones, including the oldest from the Nanzhuangtou site, are derived from the genus Gallus, rather than any other related genus, such as Phasianus. Our analyses also suggest that northern China represents one region of the earliest chicken domestication, possibly dating as early as 10,000 y B.P. Similar to the evidence from pig domestication, our results suggest that these early domesticated chickens contributed to the gene pool of modern chicken populations. Moreover, our results support the idea that multiple members of the genus Gallus, specifically Gallus gallus and Gallus sonneratii, contributed to the gene pool of the modern domestic chicken. Our results provide further support for the growing evidence of an early mixed agricultural complex in northern China.
Faunal remains from Palaeolithic sites are important genetic sources to study preglacial and postglacial populations and to investigate the effect of climate change and human impact. Post mortem decay, resulting in fragmented and chemically modified DNA, is a key obstacle in ancient DNA analyses. In the absence of reliable methods to determine the presence of endogenous DNA in sub-fossil samples, temporal and spatial surveys of DNA survival on a regional scale may help to estimate the potential of faunal remains from a given time period and region. We therefore investigated PCR amplification success, PCR performance and post mortem damage in c. 47,000 to c. 12,000-year-old horse remains from 14 Palaeolithic sites along the Swiss Jura Mountains in relation to depositional context, tissue type, storage time and age, potentially influencing DNA preservation. The targeted 75 base pair mitochondrial DNA fragment could be amplified solely from equid remains from caves and not from any of the open dry and (temporary) wetland sites. Whether teeth are better than bones cannot be ultimately decided; however, both storage time after excavation and age significantly affect PCR amplification and performance, albeit not in a linear way. This is best explained by the (inevitable) heterogeneity of the data set. The extent of post mortem damage is not related to any of the potential impact factors. The results encourage comprehensive investigations of Palaeolithic cave sites, even from temperate regions.
Background: Skewed body size distributions and the high relative richness of small-bodied taxa are a fundamental property of a wide range of animal clades. The evolutionary processes responsible for generating these distributions are well described in vertebrate model systems but have yet to be explored in detail for other major terrestrial clades. In this study, we explore the macro-evolutionary patterns of body size variation across families of Hexapoda (insects and their close relatives), using recent advances in phylogenetic understanding, with an aim to investigate the link between size and diversity within this ancient and highly diverse lineage.
Results: The maximum, minimum and mean-log body lengths of hexapod families are all approximately log-normally distributed, consistent with previous studies at lower taxonomic levels, and contrasting with skewed distributions typical of vertebrate groups. After taking phylogeny and within-tip variation into account, we find no evidence for a negative relationship between diversification rate and body size, suggesting decoupling of the forces controlling these two traits. Likelihood-based modeling of the log-mean body size identifies distinct processes operating within Holometabola and Diptera compared with other hexapod groups, consistent with accelerating rates of size evolution within these clades, while as a whole, hexapod body size evolution is found to be dominated by neutral processes including significant phylogenetic conservatism.
Conclusions: Based on our findings we suggest that the use of models derived from well-studied but atypical clades, such as vertebrates may lead to misleading conclusions when applied to other major terrestrial lineages. Our results indicate that within hexapods, and within the limits of current systematic and phylogenetic knowledge, insect diversification is generally unfettered by size-biased macro-evolutionary processes, and that these processes over large timescales tend to converge on apparently neutral evolutionary processes. We also identify limitations on available data within the clade and modeling approaches for the resolution of trees of higher taxa, the resolution of which may collectively enhance our understanding of this key component of terrestrial ecosystems.
Eusociality is one of the most complex forms of social organization, characterized by cooperative and reproductive units termed colonies. Altruistic behavior of workers within colonies is explained by inclusive fitness, with indirect fitness benefits accrued by helping kin. Members of a social insect colony are expected to be more closely related to one another than they are to other conspecifics. In many social insects, the colony can extend to multiple socially connected but spatially separate nests (polydomy). Social connections, such as trails between nests, promote cooperation and resource exchange, and we predict that workers from socially connected nests will have higher internest relatedness than those from socially unconnected, and noncooperating, nests. We measure social connections, resource exchange, and internest genetic relatedness in the polydomous wood ant Formica lugubris to test whether (1) socially connected but spatially separate nests cooperate, and (2) high internest relatedness is the underlying driver of this cooperation. Our results show that socially connected nests exhibit movement of workers and resources, which suggests they do cooperate, whereas unconnected nests do not. However, we find no difference in internest genetic relatedness between socially connected and unconnected nest pairs; both show high kinship. Our results suggest that neighboring pairs of connected nests show a social and cooperative distinction, but no genetic distinction. We hypothesize that the loss of a social connection may initiate ecological divergence within colonies. Genetic divergence between neighboring nests may build up only later, as a consequence rather than a cause of colony separation.
Horses have been valued for their diversity of coat colour since prehistoric times; this is especially the case since their domestication in the Caspian steppe in ~3,500 BC. Although we can assume that human preferences were not constant, we have only anecdotal information about how domestic horses were influenced by humans. Our results from genotype analyses show a significant increase in spotted coats in early domestic horses (Copper Age to Iron Age). In contrast, medieval horses carried significantly fewer alleles for these phenotypes, whereas solid phenotypes (i.e., chestnut) became dominant. This shift may have been supported because of (i) pleiotropic disadvantages, (ii) a reduced need to separate domestic horses from their wild counterparts, (iii) a lower religious prestige, or (iv) novel developments in weaponry. These scenarios may have acted alone or in combination. However, the dominance of chestnut is a remarkable feature of the medieval horse population.
The origin of ambling horses
(2016)
Horseback riding is the most fundamental use of domestic horses and has had a huge influence on the development of human societies for millennia. Over time, riding techniques and the style of riding improved. Therefore, horses with the ability to perform comfortable gaits (e.g. ambling or pacing), so-called ‘gaited’ horses, have been highly valued by humans, especially for long distance travel. Recently, the causative mutation for gaitedness in horses has been linked to a substitution causing a premature stop codon in the DMRT3 gene (DMRT3_Ser301STOP) [1]. In mice, Dmrt3 is expressed in spinal cord interneurons and plays an important role in the development of limb movement coordination [1]. Genotyping the position in 4396 modern horses from 141 breeds revealed that nowadays the mutated allele is distributed worldwide with an especially high frequency in gaited horses and breeds used for harness racing [2]. Here, we examine historic horse remains for the DMRT3 SNP, tracking the origin of gaitedness to Medieval England between 850 and 900 AD. The presence of the corresponding allele in Icelandic horses (9th–11th century) strongly suggests that ambling horses were brought from the British Isles to Iceland by Norse people. Considering the high frequency of the ambling allele in early Icelandic horses, we believe that Norse settlers selected for this comfortable mode of horse riding soon after arrival. The absence of the allele in samples from continental Europe (including Scandinavia) at this time implies that ambling horses may have spread from Iceland and maybe also the British Isles across the continent at a later date.
Dromedaries have been fundamental to the development of human societies in arid landscapes and for long-distance trade across hostile hot terrains for 3,000 y. Today they continue to be an important livestock resource in marginal agro-ecological zones. However, the history of dromedary domestication and the influence of ancient trading networks on their genetic structure have remained elusive. We combined ancient DNA sequences of wild and early-domesticated dromedary samples from arid regions with nuclear microsatellite and mitochondrial genotype information from 1,083 extant animals collected across the species’ range. We observe little phylogeographic signal in the modern population, indicative of extensive gene flow affecting virtually all regions except East Africa, where dromedary populations have remained relatively isolated. In agreement with archaeological findings, we identify wild dromedaries from the southeast Arabian Peninsula among the founders of the domestic dromedary gene pool. Approximate Bayesian computations further support the “restocking from the wild” hypothesis, with an initial domestication followed by introgression from individuals from wild, now-extinct populations. Compared with other livestock, which show a long history of gene flow with their wild ancestors, we find a high initial diversity relative to the native distribution of the wild ancestor on the Arabian Peninsula and to the brief coexistence of early-domesticated and wild individuals. This study also demonstrates the potential to retrieve ancient DNA sequences from osseous remains excavated in hot and dry desert environments.
There is the tendency to assume that endangered species have been both genetically and demographically healthier in the past, so that any genetic erosion observed today was caused by their recent decline. The Iberian lynx (Lynx pardinus) suffered a dramatic and continuous decline during the 20th century, and now shows extremely low genome- and species-wide genetic diversity among other signs of genomic erosion. We analyze ancient (N = 10), historical (N = 245), and contemporary (N = 172) samples with microsatellite and mitogenome data to reconstruct the species' demography and investigate patterns of genetic variation across space and time. Iberian lynx populations transitioned from low but significantly higher genetic diversity than today and shallow geographical differentiation millennia ago, through a structured metapopulation with varying levels of diversity during the last centuries, to two extremely genetically depauperate and differentiated remnant populations by 2002. The historical subpopulations show varying extents of genetic drift in relation to their recent size and time in isolation, but these do not predict whether the populations persisted or went finally extinct. In conclusion, current genetic patterns were mainly shaped by genetic drift, supporting the current admixture of the two genetic pools and calling for a comprehensive genetic management of the ongoing conservation program. This study illustrates how a retrospective analysis of demographic and genetic patterns of endangered species can shed light onto their evolutionary history and this, in turn, can inform conservation actions.
Climate impacts on transocean dispersal and habitat in gray whales from the Pleistocene to 2100
(2015)
Arctic animals face dramatic habitat alteration due to ongoing climate change. Understanding how such species have responded to past glacial cycles can help us forecast their response to today's changing climate. Gray whales are among those marine species likely to be strongly affected by Arctic climate change, but a thorough analysis of past climate impacts on this species has been complicated by lack of information about an extinct population in the Atlantic. While little is known about the history of Atlantic gray whales or their relationship to the extant Pacific population, the extirpation of the Atlantic population during historical times has been attributed to whaling. We used a combination of ancient and modern DNA, radiocarbon dating and predictive habitat modelling to better understand the distribution of gray whales during the Pleistocene and Holocene. Our results reveal that dispersal between the Pacific and Atlantic was climate dependent and occurred both during the Pleistocene prior to the last glacial period and the early Holocene immediately following the opening of the Bering Strait. Genetic diversity in the Atlantic declined over an extended interval that predates the period of intensive commercial whaling, indicating this decline may have been precipitated by Holocene climate or other ecological causes. These first genetic data for Atlantic gray whales, particularly when combined with predictive habitat models for the year 2100, suggest that two recent sightings of gray whales in the Atlantic may represent the beginning of the expansion of this species' habitat beyond its currently realized range.
Paging through history: parchment as a reservoir of ancient DNA for next generation sequencing
(2015)
Parchment represents an invaluable cultural reservoir. Retrieving an additional layer of information from these abundant, dated livestock-skins via the use of ancient DNA (aDNA) sequencing has been mooted by a number of researchers. However, prior PCR-based work has indicated that this may be challenged by cross-individual and cross-species contamination, perhaps from the bulk parchment preparation process. Here we apply next generation sequencing to two parchments of seventeenth and eighteenth century northern English provenance. Following alignment to the published sheep, goat, cow and human genomes, it is clear that the only genome displaying substantial unique homology is sheep and this species identification is confirmed by collagen peptide mass spectrometry. Only 4% of sequence reads align preferentially to a different species indicating low contamination across species. Moreover, mitochondrial DNA sequences suggest an upper bound of contamination at 5%. Over 45% of reads aligned to the sheep genome, and even this limited sequencing exercise yielded 9 and 7% of each sampled sheep genome post filtering, allowing the mapping of genetic affinity to modern British sheep breeds. We conclude that parchment represents an excellent substrate for genomic analyses of historical livestock.
This is a reply to the comments of Morey (2014) on our identification of Palaeolithic dogs from several European Palaeolithic sites. In his comments Morey (2014) presents some misrepresentations and misunderstandings that we remedy here. In contrast to what Morey (2014) propounds, our results suggest that the domestication of the wolf was a long process that started early in the Upper Palaeolithic and that since that time two sympatric canid morphotypes can be seen in Eurasian sites: Pleistocene wolves and Palaeolithic dogs. Contrary to Morey (2014), we are convinced that the study of this domestication process should be multidisciplinary.
Faunal remains from Palaeolithic sites are important genetic sources to study preglacial and postglacial populations and to investigate the effect of climate change and human impact. Post mortem decay, resulting in fragmented and chemically modified DNA, is a key obstacle in ancient DNA analyses. In the absence of reliable methods to determine the presence of endogenous DNA in sub-fossil samples, temporal and spatial surveys of DNA survival on a regional scale may help to estimate the potential of faunal remains from a given time period and region. We therefore investigated PCR amplification success, PCR performance and post mortem damage in c. 47,000 to c. 12,000-year-old horse remains from 14 Palaeolithic sites along the Swiss Jura Mountains in relation to depositional context, tissue type, storage time and age, potentially influencing DNA preservation. The targeted 75 base pair mitochondrial DNA fragment could be amplified solely from equid remains from caves and not from any of the open dry and (temporary) wetland sites. Whether teeth are better than bones cannot be ultimately decided; however, both storage time after excavation and age significantly affect PCR amplification and performance, albeit not in a linear way. This is best explained by the—inevitable—heterogeneity of the data set. The extent of post mortem damage is not related to any of the potential impact factors. The results encourage comprehensive investigations of Palaeolithic cave sites, even from temperate regions.
Background
Contiguous genome assemblies are a highly valued biological resource because of the higher number of completely annotated genes and genomic elements that are usable compared to fragmented draft genomes. Nonetheless, contiguity is difficult to obtain if only low coverage data and/or only distantly related reference genome assemblies are available.
Findings
In order to improve genome contiguity, we have developed Cross-Species Scaffolding—a new pipeline that imports long-range distance information directly into the de novo assembly process by constructing mate-pair libraries in silico.
Conclusions
We show how genome assembly metrics and gene prediction dramatically improve with our pipeline by assembling two primate genomes solely based on ∼30x coverage of shotgun sequencing data.
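The idea of constructing mate-pair libraries in silico can be illustrated with a minimal Python sketch. It is not the Cross-Species Scaffolding pipeline itself: the function names, the tiling scheme, and the read orientation are assumptions made for illustration, since the abstract does not specify them. The sketch slides a window of a chosen insert size along a reference sequence and emits the two ends of each window as a read pair.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T, N only)."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}
    return "".join(comp[base] for base in reversed(seq.upper()))


def in_silico_mate_pairs(reference: str, insert_size: int, read_len: int, step: int):
    """Tile a reference with windows of `insert_size` and return paired end reads.

    Hypothetical helper: the orientation chosen here (left end as-is,
    right end reverse-complemented) is an assumption, not taken from the
    published pipeline.
    """
    pairs = []
    for start in range(0, len(reference) - insert_size + 1, step):
        fragment = reference[start:start + insert_size]
        left = fragment[:read_len]                        # read from the left end
        right = reverse_complement(fragment[-read_len:])  # read from the right end
        pairs.append((left, right))
    return pairs


# Toy example: 10 bp reference, 8 bp inserts, 3 bp reads, window step of 2 bp
pairs = in_silico_mate_pairs("ACGTACGTAC", insert_size=8, read_len=3, step=2)
```

In practice the reference would be a related species' genome and the insert sizes would span kilobases, so that the pairs carry long-range distance information into the assembler.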
Palaeogenomes of Eurasian straight-tusked elephants challenge the current view of elephant evolution
(2017)
The straight-tusked elephants Palaeoloxodon spp. were widespread across Eurasia during the Pleistocene. Phylogenetic reconstructions using morphological traits have grouped them with Asian elephants (Elephas maximus), and many paleontologists place Palaeoloxodon within Elephas. Here, we report the recovery of full mitochondrial genomes from four and partial nuclear genomes from two P. antiquus fossils. These fossils were collected at two sites in Germany, Neumark-Nord and Weimar-Ehringsdorf, and likely date to interglacial periods ~120 and ~244 thousand years ago, respectively. Unexpectedly, nuclear and mitochondrial DNA analyses suggest that P. antiquus was a close relative of extant African forest elephants (Loxodonta cyclotis). Species previously referred to Palaeoloxodon are thus most parsimoniously explained as having diverged from the lineage of Loxodonta, indicating that Loxodonta has not been constrained to Africa. Our results demonstrate that the current picture of elephant evolution is in need of substantial revision.
The genomic changes underlying both early and late stages of horse domestication remain largely unknown. We examined the genomes of 14 early domestic horses from the Bronze and Iron Ages, dating to between ~4.1 and 2.3 thousand years before present. We find early domestication selection patterns supporting the neural crest hypothesis, which provides a unified developmental origin for common domestic traits. Within the past 2.3 thousand years, horses lost genetic diversity and archaic DNA tracts introgressed from a now-extinct lineage. They accumulated deleterious mutations later than expected under the cost-of-domestication hypothesis, probably because of breeding from limited numbers of stallions. We also reveal that Iron Age Scythian steppe nomads implemented breeding strategies involving no detectable inbreeding and selection for coat-color variation and robust forelimbs.
The transition from hunting and gathering to farming involved profound cultural and technological changes. In Western and Central Europe, these changes occurred rapidly and synchronously after the arrival of early farmers of Anatolian origin [1-3], who largely replaced the local Mesolithic hunter-gatherers [1, 4-6]. Further east, in the Baltic region, the transition was gradual, with little or no genetic input from incoming farmers [7]. Here we use ancient DNA to investigate the relationship between hunter-gatherers and farmers in the Lower Danube basin, a geographically intermediate area that is characterized by a rapid Neolithic transition but also by the presence of archaeological evidence that points to cultural exchange, and thus possible admixture, between hunter-gatherers and farmers. We recovered four human paleogenomes (1.1x to 4.1x coverage) from Romania spanning a time transect between 8.8 thousand years ago (kya) and 5.4 kya and supplemented them with two Mesolithic genomes (1.7x and 5.3x) from Spain to provide further context on the genetic background of Mesolithic Europe. Our results show major Western hunter-gatherer (WHG) ancestry in a Romanian Eneolithic sample with a minor, but sizeable, contribution from Anatolian farmers, suggesting multiple admixture events between hunter-gatherers and farmers. Dietary stable isotope analysis of this sample suggests a mixed terrestrial/aquatic diet. Our results provide support for complex interactions among hunter-gatherers and farmers in the Danube basin, demonstrating that in some regions, demic and cultural diffusion were not mutually exclusive, but merely the ends of a continuum for the process of Neolithization.
Domestic cattle were brought to Spain by early settlers and agricultural societies. Due to missing Neolithic sites in the Spanish region of Galicia, very little is known about this process in this region. We sampled 18 cattle subfossils from different ages and different mountain caves in Galicia, of which 11 were subject to sequencing of the mitochondrial genome and phylogenetic analysis, to provide insight into the introduction of cattle to this region. We detected high similarity between samples from different time periods and were able to compare the time frame of the first domesticated cattle in Galicia to data from the connecting region of Cantabria to show a plausible connection between the Neolithization of these two regions. Our data shows a close relationship of the early domesticated cattle of Galicia and modern cow breeds and gives a general insight into cattle phylogeny. We conclude that settlers migrated to this region of Spain from Europe and introduced common European breeds to Galicia.
The Great Hungarian Plain was a crossroads of cultural transformations that have shaped European prehistory. Here we analyse a 5,000-year transect of human genomes, sampled from petrous bones giving consistently excellent endogenous DNA yields, from 13 Hungarian Neolithic, Copper, Bronze and Iron Age burials including two to high (~22x) and seven to ~1x coverage, to investigate the impact of these on Europe's genetic landscape. These data suggest genomic shifts with the advent of the Neolithic, Bronze and Iron Ages, with interleaved periods of genome stability. The earliest Neolithic context genome shows a European hunter-gatherer genetic signature and a restricted ancestral population size, suggesting direct contact between cultures after the arrival of the first farmers into Europe. The latest, Iron Age, sample reveals an eastern genomic influence concordant with introduced Steppe burial rites. We observe a transition towards lighter pigmentation and, surprisingly, no Neolithic presence of lactase persistence.
By combining high-throughput sequencing with target enrichment (‘hybridization capture’), researchers are able to obtain molecular data from genomic regions of interest for projects that are otherwise constrained by sample quality (e.g. degraded and contamination-rich samples) or a lack of a priori sequence information (e.g. studies on nonmodel species). Despite the use of hybridization capture in various fields of research for many years, the impact of enrichment conditions on capture success is not yet thoroughly understood. We evaluated the impact of a key parameter – hybridization temperature – on the capture success of mitochondrial genomes across the carnivoran family Felidae. Capture was carried out for a range of sample types (fresh, archival, ancient) with varying levels of sequence divergence between bait and target (i.e. across a range of species) using pools of individually indexed libraries on Agilent SureSelect™ arrays. Our results suggest that hybridization capture protocols require specific optimization for the sample type that is being investigated. Hybridization temperature affected the proportion of on-target sequences following capture: for degraded samples, we obtained the best results with a hybridization temperature of 65 °C, while a touchdown approach (65 °C down to 50 °C) yielded the best results for fresh samples. Evaluation of capture performance at a regional scale (sliding window approach) revealed no significant improvement in the recovery of DNA fragments with high sequence divergence from the bait at any of the tested hybridization temperatures, suggesting that hybridization temperature may not be the critical parameter for the enrichment of divergent fragments.
Targeted capture coupled with high-throughput sequencing can be used to gain information about nuclear sequence variation at hundreds to thousands of loci. Divergent reference capture makes use of molecular data of one species to enrich target loci in other (related) species. This is particularly valuable for nonmodel organisms, for which often no a priori knowledge exists regarding these loci. Here, we have used targeted capture to obtain data for 809 nuclear coding DNA sequences (CDS) in a nonmodel organism, the Eurasian lynx Lynx lynx, using baits designed with the help of the published genome of a related model organism (the domestic cat Felis catus). Using this approach, we were able to survey intraspecific variation at hundreds of nuclear loci in L. lynx across the species’ European range. A large set of biallelic candidate SNPs was then evaluated using a high-throughput SNP genotyping platform (Fluidigm), which we then reduced to a final 96 SNP-panel based on assay performance and reliability; validation was carried out with 100 additional Eurasian lynx samples not included in the SNP discovery phase. The 96 SNP-panel developed from CDS performed very successfully in the identification of individuals and in population genetic structure inference (including the assignment of individuals to their source population). In keeping with recent studies, our results show that genic SNPs can be valuable for genetic monitoring of wildlife species.
Utilising a reconstructed ancestral mitochondrial genome of a clade to design hybridisation capture baits can provide the opportunity for recovering mitochondrial sequences from all its descendent and even sister lineages. This approach is useful for taxa with no extant close relatives, as is often the case for rare or extinct species, and is a viable approach for the analysis of historical museum specimens. Asiatic linsangs (genus Prionodon) exemplify this situation, being rare Southeast Asian carnivores for which little molecular data is available. Using ancestral capture we recover partial mitochondrial genome sequences for seven banded linsangs (P. linsang) from historical specimens, representing the first intraspecific genetic dataset for this species. We additionally assemble a high quality mitogenome for the banded linsang using shotgun sequencing for time-calibrated phylogenetic analysis. This reveals a deep divergence between the two Asiatic linsang species (P. linsang, P. pardicolor), with an estimated divergence of ~12 million years (Ma). Although our sample size precludes any robust interpretation of the population structure of the banded linsang, we recover two distinct matrilines with an estimated tMRCA of ~1 Ma. Our results can be used as a basis for further investigation of the Asiatic linsangs, and further demonstrate the utility of ancestral capture for studying divergent taxa without close relatives.
Insights into the geographical origin and phylogeographical patterns of Paradisaea birds-of-paradise
(2022)
Birds-of-paradise represent a textbook example for geographical speciation and sexual selection. Perhaps the most iconic genus is Paradisaea, which is restricted to New Guinea and a few surrounding islands. Although several species concepts have been applied in the past to disentangle the different entities within this genus, no attempt has been made so far to uncover phylogeographical patterns based on a genetic dataset that includes multiple individuals per species. Here, we applied amplicon sequencing for the mitochondrial fragment Cytb for a total of 69 museum specimens representing all seven Paradisaea species described and inferred both phylogenetic relationships and colonization pathways across the island. Our analyses show that the most recent common ancestor of the diverging lineages within Paradisaea probably originated in the Late Miocene in the eastern part of the Central Range and suggest that tectonic processes played a key role in shaping the diversification and distribution of species. All species were recovered as monophyletic, except for those within the apoda-minor-raggiana clade, which comprises the allopatric and parapatric species P. apoda, P. minor and P. raggiana. The comparatively young divergence times, together with possible instances of mitochondrial introgression and incomplete lineage sorting, suggest recent speciation in this clade.
After initial detection of target archival DNA of a 116-year-old syntype specimen of the smooth lantern shark, Etmopterus pusillus, in a single-stranded DNA library, we shotgun-sequenced an additional 9 million reads from this same DNA library. Sequencing reads were used for extracting mitochondrial sequence information for analyses of mitochondrial DNA characteristics and reconstruction of the mitochondrial genome. The archival DNA is highly fragmented. A total of 4599 mitochondrial reads were available for the genome reconstruction using an iterative mapping approach. The resulting genome sequence has 12 times coverage and a length of 16 741 bp. All 37 vertebrate mitochondrial loci plus the control region were identified and annotated. The mitochondrial NADH2 gene was subsequently used to place the syntype haplotype in a network comprising multiple E. pusillus samples from various distant localities as well as sequences from a morphologically similar species, the shortfin smooth lantern shark Etmopterus joungi. Results confirm the almost global distribution of E. pusillus and suggest E. joungi to be a junior synonym of E. pusillus. As mitochondrial DNA often represents the only available reference information in non-model organisms, this study illustrates the importance of mitochondrial DNA from an aged, wet collection type specimen for taxonomy.
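The read count, coverage, and genome length reported for the lantern shark mitogenome can be related through the usual definition of mean coverage (total mapped bases divided by genome length). The short Python sketch below rearranges that definition to estimate the mean mapped read length implied by the three figures. The function name is hypothetical, and the result is only approximate because the "12 times" coverage is a rounded value and all reads are assumed to map in full.

```python
def implied_mean_read_length(n_reads: int, genome_length_bp: int, mean_coverage: float) -> float:
    """Mean read length implied by coverage = n_reads * read_length / genome_length."""
    return mean_coverage * genome_length_bp / n_reads


# Figures from the abstract: 4599 reads, 16 741 bp genome, roughly 12x coverage
read_len = implied_mean_read_length(4599, 16741, 12)
print(f"implied mean read length: {read_len:.1f} bp")
```

The implied mean length is under 50 bp, which fits the abstract's statement that the archival DNA is highly fragmented.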
(1) Background:
Adaptive diversification of complex traits plays a pivotal role in the evolution of organismal diversity. In the freshwater snail genus Tylomelania, adaptive radiations were likely promoted by trophic specialization via diversification of their key foraging organ, the radula.
(2) Methods:
To investigate the molecular basis of radula diversification and its contribution to lineage divergence, we used tissue-specific transcriptomes of two sympatric Tylomelania sarasinorum ecomorphs.
(3) Results:
We show that ecomorphs are genetically divergent lineages with habitat-correlated abundances. Sequence divergence and the proportion of highly differentially expressed genes are significantly higher between radula transcriptomes compared to the mantle and foot. However, the same is not true when all differentially expressed genes or only non-synonymous SNPs are considered. Finally, putative homologs of some candidate genes for radula diversification (hh, arx, gbb) were also found to contribute to trophic specialization in cichlids and Darwin's finches.
(4) Conclusions:
Our results are in line with diversifying selection on the radula driving Tylomelania ecomorph divergence and indicate that some molecular pathways may be especially prone to adaptive diversification, even across phylogenetically distant animal groups.
Iconographic evidence from Egypt suggests that watermelon pulp was consumed there as a dessert by 4,360 BP.
Earlier archaeobotanical evidence comes from seeds from Neolithic settlements in Libya, but whether these were watermelons with sweet pulp or other forms is unknown.
We generated genome sequences from 6,000- and 3,300-year-old seeds from Libya and Sudan, and from worldwide herbarium collections made between 1824 and 2019, and analyzed these data together with resequenced genomes from important germplasm collections for a total of 131 accessions.
Phylogenomic and population-genomic analyses reveal that (1) much of the nuclear genome of both ancient seeds is traceable to West African seed-use "egusi-type" watermelon (Citrullus mucosospermus) rather than domesticated pulp-use watermelon (Citrullus lanatus ssp. vulgaris); (2) the 6,000-year-old watermelon likely had bitter pulp and greenish-white flesh as today found in C. mucosospermus, given alleles in the bitterness regulators ClBT and in the red color marker LYCB; and (3) both ancient genomes showed admixture from C. mucosospermus, C. lanatus ssp. cordophanus, C. lanatus ssp. vulgaris, and even South African Citrullus amarus, and evident introgression between the Libyan seed (UMB-6) and populations of C. lanatus.
An unexpected new insight is that Citrullus appears to have initially been collected or cultivated for its seeds, not its flesh, consistent with seed damage patterns induced by human teeth in the oldest Libyan material.
The subgenus Laurentomantis in the genus Gephyromantis contains some of the least known amphibian species of Madagascar. The six currently valid nominal species are rainforest frogs known from few individuals, hampering a full understanding of the species diversity of the clade. We assembled data on specimens collected during field surveys over the past 30 years and integrated analysis of mitochondrial and nuclear-encoded genes of 88 individuals, a comprehensive bioacoustic analysis, and morphological comparisons to delimit a minimum of nine species-level lineages in the subgenus. To clarify the identity of the species Gephyromantis malagasius, we applied a target-enrichment approach to a sample of the 110 year old holotype of Microphryne malagasia Methuen and Hewitt, 1913 to assign this specimen to a lineage based on a mitochondrial DNA barcode. The holotype clustered unambiguously with specimens previously named G. ventrimaculatus. Consequently we propose to consider Trachymantis malagasia ventrimaculatus Angel, 1935 as a junior synonym of Gephyromantis malagasius. Due to this redefinition of G. malagasius, no scientific name is available for any of the four deep lineages of frogs previously subsumed under this name, all characterized by red color ventrally on the hindlimbs. These are here formally named as Gephyromantis fiharimpe sp. nov., G. matsilo sp. nov., G. oelkrugi sp. nov., and G. portonae sp. nov. The new species are distinguishable from each other by genetic divergences of >4% uncorrected pairwise distance in a fragment of the 16S rRNA marker and a combination of morphological and bioacoustic characters. Gephyromantis fiharimpe and G. matsilo occur, respectively, at mid-elevations and lower elevations along a wide stretch of Madagascar's eastern rainforest band, while G. oelkrugi and G. portonae appear to be more range-restricted in parts of Madagascar's North East and Northern Central East regions. Open taxonomic questions surround G. horridus, to which we here assign specimens from Montagne d'Ambre and the type locality Nosy Be; and G. ranjomavo, which contains genetically divergent populations from Marojejy, Tsaratanana, and Ampotsidy.
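The uncorrected pairwise distance used as the >4% threshold in the Laurentomantis abstract is the proportion of aligned sites at which two sequences differ. A minimal Python sketch follows, under the assumption that the sequences are already aligned and that sites with gaps or ambiguity codes are skipped. The function name and that skipping rule are illustrative choices, not the authors' stated procedure.

```python
def p_distance(seq_a: str, seq_b: str) -> float:
    """Uncorrected pairwise distance between two aligned DNA sequences.

    Sites where either sequence has a gap or a non-ACGT symbol are
    excluded from the comparison (an assumption of this sketch).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    valid = set("ACGT")
    compared = 0
    differences = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in valid and b in valid:
            compared += 1
            if a != b:
                differences += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differences / compared


# Toy example: one mismatch across eight comparable sites
distance = p_distance("ACGTACGT", "ACGTACGA")
exceeds_threshold = distance > 0.04  # the >4% criterion from the abstract
```

On a real 16S fragment of a few hundred base pairs, a distance above 0.04 corresponds to more than four differences per hundred comparable sites.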