Refine
Year of publication
Document Type
Language
- English (108)
Is part of the Bibliography
- yes (108)
Keywords
- ancient DNA (28)
- palaeogenomics (7)
- phylogeny (6)
- mitochondrial genome (5)
- Ancient DNA (4)
- Genomics (4)
- Mitochondria (4)
- admixture (4)
- hybridization capture (4)
- museum specimens (4)
Simultaneous Barcode Sequencing of Diverse Museum Collection Specimens Using a Mixed RNA Bait Set
(2022)
A growing number of publications presenting results from sequencing natural history collection specimens reflect the importance of DNA sequence information from such samples. Ancient DNA extraction and library preparation methods in combination with target gene capture are a way of unlocking archival DNA, including from formalin-fixed wet-collection material. Here we report on an experiment, in which we used an RNA bait set containing baits from a wide taxonomic range of species for DNA hybridisation capture of nuclear and mitochondrial targets for analysing natural history collection specimens. The bait set used consists of 2,492 mitochondrial and 530 nuclear RNA baits and comprises specific barcode loci of diverse animal groups including both invertebrates and vertebrates. The baits allowed to capture DNA sequence information of target barcode loci from 84% of the 37 samples tested, with nuclear markers being captured more frequently and consensus sequences of these being more complete compared to mitochondrial markers. Samples from dry material had a higher rate of success than wet-collection specimens, although target sequence information could be captured from 50% of formalin-fixed samples. Our study illustrates how efforts to obtain barcode sequence information from natural history collection specimens may be combined and are a way of implementing barcoding inventories of scientific collection material.
Simultaneous Barcode Sequencing of Diverse Museum Collection Specimens Using a Mixed RNA Bait Set
(2022)
A growing number of publications presenting results from sequencing natural history collection specimens reflect the importance of DNA sequence information from such samples. Ancient DNA extraction and library preparation methods in combination with target gene capture are a way of unlocking archival DNA, including from formalin-fixed wet-collection material. Here we report on an experiment, in which we used an RNA bait set containing baits from a wide taxonomic range of species for DNA hybridisation capture of nuclear and mitochondrial targets for analysing natural history collection specimens. The bait set used consists of 2,492 mitochondrial and 530 nuclear RNA baits and comprises specific barcode loci of diverse animal groups including both invertebrates and vertebrates. The baits allowed to capture DNA sequence information of target barcode loci from 84% of the 37 samples tested, with nuclear markers being captured more frequently and consensus sequences of these being more complete compared to mitochondrial markers. Samples from dry material had a higher rate of success than wet-collection specimens, although target sequence information could be captured from 50% of formalin-fixed samples. Our study illustrates how efforts to obtain barcode sequence information from natural history collection specimens may be combined and are a way of implementing barcoding inventories of scientific collection material.
The prevalence of contaminant microbial DNA in ancient bone samples represents the principal limiting factor for palaeogenomic studies, as it may comprise more than 99% of DNA molecules obtained. Efforts to exclude or reduce this contaminant fraction have been numerous but also variable in their success. Here, we present a simple but highly effective method to increase the relative proportion of endogenous molecules obtained from ancient bones. Using computed tomography (CT) scanning, we identify the densest region of a bone as optimal for sampling. This approach accurately identifies the densest internal regions of petrous bones, which are known to be a source of high-purity ancient DNA. For ancient long bones, CT scans reveal a high-density outermost layer, which has been routinely removed and discarded prior to DNA extraction. For almost all long bones investigated, we find that targeted sampling of this outermost layer provides an increase in endogenous DNA content over that obtained from softer, trabecular bone. This targeted sampling can produce as much as 50-fold increase in the proportion of endogenous DNA, providing a directly proportional reduction in sequencing costs for shotgun sequencing experiments. The observed increases in endogenous DNA proportion are not associated with any reduction in absolute endogenous molecule recovery. Although sampling the outermost layer can result in higher levels of human contamination, some bones were found to have more contamination associated with the internal bone structures. Our method is highly consistent, reproducible and applicable across a wide range of bone types, ages and species. We predict that this discovery will greatly extend the potential to study ancient populations and species in the genomics era.
Dromedaries have been fundamental to the development of human societies in arid landscapes and for long-distance trade across hostile hot terrains for 3,000 y. Today they continue to be an important livestock resource in marginal agro-ecological zones. However, the history of dromedary domestication and the influence of ancient trading networks on their genetic structure have remained elusive. We combined ancient DNA sequences of wild and early-domesticated dromedary samples from arid regions with nuclear microsatellite and mitochondrial genotype information from 1,083 extant animals collected across the species’ range. We observe little phylogeographic signal in the modern population, indicative of extensive gene flow and virtually affecting all regions except East Africa, where dromedary populations have remained relatively isolated. In agreement with archaeological findings, we identify wild dromedaries from the southeast Arabian Peninsula among the founders of the domestic dromedary gene pool. Approximate Bayesian computations further support the “restocking from the wild” hypothesis, with an initial domestication followed by introgression from individuals from wild, now-extinct populations. Compared with other livestock, which show a long history of gene flow with their wild ancestors, we find a high initial diversity relative to the native distribution of the wild ancestor on the Arabian Peninsula and to the brief coexistence of early-domesticated and wild individuals. This study also demonstrates the potential to retrieve ancient DNA sequences from osseous remains excavated in hot and dry desert environments.
Climate impacts on transocean dispersal and habitat in gray whales from the Pleistocene to 2100
(2015)
Arctic animals face dramatic habitat alteration due to ongoing climate change. Understanding how such species have responded to past glacial cycles can help us forecast their response to today's changing climate. Gray whales are among those marine species likely to be strongly affected by Arctic climate change, but a thorough analysis of past climate impacts on this species has been complicated by lack of information about an extinct population in the Atlantic. While little is known about the history of Atlantic gray whales or their relationship to the extant Pacific population, the extirpation of the Atlantic population during historical times has been attributed to whaling. We used a combination of ancient and modern DNA, radiocarbon dating and predictive habitat modelling to better understand the distribution of gray whales during the Pleistocene and Holocene. Our results reveal that dispersal between the Pacific and Atlantic was climate dependent and occurred both during the Pleistocene prior to the last glacial period and the early Holocene immediately following the opening of the Bering Strait. Genetic diversity in the Atlantic declined over an extended interval that predates the period of intensive commercial whaling, indicating this decline may have been precipitated by Holocene climate or other ecological causes. These first genetic data for Atlantic gray whales, particularly when combined with predictive habitat models for the year 2100, suggest that two recent sightings of gray whales in the Atlantic may represent the beginning of the expansion of this species' habitat beyond its currently realized range.
Climate impacts on transocean dispersal and habitat in gray whales from the Pleistocene to 2100
(2015)
Arctic animals face dramatic habitat alteration due to ongoing climate change. Understanding how such species have responded to past glacial cycles can help us forecast their response to today's changing climate. Gray whales are among those marine species likely to be strongly affected by Arctic climate change, but a thorough analysis of past climate impacts on this species has been complicated by lack of information about an extinct population in the Atlantic. While little is known about the history of Atlantic gray whales or their relationship to the extant Pacific population, the extirpation of the Atlantic population during historical times has been attributed to whaling. We used a combination of ancient and modern DNA, radiocarbon dating and predictive habitat modelling to better understand the distribution of gray whales during the Pleistocene and Holocene. Our results reveal that dispersal between the Pacific and Atlantic was climate dependent and occurred both during the Pleistocene prior to the last glacial period and the early Holocene immediately following the opening of the Bering Strait. Genetic diversity in the Atlantic declined over an extended interval that predates the period of intensive commercial whaling, indicating this decline may have been precipitated by Holocene climate or other ecological causes. These first genetic data for Atlantic gray whales, particularly when combined with predictive habitat models for the year 2100, suggest that two recent sightings of gray whales in the Atlantic may represent the beginning of the expansion of this species' habitat beyond its currently realized range.
Although many large mammal species went extinct at the end of the Pleistocene epoch, their DNA may persist due to past episodes of interspecies admixture. However, direct empirical evidence of the persistence of ancient alleles remains scarce. Here, we present multifold coverage genomic data from four Late Pleistocene cave bears (Ursus spelaeus complex) and show that cave bears hybridized with brown bears (Ursus arctos) during the Pleistocene. We develop an approach to assess both the directionality and relative timing of gene flow. We find that segments of cave bear DNA still persist in the genomes of living brown bears, with cave bears contributing 0.9 to 2.4% of the genomes of all brown bears investigated. Our results show that even though extinction is typically considered as absolute, following admixture, fragments of the gene pool of extinct species can survive for tens of thousands of years in the genomes of extant recipient species.
Consensify
(2020)
A standard practise in palaeogenome analysis is the conversion of mapped short read data into pseudohaploid sequences, frequently by selecting a single high-quality nucleotide at random from the stack of mapped reads. This controls for biases due to differential sequencing coverage, but it does not control for differential rates and types of sequencing error, which are frequently large and variable in datasets obtained from ancient samples. These errors have the potential to distort phylogenetic and population clustering analyses, and to mislead tests of admixture using D statistics. We introduce Consensify, a method for generating pseudohaploid sequences, which controls for biases resulting from differential sequencing coverage while greatly reducing error rates. The error correction is derived directly from the data itself, without the requirement for additional genomic resources or simplifying assumptions such as contemporaneous sampling. For phylogenetic and population clustering analysis, we find that Consensify is less affected by artefacts than methods based on single read sampling. For D statistics, Consensify is more resistant to false positives and appears to be less affected by biases resulting from different laboratory protocols than other frequently used methods. Although Consensify is developed with palaeogenomic data in mind, it is applicable for any low to medium coverage short read datasets. We predict that Consensify will be a useful tool for future studies of palaeogenomes.
Consensify
(2020)
A standard practise in palaeogenome analysis is the conversion of mapped short read data into pseudohaploid sequences, frequently by selecting a single high-quality nucleotide at random from the stack of mapped reads. This controls for biases due to differential sequencing coverage, but it does not control for differential rates and types of sequencing error, which are frequently large and variable in datasets obtained from ancient samples. These errors have the potential to distort phylogenetic and population clustering analyses, and to mislead tests of admixture using D statistics. We introduce Consensify, a method for generating pseudohaploid sequences, which controls for biases resulting from differential sequencing coverage while greatly reducing error rates. The error correction is derived directly from the data itself, without the requirement for additional genomic resources or simplifying assumptions such as contemporaneous sampling. For phylogenetic and population clustering analysis, we find that Consensify is less affected by artefacts than methods based on single read sampling. For D statistics, Consensify is more resistant to false positives and appears to be less affected by biases resulting from different laboratory protocols than other frequently used methods. Although Consensify is developed with palaeogenomic data in mind, it is applicable for any low to medium coverage short read datasets. We predict that Consensify will be a useful tool for future studies of palaeogenomes.
Taxonomy plays a central role in biological sciences. It provides a communication system for scientists as it aims to enable correct identification of the studied organisms. As a consequence, species descriptions should seek to include as much available information as possible at species level to follow an integrative concept of 'taxonomics'. Here, we describe the cryptic species Epimeria frankei sp. nov. from the North Sea, and also redescribe its sister species, Epimeria cornigera. The morphological information obtained is substantiated by DNA barcodes and complete nuclear 18S rRNA gene sequences. In addition, we provide, for the first time, full mitochondrial genome data as part of a metazoan species description for a holotype, as well as the neotype. This study represents the first successful implementation of the recently proposed concept of taxonomics, using data from high-throughput technologies for integrative taxonomic studies, allowing the highest level of confidence for both biodiversity and ecological research.
Taxonomy plays a central role in biological sciences. It provides a communication system for scientists as it aims to enable correct identification of the studied organisms. As a consequence, species descriptions should seek to include as much available information as possible at species level to follow an integrative concept of 'taxonomics'. Here, we describe the cryptic species Epimeria frankei sp. nov. from the North Sea, and also redescribe its sister species, Epimeria cornigera. The morphological information obtained is substantiated by DNA barcodes and complete nuclear 18S rRNA gene sequences. In addition, we provide, for the first time, full mitochondrial genome data as part of a metazoan species description for a holotype, as well as the neotype. This study represents the first successful implementation of the recently proposed concept of taxonomics, using data from high-throughput technologies for integrative taxonomic studies, allowing the highest level of confidence for both biodiversity and ecological research.
Anatomical changes in extinct mammalian lineages over evolutionary time, such as the loss of fingers and teeth and the rapid increase in body size that accompanied the late Miocene dispersal of the progenitors of Steller's sea cows (Hydrodamalis gigas (Zimmermann, 1780)) into North Pacific waters and the convergent development of a thick pelage and accompanying reductions in ear and tail surface area of woolly mammoths (Mammuthus primigenius (Blumenbach, 1799)) and woolly rhinoceros (Coelodonta antiquitatis (Blumenbach, 1799)), are prime examples of adaptive evolution underlying the exploitation of new habitats. It is likely, however, that biochemical specializations adopted during these evolutionary transitions were of similar or even greater biological importance. As these "living" processes do not fossilize, direct information regarding the physiological attributes of extinct species has largely remained beyond the range of scientific inquiry. However, the ability to retrieve genomic sequences from ancient DNA samples, combined with ectopic expression systems, now permit the evolutionary origins and structural and functional properties of authentic prehistoric proteins to be examined in great detail. Exponential technical advances in ancient DNA retrieval, enrichment, and sequencing will soon permit targeted generation of complete genomes from hundreds of extinct species across the last one million years that, in combination with emerging in vitro expression, genome engineering, and cell differentiation techniques, promises to herald an exciting new trajectory of evolutionary research at the interface of biochemistry, genomics, palaeontology, and cell biology.
There is the tendency to assume that endangered species have been both genetically and demographically healthier in the past, so that any genetic erosion observed today was caused by their recent decline. The Iberian lynx (Lynx pardinus) suffered a dramatic and continuous decline during the 20th century, and now shows extremely low genome- and species-wide genetic diversity among other signs of genomic erosion. We analyze ancient (N = 10), historical (N = 245), and contemporary (N = 172) samples with microsatellite and mitogenome data to reconstruct the species' demography and investigate patterns of genetic variation across space and time. Iberian lynx populations transitioned from low but significantly higher genetic diversity than today and shallow geographical differentiation millennia ago, through a structured metapopulation with varying levels of diversity during the last centuries, to two extremely genetically depauperate and differentiated remnant populations by 2002. The historical subpopulations show varying extents of genetic drift in relation to their recent size and time in isolation, but these do not predict whether the populations persisted or went finally extinct. In conclusion, current genetic patterns were mainly shaped by genetic drift, supporting the current admixture of the two genetic pools and calling for a comprehensive genetic management of the ongoing conservation program. This study illustrates how a retrospective analysis of demographic and genetic patterns of endangered species can shed light onto their evolutionary history and this, in turn, can inform conservation actions.
Near the end of the Pleistocene epoch, populations of the woolly mammoth (Mammuthus primigenius) were distributed across parts of three continents, from western Europe and northern Asia through Beringia to the Atlantic seaboard of North America. Nonetheless, questions about the connectivity and temporal continuity of mammoth populations and species remain unanswered. We use a combination of targeted enrichment and high-throughput sequencing to assemble and interpret a data set of 143 mammoth mitochondrial genomes, sampled from fossils recovered from across their Holarctic range. Our dataset includes 54 previously unpublished mitochondrial genomes and significantly increases the coverage of the Eurasian range of the species. The resulting global phylogeny confirms that the Late Pleistocene mammoth population comprised three distinct mitochondrial lineages that began to diverge ~1.0–2.0 million years ago (Ma). We also find that mammoth mitochondrial lineages were strongly geographically partitioned throughout the Pleistocene. In combination, our genetic results and the pattern of morphological variation in time and space suggest that male-mediated gene flow, rather than large-scale dispersals, was important in the Pleistocene evolutionary history of mammoths.
Domestic Bactrian camel (Camelus bactrianus) used to be one of the most important livestock species in Chinese history, as well as the major transport carrier on the ancient Silk Road. However, archeological studies on Chinese C. bactrianus are still limited, and molecular biology research on this species is mainly focused on modern specimens. In this study, we retrieved the complete mitochondrial genome from a C. bactrianus specimen, which was excavated from northwestern China and dated at 1290-1180 cal. Phylogenetic analyses using 18 mitochondrial genomes indicated that the C. bactrianus clade was divided into two maternal lineages. The majority of samples originating from Iran to Japan and Mongolia belong to subclade A1, while our sample together with two Mongolian individuals formed the much smaller subclade A2. Furthermore, the divergence time of these two maternal lineages was estimated as 165 Kya (95% credibility interval 117-222 Kya), this might indicate that several different evolutionary lineages were incorporated into the domestic gene pool during the initial domestication process. Bayesian skyline plot (BSP) analysis a slow increase in female effective population size of C. bactrianus from 5000 years ago, which to the beginning of domestication of C. bactrianus. The present study also revealed that there were extensive exchanges of genetic information among C. bactrianus populations in regions along the Silk Road.
Comparing mitogenomic timetrees for two African savannah primate genera (Chlorocebus and Papio)
(2020)
Comparing mitogenomic timetrees for two African savannah primate genera (Chlorocebus and Papio)
(2017)
Complete mitochondrial (mtDNA) genomes have proved to be useful in reconstructing primate phylogenies with higher resolution and confidence compared to reconstructions based on partial mtDNA sequences. Here, we analyse complete mtDNA genomes of African green monkeys (genus Chlorocebus), a widely distributed primate genus in Africa representing an interesting phylogeographical model for the evolution of savannah species. Previous studies on partial mtDNA sequences revealed nine major clades, suggesting several cases of para- and polyphyly among Chlorocebus species. However, in these studies, phylogenetic relationships among several clades were not resolved, and divergence times were not estimated. We analysed complete mtDNA genomes for ten Chlorocebus samples representing major mtDNA clades to find stronger statistical support in the phylogenetic reconstruction than in the previous studies and to estimate divergence times. Our results confirmed para- and polyphyletic relationships of most Chlorocebus species, while the support for the phylogenetic relationships between the mtDNA clades increased compared to the previous studies. Our results indicate an initial west-east division in the northern part of the Chlorocebus range with subsequent divergence into north-eastern and southern clades. This phylogeographic scenario contrasts with that for another widespread African savannah primate genus, the baboons (Papio), for which a dispersal from southern Africa into East and West Africa was suggested.
Faunal remains from Palaeolithic sites are important genetic sources to study preglacial and postglacial populations and to investigate the effect of climate change and human impact. Post mortem decay, resulting in fragmented and chemically modified DNA, is a key obstacle in ancient DNA analyses. In the absence of reliable methods to determine the presence of endogenous DNA in sub-fossil samples, temporal and spatial surveys of DNA survival on a regional scale may help to estimate the potential of faunal remains from a given time period and region. We therefore investigated PCR amplification success, PCR performance and post mortem damage in c. 47,000 to c. 12,000-year-old horse remains from 14 Palaeolithic sites along the Swiss Jura Mountains in relation to depositional context, tissue type, storage time and age, potentially influencing DNA preservation. The targeted 75 base pair mitochondrial DNA fragment could be amplified solely from equid remains from caves and not from any of the open dry and (temporary) wetland sites. Whether teeth are better than bones cannot be ultimately decided; however, both storage time after excavation and age significantly affect PCR amplification and performance, albeit not in a linear way. This is best explained by the-inevitable-heterogeneity of the data set. The extent of post mortem damage is not related to any of the potential impact factors. The results encourage comprehensive investigations of Palaeolithic cave sites, even from temperate regions.
Faunal remains from Palaeolithic sites are important genetic sources to study preglacial and postglacial populations and to investigate the effect of climate change and human impact. Post mortem decay, resulting in fragmented and chemically modified DNA, is a key obstacle in ancient DNA analyses. In the absence of reliable methods to determine the presence of endogenous DNA in sub-fossil samples, temporal and spatial surveys of DNA survival on a regional scale may help to estimate the potential of faunal remains from a given time period and region. We therefore investigated PCR amplification success, PCR performance and post mortem damage in c. 47,000 to c. 12,000-year-old horse remains from 14 Palaeolithic sites along the Swiss Jura Mountains in relation to depositional context, tissue type, storage time and age, potentially influencing DNA preservation. The targeted 75 base pair mitochondrial DNA fragment could be amplified solely from equid remains from caves and not from any of the open dry and (temporary) wetland sites. Whether teeth are better than bones cannot be ultimately decided; however, both storage time after excavation and age significantly affect PCR amplification and performance, albeit not in a linear way. This is best explained by the—inevitable—heterogeneity of the data set. The extent of post mortem damage is not related to any of the potential impact factors. The results encourage comprehensive investigations of Palaeolithic cave sites, even from temperate regions.
Horse domestication revolutionized warfare and accelerated travel, trade, and the geographic expansion of languages. Here, we present the largest DNA time series for a non-human organism to date, including genome-scale data from 149 ancient animals and 129 ancient genomes (>= 1-fold coverage), 87 of which are new. This extensive dataset allows us to assess the modem legacy of past equestrian civilisations. We find that two extinct horse lineages existed during early domestication, one at the far western (Iberia) and the other at the far eastern range (Siberia) of Eurasia. None of these contributed significantly to modern diversity. We show that the influence of Persian-related horse lineages increased following the Islamic conquests in Europe and Asia. Multiple alleles associated with elite-racing, including at the MSTN "speed gene," only rose in popularity within the last millennium. Finally, the development of modem breeding impacted genetic diversity more dramatically than the previous millennia of human management.
The complete mitochondrial genome of the common vole, Microtus arvalis (Rodentia: Arvicolinae)
(2018)
The common vole, Microtus arvalis belongs to the genus Microtus in the subfamily Arvicolinae. In this study, the complete mitochondrial genome of M. arvalis was recovered using shotgun sequencing and an iterative mapping approach using three related species. Phylogenetic analyses using the sequence of 21 arvicoline species place the common vole as a sister species to the East European vole (Microtus levis), but as opposed to previous results we find no support for the recognition of the genus Neodon within the subfamily Arvicolinae, as this is, as well as the genus Lasiopodomys, found within the Microtus genus.
The complete mitochondrial genome of the common vole, Microtus arvalis (Rodentia: Arvicolinae)
(2018)
The common vole, Microtus arvalis belongs to the genus Microtus in the subfamily Arvicolinae. In this study, the complete mitochondrial genome of M. arvalis was recovered using shotgun sequencing and an iterative mapping approach using three related species. Phylogenetic analyses using the sequence of 21 arvicoline species place the common vole as a sister species to the East European vole (Microtus levis), but as opposed to previous results we find no support for the recognition of the genus Neodon within the subfamily Arvicolinae, as this is, as well as the genus Lasiopodomys, found within the Microtus genus.
Targeted capture coupled with high-throughput sequencing can be used to gain information about nuclear sequence variation at hundreds to thousands of loci. Divergent reference capture makes use of molecular data of one species to enrich target loci in other (related) species. This is particularly valuable for nonmodel organisms, for which often no a priori knowledge exists regarding these loci. Here, we have used targeted capture to obtain data for 809 nuclear coding DNA sequences (CDS) in a nonmodel organism, the Eurasian lynx Lynx lynx, using baits designed with the help of the published genome of a related model organism (the domestic cat Felis catus). Using this approach, we were able to survey intraspecific variation at hundreds of nuclear loci in L. lynx across the species’ European range. A large set of biallelic candidate SNPs was then evaluated using a high-throughput SNP genotyping platform (Fluidigm), which we then reduced to a final 96 SNP-panel based on assay performance and reliability; validation was carried out with 100 additional Eurasian lynx samples not included in the SNP discovery phase. The 96 SNP-panel developed from CDS performed very successfully in the identification of individuals and in population genetic structure inference (including the assignment of individuals to their source population). In keeping with recent studies, our results show that genic SNPs can be valuable for genetic monitoring of wildlife species.
The agricultural transition profoundly changed human societies. We sequenced and analysed the first genome (1.39x) of an early Neolithic woman from Ganj Dareh, in the Zagros Mountains of Iran, a site with early evidence for an economy based on goat herding, ca. 10,000 BP. We show that Western Iran was inhabited by a population genetically most similar to hunter-gatherers from the Caucasus, but distinct from the Neolithic Anatolian people who later brought food production into Europe. The inhabitants of Ganj Dareh made little direct genetic contribution to modern European populations, suggesting those of the Central Zagros were somewhat isolated from other populations of the Fertile Crescent. Runs of homozygosity are of a similar length to those from Neolithic farmers, and shorter than those of Caucasus and Western Hunter-Gatherers, suggesting that the inhabitants of Ganj Dareh did not undergo the large population bottleneck suffered by their northern neighbours. While some degree of cultural diffusion between Anatolia, Western Iran and other neighbouring regions is possible, the genetic dissimilarity between early Anatolian farmers and the inhabitants of Ganj Dareh supports a model in which Neolithic societies in these areas were distinct.
The Great Hungarian Plain was a crossroads of cultural transformations that have shaped European prehistory. Here we analyse a 5,000-year transect of human genomes, sampled from petrous bones giving consistently excellent endogenous DNA yields, from 13 Hungarian Neolithic, Copper, Bronze and Iron Age burials including two to high (similar to 22x) and seven to similar to 1x coverage, to investigate the impact of these on Europe's genetic landscape. These data suggest genomic shifts with the advent of the Neolithic, Bronze and Iron Ages, with interleaved periods of genome stability. The earliest Neolithic context genome shows a European hunter-gatherer genetic signature and a restricted ancestral population size, suggesting direct contact between cultures after the arrival of the first farmers into Europe. The latest, Iron Age, sample reveals an eastern genomic influence concordant with introduced Steppe burial rites. We observe transition towards lighter pigmentation and surprisingly, no Neolithic presence of lactase persistence.
The Great Hungarian Plain was a crossroads of cultural transformations that have shaped European prehistory. Here we analyse a 5,000-year transect of human genomes, sampled from petrous bones giving consistently excellent endogenous DNA yields, from 13 Hungarian Neolithic, Copper, Bronze and Iron Age burials including two to high (similar to 22x) and seven to similar to 1x coverage, to investigate the impact of these on Europe's genetic landscape. These data suggest genomic shifts with the advent of the Neolithic, Bronze and Iron Ages, with interleaved periods of genome stability. The earliest Neolithic context genome shows a European hunter-gatherer genetic signature and a restricted ancestral population size, suggesting direct contact between cultures after the arrival of the first farmers into Europe. The latest, Iron Age, sample reveals an eastern genomic influence concordant with introduced Steppe burial rites. We observe transition towards lighter pigmentation and surprisingly, no Neolithic presence of lactase persistence.
Inactivation of thermogenic UCP1 as a historical contingency in multiple placental mammal clades
(2017)
This is a reply to the comments of Morey (2014) on our identification of Palaeolithic dogs from several European Palaeolithic sites. In his comments Morey (2014) presents some misrepresentations and misunderstandings that we remedy here. In contrast to what Morey (2014) propounds, our results suggest that the domestication of the wolf was a long process that started early in the Upper Palaeolithic and that since that time two sympatric canid morphotypes can be seen in Eurasian sites: Pleistocene wolves and Palaeolithic dogs. Contrary to Morey (2014), we are convinced that the study of this domestication process should be multidisciplinary. (C) 2014 Elsevier Ltd. All rights reserved.
Being at the western fringe of Europe, Iberia had a peculiar prehistory and a complex pattern of Neolithization. A few studies, all based on modern populations, reported the presence of DNA of likely African origin in this region, generally concluding it was the result of recent gene flow, probably during the Islamic period. Here, we provide evidence of much older gene flow from Africa to Iberia by sequencing whole genomes from four human remains from northern Portugal and southern Spain dated around 4000 years BP (from the Middle Neolithic to the Bronze Age). We found one of them to carry an unequivocal sub-Saharan mitogenome of most probably West or West-Central African origin, to our knowledge never reported before in prehistoric remains outside Africa. Our analyses of ancient nuclear genomes show small but significant levels of sub-Saharan African affinity in several ancient Iberian samples, which indicates that what we detected was not an occasional individual phenomenon, but an admixture event recognizable at the population level. We interpret this result as evidence of an early migration process from Africa into the Iberian Peninsula through a western route, possibly across the Strait of Gibraltar.
The transition from hunting and gathering to farming involved profound cultural and technological changes. In Western and Central Europe, these changes occurred rapidly and synchronously after the arrival of early farmers of Anatolian origin [1-3], who largely replaced the local Mesolithic hunter-gatherers [1, 4-6]. Further east, in the Baltic region, the transition was gradual, with little or no genetic input from incoming farmers [7]. Here we use ancient DNA to investigate the relationship between hunter-gatherers and farmers in the Lower Danube basin, a geographically intermediate area that is characterized by a rapid Neolithic transition but also by the presence of archaeological evidence that points to cultural exchange, and thus possible admixture, between hunter-gatherers and farmers. We recovered four human paleogenomes (1.13 to 4.13 coverage) from Romania spanning a time transect between 8.8 thousand years ago (kya) and 5.4 kya and supplemented them with two Mesolithic genomes (1.73- and 5.33) from Spain to provide further context on the genetic background of Mesolithic Europe. Our results show major Western hunter-gatherer (WHG) ancestry in a Romanian Eneolithic sample with a minor, but sizeable, contribution from Anatolian farmers, suggesting multiple admixture events between hunter-gatherers and farmers. Dietary stableisotope analysis of this sample suggests a mixed terrestrial/ aquatic diet. Our results provide support for complex interactions among hunter-gatherers and farmers in the Danube basin, demonstrating that in some regions, demic and cultural diffusion were not mutually exclusive, but merely the ends of a continuum for the process of Neolithization.
The transition from hunting and gathering to farming involved profound cultural and technological changes. In Western and Central Europe, these changes occurred rapidly and synchronously after the arrival of early farmers of Anatolian origin [1-3], who largely replaced the local Mesolithic hunter-gatherers [1, 4-6]. Further east, in the Baltic region, the transition was gradual, with little or no genetic input from incoming farmers [7]. Here we use ancient DNA to investigate the relationship between hunter-gatherers and farmers in the Lower Danube basin, a geographically intermediate area that is characterized by a rapid Neolithic transition but also by the presence of archaeological evidence that points to cultural exchange, and thus possible admixture, between hunter-gatherers and farmers. We recovered four human paleogenomes (1.13 to 4.13 coverage) from Romania spanning a time transect between 8.8 thousand years ago (kya) and 5.4 kya and supplemented them with two Mesolithic genomes (1.73- and 5.33) from Spain to provide further context on the genetic background of Mesolithic Europe. Our results show major Western hunter-gatherer (WHG) ancestry in a Romanian Eneolithic sample with a minor, but sizeable, contribution from Anatolian farmers, suggesting multiple admixture events between hunter-gatherers and farmers. Dietary stableisotope analysis of this sample suggests a mixed terrestrial/ aquatic diet. Our results provide support for complex interactions among hunter-gatherers and farmers in the Danube basin, demonstrating that in some regions, demic and cultural diffusion were not mutually exclusive, but merely the ends of a continuum for the process of Neolithization.
Ancient DNA studies have revolutionized the study of extinct species and populations, providing insights on phylogeny, phylogeography, admixture and demographic history. However, inferences on behaviour and sociality have been far less frequent. Here, we investigate the complete mitochondrial genomes of extinct Late Pleistocene cave bears and middle Holocene brown bears that each inhabited multiple geographically proximate caves in northern Spain. In cave bears, we find that, although most caves were occupied simultaneously, each cave almost exclusively contains a unique lineage of closely related haplotypes. This remarkable pattern suggests extreme fidelity to their birth site in cave bears, best described as homing behaviour, and that cave bears formed stable maternal social groups at least for hibernation. In contrast, brown bears do not show any strong association of mitochondrial lineage and cave, suggesting that these two closely related species differed in aspects of their behaviour and sociality. This difference is likely to have contributed to cave bear extinction, which occurred at a time in which competition for caves between bears and humans was likely intense and the ability to rapidly colonize new hibernation sites would have been crucial for the survival of a species so dependent on caves for hibernation as cave bears. Our study demonstrates the potential of ancient DNA to uncover patterns of behaviour and sociality in ancient species and populations, even those that went extinct many tens of thousands of years ago.
Background
Contiguous genome assemblies are a highly valued biological resource because of the higher number of completely annotated genes and genomic elements that are usable compared to fragmented draft genomes. Nonetheless, contiguity is difficult to obtain if only low coverage data and/or only distantly related reference genome assemblies are available.
Findings
In order to improve genome contiguity, we have developed Cross-Species Scaffolding—a new pipeline that imports long-range distance information directly into the de novo assembly process by constructing mate-pair libraries in silico.
Conclusions
We show how genome assembly metrics and gene prediction dramatically improve with our pipeline by assembling two primate genomes solely based on ∼30x coverage of shotgun sequencing data.
Background
Contiguous genome assemblies are a highly valued biological resource because of the higher number of completely annotated genes and genomic elements that are usable compared to fragmented draft genomes. Nonetheless, contiguity is difficult to obtain if only low coverage data and/or only distantly related reference genome assemblies are available.
Findings
In order to improve genome contiguity, we have developed Cross-Species Scaffolding—a new pipeline that imports long-range distance information directly into the de novo assembly process by constructing mate-pair libraries in silico.
Conclusions
We show how genome assembly metrics and gene prediction dramatically improve with our pipeline by assembling two primate genomes solely based on ∼30x coverage of shotgun sequencing data.
Domestic cattle were brought to Spain by early settlers and agricultural societies. Due to missing Neolithic sites in the Spanish region of Galicia, very little is known about this process in this region. We sampled 18 cattle subfossils from different ages and different mountain caves in Galicia, of which 11 were subject to sequencing of the mitochondrial genome and phylogenetic analysis, to provide insight into the introduction of cattle to this region. We detected high similarity between samples from different time periods and were able to compare the time frame of the first domesticated cattle in Galicia to data from the connecting region of Cantabria to show a plausible connection between the Neolithization of these two regions. Our data shows a close relationship of the early domesticated cattle of Galicia and modern cow breeds and gives a general insight into cattle phylogeny. We conclude that settlers migrated to this region of Spain from Europe and introduced common European breeds to Galicia.
Domestic cattle were brought to Spain by early settlers and agricultural societies. Due to missing Neolithic sites in the Spanish region of Galicia, very little is known about this process in this region. We sampled 18 cattle subfossils from different ages and different mountain caves in Galicia, of which 11 were subject to sequencing of the mitochondrial genome and phylogenetic analysis, to provide insight into the introduction of cattle to this region. We detected high similarity between samples from different time periods and were able to compare the time frame of the first domesticated cattle in Galicia to data from the connecting region of Cantabria to show a plausible connection between the Neolithization of these two regions. Our data shows a close relationship of the early domesticated cattle of Galicia and modern cow breeds and gives a general insight into cattle phylogeny. We conclude that settlers migrated to this region of Spain from Europe and introduced common European breeds to Galicia.
Objective
Plant carnivory is distributed across the tree of life and has evolved at least six times independently, but sequenced and annotated nuclear genomes of carnivorous plants are currently lacking. We have sequenced and structurally annotated the nuclear genome of the carnivorous Roridula gorgonias and that of a non-carnivorous relative, Madeira’s lily-of-the-valley-tree, Clethra arborea, both within the Ericales. This data adds an important resource to study the evolutionary genetics of plant carnivory across angiosperm lineages and also for functional and systematic aspects of plants within the Ericales.
Results
Our assemblies have total lengths of 284 Mbp (R. gorgonias) and 511 Mbp (C. arborea) and show high BUSCO scores of 84.2% and 89.5%, respectively. We used their predicted genes together with publicly available data from other Ericales’ genomes and transcriptomes to assemble a phylogenomic data set for the inference of a species tree. However, groups of orthologs showed a marked absence of species represented by a transcriptome. We discuss possible reasons and caution against combining predicted genes from genome- and transriptome-based assemblies.
Objective
Plant carnivory is distributed across the tree of life and has evolved at least six times independently, but sequenced and annotated nuclear genomes of carnivorous plants are currently lacking. We have sequenced and structurally annotated the nuclear genome of the carnivorous Roridula gorgonias and that of a non-carnivorous relative, Madeira’s lily-of-the-valley-tree, Clethra arborea, both within the Ericales. This data adds an important resource to study the evolutionary genetics of plant carnivory across angiosperm lineages and also for functional and systematic aspects of plants within the Ericales.
Results
Our assemblies have total lengths of 284 Mbp (R. gorgonias) and 511 Mbp (C. arborea) and show high BUSCO scores of 84.2% and 89.5%, respectively. We used their predicted genes together with publicly available data from other Ericales’ genomes and transcriptomes to assemble a phylogenomic data set for the inference of a species tree. However, groups of orthologs showed a marked absence of species represented by a transcriptome. We discuss possible reasons and caution against combining predicted genes from genome- and transriptome-based assemblies.
The radula is the central foraging organ and apomorphy of the Mollusca. However, in contrast to other innovations, including the mollusk shell, genetic underpinnings of radula formation remain virtually unknown. Here, we present the first radula formative tissue transcriptome using the viviparous freshwater snail Tylomelania sarasinorum and compare it to foot tissue and the shell-building mantle of the same species. We combine differential expression, functional enrichment, and phylostratigraphic analyses to identify both specific and shared genetic underpinnings of the three tissues as well as their dominant functions and evolutionary origins. Gene expression of radula formative tissue is very distinct, but nevertheless more similar to mantle than to foot. Generally, the genetic bases of both radula and shell formation were shaped by novel orchestration of preexisting genes and continuous evolution of novel genes. A significantly increased proportion of radula-specific genes originated since the origin of stem-mollusks, indicating that novel genes were especially important for radula evolution. Genes with radula-specific expression in our study are frequently also expressed during the formation of other lophotrochozoan hard structures, like chaetae (hes1, arx), spicules (gbx), and shells of mollusks (gbx, heph) and brachiopods (heph), suggesting gene co-option for hard structure formation. Finally, a Lophotrochozoa-specific chitin synthase with a myosin motor domain (CS-MD), which is expressed during mollusk and brachiopod shell formation, had radula-specific expression in our study. CS-MD potentially facilitated the construction of complex chitinous structures and points at the potential of molecular novelties to promote the evolution of different morphological innovations.
(1) Background:
Adaptive diversification of complex traits plays a pivotal role in the evolution of organismal diversity. In the freshwater snail genus Tylomelania, adaptive radiations were likely promoted by trophic specialization via diversification of their key foraging organ, the radula.
(2) Methods:
To investigate the molecular basis of radula diversification and its contribution to lineage divergence, we used tissue-specific transcriptomes of two sympatric Tylomelania sarasinorum ecomorphs.
(3) Results:
We show that ecomorphs are genetically divergent lineages with habitat-correlated abundances. Sequence divergence and the proportion of highly differentially expressed genes are significantly higher between radula transcriptomes compared to the mantle and foot. However, the same is not true when all differentially expressed genes or only non-synonymous SNPs are considered. Finally, putative homologs of some candidate genes for radula diversification (hh, arx, gbb) were also found to contribute to trophic specialization in cichlids and Darwin's finches.
(4) Conclusions:
Our results are in line with diversifying selection on the radula driving Tylomelania ecomorph divergence and indicate that some molecular pathways may be especially prone to adaptive diversification, even across phylogenetically distant animal groups.
Obtaining information about functional details of proteins of extinct species is of critical importance for a better understanding of the real-life appearance, behavior and ecology of these lost entries in the book of life. In this chapter, we discuss the possibilities to retrieve the necessary DNA sequence information from paleogenomic data obtained from fossil specimens, which can then be used to express and subsequently analyze the protein of interest. We discuss the problems specific to ancient DNA, including mis-coding lesions, short read length and incomplete paleogenome assemblies. Finally, we discuss an alternative, but currently rarely used approach, direct PCR amplification, which is especially useful for comparatively short proteins.
The future of ancient DNA
(2015)
Technological innovations such as next generation sequencing and DNA hybridisation enrichment have resulted in multi-fold increases in both the quantity of ancient DNA sequence data and the time depth for DNA retrieval. To date, over 30 ancient genomes have been sequenced, moving from 0.7x coverage (mammoth) in 2008 to more than 50x coverage (Neanderthal) in 2014. Studies of rapid evolutionary changes, such as the evolution and spread of pathogens and the genetic responses of hosts, or the genetics of domestication and climatic adaptation, are developing swiftly and the importance of palaeogenomics for investigating evolutionary processes during the last million years is likely to increase considerably. However, these new datasets require new methods of data processing and analysis, as well as conceptual changes in interpreting the results. In this review we highlight important areas of future technical and conceptual progress and discuss research topics in the rapidly growing field of palaeogenomics.
Technological innovations such as next generation sequencing and DNA hybridisation enrichment have resulted in multi-fold increases in both the quantity of ancient DNA sequence data and the time depth for DNA retrieval. To date, over 30 ancient genomes have been sequenced, moving from 0.7x coverage (mammoth) in 2008 to more than 50x coverage (Neanderthal) in 2014. Studies of rapid evolutionary changes, such as the evolution and spread of pathogens and the genetic responses of hosts, or the genetics of domestication and climatic adaptation, are developing swiftly and the importance of palaeogenomics for investigating evolutionary processes during the last million years is likely to increase considerably. However, these new datasets require new methods of data processing and analysis, as well as conceptual changes in interpreting the results. In this review we highlight important areas of future technical and conceptual progress and discuss research topics in the rapidly growing field of palaeogenomics.
Ancient mitochondrial DNA and the genetic history of Eurasian beaver (Castor fiber) in Europe
(2014)
After centuries of human hunting, the Eurasian beaver Castor fiber had disappeared from most of its original range by the end of the 19th century. The surviving relict populations are characterized by both low genetic diversity and strong phylogeographical structure. However, it remains unclear whether these attributes are the result of a human-induced, late Holocene bottleneck or already existed prior to this reduction in range. To investigate genetic diversity in Eurasian beaver populations during the Holocene, we obtained mitochondrial control region DNA sequences from 48 ancient beaver samples and added 152 modern sequences from GenBank. Phylogeographical analyses of the data indicate a differentiation of European beaver populations into three mitochondrial clades. The two main clades occur in western and eastern Europe, respectively, with an early Holocene contact zone in eastern Europe near a present-day contact zone. A divergent and previously unknown clade of beavers from the Danube Basin survived until at least 6000years ago, but went extinct during the transition to modern times. Finally, we identify a recent decline in effective population size of Eurasian beavers, with a stronger bottleneck signal in the western than in the eastern clade. Our results suggest that the low genetic diversity and the strong phylogeographical structure in recent beavers are artefacts of human hunting-associated population reductions. While beaver populations have been growing rapidly since the late 19th century, genetic diversity within modern beaver populations remains considerably reduced compared to what was present prior to the period of human hunting and habitat reduction.
Background
The forelimb-specific gene tbx5 is highly conserved and essential for the development of forelimbs in zebrafish, mice, and humans. Amongst birds, a single order, Dinornithiformes, comprising the extinct wingless moa of New Zealand, are unique in having no skeletal evidence of forelimb-like structures.
Results
To determine the sequence of tbx5 in moa, we used a range of PCR-based techniques on ancient DNA to retrieve all nine tbx5 exons and splice sites from the giant moa, Dinornis. Moa Tbx5 is identical to chicken Tbx5 in being able to activate the downstream promotors of fgf10 and ANF. In addition we show that missexpression of moa tbx5 in the hindlimb of chicken embryos results in the formation of forelimb features, suggesting that Tbx5 was fully functional in wingless moa. An alternatively spliced exon 1 for tbx5 that is expressed specifically in the forelimb region was shown to be almost identical between moa and ostrich, suggesting that, as well as being fully functional, tbx5 is likely to have been expressed normally in moa since divergence from their flighted ancestors, approximately 60 mya.
Conclusions
The results suggests that, as in mice, moa tbx5 is necessary for the induction of forelimbs, but is not sufficient for their outgrowth. Moa Tbx5 may have played an important role in the development of moa’s remnant forelimb girdle, and may be required for the formation of this structure. Our results further show that genetic changes affecting genes other than tbx5 must be responsible for the complete loss of forelimbs in moa.