570 Biowissenschaften; Biologie
Refine
Has Fulltext
- no (12) (remove)
Document Type
- Article (10)
- Doctoral Thesis (2)
Language
- English (12)
Is part of the Bibliography
- yes (12)
Keywords
- ancient DNA (12) (remove)
Institute
An ‛Aukward’ tale
(2017)
One hundred and seventy-three years ago, the last two Great Auks, Pinguinus impennis, ever reliably seen were killed. Their internal organs can be found in the collections of the Natural History Museum of Denmark, but the location of their skins has remained a mystery. In 1999, Great Auk expert Errol Fuller proposed a list of five potential candidate skins in museums around the world. Here we take a palaeogenomic approach to test which—if any—of Fuller’s candidate skins likely belong to either of the two birds. Using mitochondrial genomes from the five candidate birds (housed in museums in Bremen, Brussels, Kiel, Los Angeles, and Oldenburg) and the organs of the last two known individuals, we partially solve the mystery that has been on Great Auk scholars’ minds for generations and make new suggestions as to the whereabouts of the still-missing skin from these two birds.
The performance of hybridization capture combined with next-generation sequencing (NGS) has seen limited investigation with samples from hot and arid regions until now. We applied hybridization capture and shotgun sequencing to recover DNA sequences from bone specimens of ancient-domestic dromedary (Camelus dromedarius) and its extinct ancestor, the wild dromedary from Jordan, Syria, Turkey and the Arabian Peninsula, respectively. Our results show that hybridization capture increased the percentage of mitochondrial DNA (mtDNA) recovery by an average 187-fold and in some cases yielded virtually complete mitochondrial (mt) genomes at multifold coverage in a single capture experiment. Furthermore, we tested the effect of hybridization temperature and time by using a touchdown approach on a limited number of samples. We observed no significant difference in the number of unique dromedary mtDNA reads retrieved with the standard capture compared to the touchdown method. In total, we obtained 14 partial mitochondrial genomes from ancient-domestic dromedaries with 17-95% length coverage and 1.27-47.1-fold read depths for the covered regions. Using whole-genome shotgun sequencing, we successfully recovered endogenous dromedary nuclear DNA (nuDNA) from domestic and wild dromedary specimens with 1-1.06-fold read depths for covered regions. Our results highlight that despite recent methodological advances, obtaining ancient DNA (aDNA) from specimens recovered from hot, arid environments is still problematic. Hybridization protocols require specific optimization, and samples at the limit of DNA preservation need multiple replications of DNA extraction and hybridization capture as has been shown previously for Middle Pleistocene specimens.
Consensify
(2020)
A standard practise in palaeogenome analysis is the conversion of mapped short read data into pseudohaploid sequences, frequently by selecting a single high-quality nucleotide at random from the stack of mapped reads. This controls for biases due to differential sequencing coverage, but it does not control for differential rates and types of sequencing error, which are frequently large and variable in datasets obtained from ancient samples. These errors have the potential to distort phylogenetic and population clustering analyses, and to mislead tests of admixture using D statistics. We introduce Consensify, a method for generating pseudohaploid sequences, which controls for biases resulting from differential sequencing coverage while greatly reducing error rates. The error correction is derived directly from the data itself, without the requirement for additional genomic resources or simplifying assumptions such as contemporaneous sampling. For phylogenetic and population clustering analysis, we find that Consensify is less affected by artefacts than methods based on single read sampling. For D statistics, Consensify is more resistant to false positives and appears to be less affected by biases resulting from different laboratory protocols than other frequently used methods. Although Consensify is developed with palaeogenomic data in mind, it is applicable for any low to medium coverage short read datasets. We predict that Consensify will be a useful tool for future studies of palaeogenomes.
Domestic Bactrian camel (Camelus bactrianus) used to be one of the most important livestock species in Chinese history, as well as the major transport carrier on the ancient Silk Road. However, archeological studies on Chinese C. bactrianus are still limited, and molecular biology research on this species is mainly focused on modern specimens. In this study, we retrieved the complete mitochondrial genome from a C. bactrianus specimen, which was excavated from northwestern China and dated at 1290-1180 cal. Phylogenetic analyses using 18 mitochondrial genomes indicated that the C. bactrianus clade was divided into two maternal lineages. The majority of samples originating from Iran to Japan and Mongolia belong to subclade A1, while our sample together with two Mongolian individuals formed the much smaller subclade A2. Furthermore, the divergence time of these two maternal lineages was estimated as 165 Kya (95% credibility interval 117-222 Kya), this might indicate that several different evolutionary lineages were incorporated into the domestic gene pool during the initial domestication process. Bayesian skyline plot (BSP) analysis a slow increase in female effective population size of C. bactrianus from 5000 years ago, which to the beginning of domestication of C. bactrianus. The present study also revealed that there were extensive exchanges of genetic information among C. bactrianus populations in regions along the Silk Road.
As limits on O2 availability during submergence impose severe constraints on aerobic respiration, the oxygen binding globin proteins of marine mammals are expected to have evolved under strong evolutionary pressures during their land-to-sea transition. Here, we address this question for the order Sirenia by retrieving, annotating, and performing detailed selection analyses on the globin repertoire of the extinct Steller’s sea cow (Hydrodamalis gigas), dugong (Dugong dugon), and Florida manatee (Trichechus manatus latirostris) in relation to their closest living terrestrial relatives (elephants and hyraxes). These analyses indicate most loci experienced elevated nucleotide substitution rates during their transition to a fully aquatic lifestyle. While most of these genes evolved under neutrality or strong purifying selection, the rate of nonsynonymous/synonymous replacements increased in two genes (Hbz-T1 and Hba-T1) that encode the α-type chains of hemoglobin (Hb) during each stage of life. Notably, the relaxed evolution of Hba-T1 is temporally coupled with the emergence of a chimeric pseudogene (Hba-T2/Hbq-ps) that contributed to the tandemly linked Hba-T1 of stem sirenians via interparalog gene conversion. Functional tests on recombinant Hb proteins from extant and ancestral sirenians further revealed that the molecular remodeling of Hba-T1 coincided with increased Hb–O2 affinity in early sirenians. Available evidence suggests that this trait evolved to maximize O2 extraction from finite lung stores and suppress tissue O2 offloading, thereby facilitating the low metabolic intensities of extant sirenians. In contrast, the derived reduction in Hb–O2 affinity in (sub)Arctic Steller’s sea cows is consistent with fueling increased thermogenesis by these once colossal marine herbivores.
Mitochondrial genomes of Late Pleistocene caballine horses from China belong to a separate clade
(2020)
There were several species of Equus in northern China during the Late Pleistocene, including Equus przewalskii and Equus dalianensis. A number of morphological studies have been carried out on E. przewalskii and E. dalianensis, but their evolutionary history is still unresolved. In this study, we retrieved near-complete mitochondrial genomes from E. dalianensis and E. przewalskii specimens excavated from Late Pleistocene strata in northeastern China. Phylogenetic analyses revealed that caballoid horses were divided into two subclades: the New World and the Old World caballine horse subclades. The Old World caballine horses comprise of two deep phylogenetic lineages, with modern and ancient Equus caballus and modern E. przewalskii forming lineage I, and the individuals in this study together with one Yakut specimen forming lineage II. Our results indicate that Chinese Late Pleistocene caballoid horses showed a closer relationship to other Eurasian caballine horses than that to Pleistocene horses from North America. In addition, phylogenetic analyses suggested a close relationship between E. dalianensis and the Chinese fossil E. przewalskii, in agreement with previous researches based on morphological analyses. Interestingly, E. dalianensis and the fossil E. przewalskii were intermixed rather than split into distinct lineages, suggesting either that gene flow existed between these two species or that morphology-based species assignment of palaeontological specimens is not always correct. Moreover, Bayesian analysis showed that the divergence time between the New World and the Old World caballoid horses was at 1.02 Ma (95% CI: 0.86-1.24 Ma), and the two Old World lineages (I & II) split at 0.88 Ma (95% CI: 0.69-1.13 Ma), which indicates that caballoid horses seem to have evolved into different populations in the Old World soon after they migrated from North America via the Bering Land Bridge. Finally, the TMRCA of E. dalianensis was estimated at 0.20 Ma (95% CI: 0.15-0.28 Ma), and it showed a relative low genetic diversity compared with other Equus species.
The prevalence of contaminant microbial DNA in ancient bone samples represents the principal limiting factor for palaeogenomic studies, as it may comprise more than 99% of DNA molecules obtained. Efforts to exclude or reduce this contaminant fraction have been numerous but also variable in their success. Here, we present a simple but highly effective method to increase the relative proportion of endogenous molecules obtained from ancient bones. Using computed tomography (CT) scanning, we identify the densest region of a bone as optimal for sampling. This approach accurately identifies the densest internal regions of petrous bones, which are known to be a source of high-purity ancient DNA. For ancient long bones, CT scans reveal a high-density outermost layer, which has been routinely removed and discarded prior to DNA extraction. For almost all long bones investigated, we find that targeted sampling of this outermost layer provides an increase in endogenous DNA content over that obtained from softer, trabecular bone. This targeted sampling can produce as much as 50-fold increase in the proportion of endogenous DNA, providing a directly proportional reduction in sequencing costs for shotgun sequencing experiments. The observed increases in endogenous DNA proportion are not associated with any reduction in absolute endogenous molecule recovery. Although sampling the outermost layer can result in higher levels of human contamination, some bones were found to have more contamination associated with the internal bone structures. Our method is highly consistent, reproducible and applicable across a wide range of bone types, ages and species. We predict that this discovery will greatly extend the potential to study ancient populations and species in the genomics era.
Past and present biodiversity in northeastern Siberia inferred from sedimentary DNA metabarcoding
(2021)
The arctic-boreal treeline is a transition zone from taiga to tundra covering a vast area in Siberia. It often features large environmental gradients and reacts sensitively to changes in the environment. For example, the expansion of shrubs and a northward movement of the treeline are observable in Siberia as a response to the warming climate. The changes in vegetation across the treeline are known to influence the water chemistry in the lakes. This causes further alteration to the composition and diversity of sensitive aquatic organisms such as diatoms and macrophytes. Despite the rising awareness of the complex climate-feedback mechanisms of terrestrial plants, the understanding of their assembly rules and about responses of aquatic biomes in the surrounding treeline lakes is still limited. The goal of this thesis is to examine the previous and present biodiversity of terrestrial and freshwater biomes from the Siberian treeline ecotone, as well as their reactions to environmental changes. In particular, this thesis attempts to examine the performance of applying sedimentary DNA metabarcoding in terrestrial plants, aquatic macrophytes and diatoms, their spatial patterns along the environmental gradients and their temporal patterns throughout the climate transition from the late Pleistocene to Holocene. Sedimentary DNA metabarcoding combined with next-generation sequencing is applied as a primary tool to explore the composition and diversity of terrestrial plants, diatoms and aquatic macrophytes. The main study area is located in Chukotka of northeastern Siberia in the Arctic, a biodiversity hotspot due to its continental location and the diverse habitats of the glacial refugium. The modern diatom diversity was assessed with a specific diatom metabarcoding marker and morphological identification. Both approaches agree to a dominance of Fragilariaceae and Aulacoseiraceae, as well as on the environmental influential indicators of the diatom community. The high diversity of Fragilariaceae identified in the thermokarst lakes is found to follow the vegetation gradient along the treeline, suggesting that diatom metabarcoding can decipher relationships between diatom assemblage shifts and the relevant environmental changes. In particular, the metabarcoding approach detects diversification of fragilarioids in glacial lakes which is not visible using morphology. Sedimentary ancient DNA records indicate a vegetation mosaic of forb-dominated steppe-tundra during 28-19 ka, followed by a shift to dwarf-shrub tundra during 19-14 ka. During the most recent 14 thousand years, the vegetation consists of deciduous shrublands, then a change to boreal forest is observed. Investigations on the alpha diversity of the vegetation show that species richness is unexpectedly highest during pre-LGM, which is likely related to the extensive area that allows for more taxa. The optimum Holocene warming during 9-6 ka is not accompanied by a high richness as widely believed, but with an evenly distributed community by the fulfilment of erect shrubs. Furthermore, changes in taxonomic and phylogenetic diversity show complementary results in understanding community diversity. The composition and richness in the modern macrophytes community from Siberian Arctic and Chinese alpine are best co-influenced by July temperature and electrical conductivity.. Past macrophyte turnover during the late Pleistocene-Holocene is less noticeable in Siberia, whereas a pronounced community change from emergent to submerged plants is detected from Chinese alpine regions at about 14 ka due to increasing temperature and varying water conductivity. Finally, sedimentary DNA metabarcoding is a cost-effective and powerful proxy for ecological application, whereas completeness of the reference library, coverage and resolution of the metabarcoding marker are the major limitations of sedimentary DNA based diversity monitoring. The composition and richness in modern vegetation and macrophytes across broad spatial gradients is constrained by environmental variables, suggesting a potential usage for environmental monitoring. Diatom distributions are driven by different water variables along the treeline. Past records indicate that the shrub coverage has a noticeable influence on the assemblies of both terrestrial plants and aquatic macrophytes, though the shift in macrophyte community is relatively minor in the past 28 thousand years. In the long-term, the shrub expansion may eventually result in a genetically more diverse vegetation community but reduced species richness. When exceeding the optimal temperatures, further warming may lead to a decrease and putative loss of macrophytes and diatoms.
Obtaining information about functional details of proteins of extinct species is of critical importance for a better understanding of the real-life appearance, behavior and ecology of these lost entries in the book of life. In this chapter, we discuss the possibilities to retrieve the necessary DNA sequence information from paleogenomic data obtained from fossil specimens, which can then be used to express and subsequently analyze the protein of interest. We discuss the problems specific to ancient DNA, including mis-coding lesions, short read length and incomplete paleogenome assemblies. Finally, we discuss an alternative, but currently rarely used approach, direct PCR amplification, which is especially useful for comparatively short proteins.
High-throughput sequence data retrieved from ancient or other degraded samples has led to unprecedented insights into the evolutionary history of many species, but the analysis of such sequences also poses specific computational challenges. The most commonly used approach involves mapping sequence reads to a reference genome. However, this process becomes increasingly challenging with an elevated genetic distance between target and reference or with the presence of contaminant sequences with high sequence similarity to the target species. The evaluation and testing of mapping efficiency and stringency are thus paramount for the reliable identification and analysis of ancient sequences. In this paper, we present ‘TAPAS’, (Testing of Alignment Parameters for Ancient Samples), a computational tool that enables the systematic testing of mapping tools for ancient data by simulating sequence data reflecting the properties of an ancient dataset and performing test runs using the mapping software and parameter settings of interest. We showcase TAPAS by using it to assess and improve mapping strategy for a degraded sample from a banded linsang (Prionodon linsang), for which no closely related reference is currently available. This enables a 1.8-fold increase of the number of mapped reads without sacrificing mapping specificity. The increase of mapped reads effectively reduces the need for additional sequencing, thus making more economical use of time, resources, and sample material.
High-throughput sequence data retrieved from ancient or other degraded samples has led to unprecedented insights into the evolutionary history of many species, but the analysis of such sequences also poses specific computational challenges. The most commonly used approach involves mapping sequence reads to a reference genome. However, this process becomes increasingly challenging with an elevated genetic distance between target and reference or with the presence of contaminant sequences with high sequence similarity to the target species. The evaluation and testing of mapping efficiency and stringency are thus paramount for the reliable identification and analysis of ancient sequences. In this paper, we present ‘TAPAS’, (Testing of Alignment Parameters for Ancient Samples), a computational tool that enables the systematic testing of mapping tools for ancient data by simulating sequence data reflecting the properties of an ancient dataset and performing test runs using the mapping software and parameter settings of interest. We showcase TAPAS by using it to assess and improve mapping strategy for a degraded sample from a banded linsang (Prionodon linsang), for which no closely related reference is currently available. This enables a 1.8-fold increase of the number of mapped reads without sacrificing mapping specificity. The increase of mapped reads effectively reduces the need for additional sequencing, thus making more economical use of time, resources, and sample material.
The horse is a fascinating animal symbolizing power, beauty, strength and grace. Among all the animal species domesticated the horse had the largest impact on the course of human history due to its importance for warfare and transportation. Studying the process of horse domestication contributes to the knowledge about the history of horses and even of our own species.
Research based on molecular methods has increasingly focused on the genetic basis of horse domestication. Mitochondrial DNA (mtDNA) analyses of modern and ancient horses detected immense maternal diversity, probably due to many mares that contributed to the domestic population. However, mtDNA does not provide an informative phylogeographic structure. In contrast, Y chromosome analyses displayed almost complete uniformity in modern stallions but relatively high diversity in a few ancient horses. Further molecular markers that seem to be well suited to infer the domestication history of horses or genetic and phenotypic changes during this process are loci associated with phenotypic traits.
This doctoral thesis consists of three different parts for which I analyzed various single nucleotide polymorphisms (SNPs) associated with coat color, locomotion or Y chromosomal variation of horses. These SNPs were genotyped in 350 ancient horses from the Chalcolithic (5,000 BC) to the Middle Ages (11th century). The distribution of the samples ranges from China to the Iberian Peninsula and Iceland. By applying multiplexed next-generation sequencing (NGS) I sequenced short amplicons covering the relevant positions: i) eight coat-color-associated mutations in six genes to deduce the coat color phenotype; ii) the so-called ’Gait-keeper’ SNP in the DMRT3 gene to screen for the ability to amble; iii) 16 SNPs previously detected in ancient horses to infer the corresponding haplotype. Based on these data I investigated the occurrence and frequencies of alleles underlying the respective phenotypes as well as Y chromosome haplotypes at different times and regions. Also, selection coefficients for several Y chromosome lineages or phenotypes were estimated.
Concerning coat color differences in ancient horses my work constitutes the most comprehensive study to date. I detected an increase of chestnut horses in the Middle Ages as well as differential selection for spotted and solid phenotypes over time which reflects changing human preferences.
With regard to ambling horses, the corresponding allele was present in medieval English and Icelandic horses. Based on these results I argue that Norse settlers, who frequently invaded parts of Britain, brought ambling individuals to Iceland from the British Isles which can be regarded the origin of this trait. Moreover, these settlers appear to have selected for ambling in Icelandic horses.
Relating to the third trait, the paternal diversity, these findings represent the largest ancient dataset of Y chromosome variation in non-humans. I proved the existence of several Y chromosome haplotypes in early domestic horses. The decline of Y chromosome variation coincides with the movement of nomadic peoples from the Eurasian steppes and later with different breeding practices in the Roman period.
In conclusion, positive selection was estimated for several phenotypes/lineages
in different regions or times which indicates that these were preferred by humans. Furthermore, I could successfully infer the distribution and dispersal of horses in association with human movements and actions. Thereby, a better understanding of the influence of people on the changing appearance and genetic diversity of domestic horses could be gained. My results also emphasize the close relationship of ancient genetics and archeology or history and that only in combination well-founded conclusions can be reached.