Refine
Year of publication
Document Type
Language
- English (91)
Is part of the Bibliography
- yes (91)
Keywords
- ancient DNA (22)
- palaeogenomics (6)
- Genomics (4)
- Mitochondria (4)
- hybridization capture (4)
- museum specimens (4)
- phylogeny (4)
- Phylogenetics (3)
- genome assembly (3)
- mitochondrial genome (3)
Institute
- Institut für Biochemie und Biologie (91) (remove)
Simultaneous Barcode Sequencing of Diverse Museum Collection Specimens Using a Mixed RNA Bait Set
(2022)
A growing number of publications presenting results from sequencing natural history collection specimens reflect the importance of DNA sequence information from such samples. Ancient DNA extraction and library preparation methods in combination with target gene capture are a way of unlocking archival DNA, including from formalin-fixed wet-collection material. Here we report on an experiment, in which we used an RNA bait set containing baits from a wide taxonomic range of species for DNA hybridisation capture of nuclear and mitochondrial targets for analysing natural history collection specimens. The bait set used consists of 2,492 mitochondrial and 530 nuclear RNA baits and comprises specific barcode loci of diverse animal groups including both invertebrates and vertebrates. The baits allowed to capture DNA sequence information of target barcode loci from 84% of the 37 samples tested, with nuclear markers being captured more frequently and consensus sequences of these being more complete compared to mitochondrial markers. Samples from dry material had a higher rate of success than wet-collection specimens, although target sequence information could be captured from 50% of formalin-fixed samples. Our study illustrates how efforts to obtain barcode sequence information from natural history collection specimens may be combined and are a way of implementing barcoding inventories of scientific collection material.
Simultaneous Barcode Sequencing of Diverse Museum Collection Specimens Using a Mixed RNA Bait Set
(2022)
A growing number of publications presenting results from sequencing natural history collection specimens reflect the importance of DNA sequence information from such samples. Ancient DNA extraction and library preparation methods in combination with target gene capture are a way of unlocking archival DNA, including from formalin-fixed wet-collection material. Here we report on an experiment, in which we used an RNA bait set containing baits from a wide taxonomic range of species for DNA hybridisation capture of nuclear and mitochondrial targets for analysing natural history collection specimens. The bait set used consists of 2,492 mitochondrial and 530 nuclear RNA baits and comprises specific barcode loci of diverse animal groups including both invertebrates and vertebrates. The baits allowed to capture DNA sequence information of target barcode loci from 84% of the 37 samples tested, with nuclear markers being captured more frequently and consensus sequences of these being more complete compared to mitochondrial markers. Samples from dry material had a higher rate of success than wet-collection specimens, although target sequence information could be captured from 50% of formalin-fixed samples. Our study illustrates how efforts to obtain barcode sequence information from natural history collection specimens may be combined and are a way of implementing barcoding inventories of scientific collection material.
Ancient genome provides insights into the history of Eurasian lynx in Iberia and Western Europe
(2022)
The Eurasian lynx (Lynx lynx) is one of the most widely distributed felids in the world. However, most of its populations started to decline a few millennia ago. Historical declines have been especially severe in Europe, and particularly in Western Europe, from where the species disappeared in the last few centuries. Here, we analyze the genome of an Eurasian lynx inhabiting the Iberian Peninsula 2500 ya, to gain insights into the phylogeographic position and genetic status of this extinct population. Also, we contextualize previous ancient data in the light of new phylogeographic studies of the species. Our results suggest that the Iberian population is part of an extinct European lineage closely related to the current Carpathian-Baltic lineages. Also, this sample holds the lowest diversity reported for the species so far, and similar to that of the highly endangered Iberian lynx. A combination of historical factors, such as a founder effect while colonizing the peninsula, together with intensified human impacts during the Holocene in the Cantabrian strip, could have led to a genetic impoverishment of the population and precipitated its extinction. Mitogenomic lineages distribution in space and time support the long-term coexistence of several lineages of Eurasian lynx in Western Europe with fluctuating ranges. While mitochondrial sequences related to the lineages currently found in Balkans and Caucasus were predominant during the Pleistocene, those more closely related to the lineage currently distributed in Central Europe prevailed during the Holocene. The use of ancient genomics has proven to be a useful tool to understand the biogeographic pattern of the Eurasian lynx in the past.
(1) Background:
Adaptive diversification of complex traits plays a pivotal role in the evolution of organismal diversity. In the freshwater snail genus Tylomelania, adaptive radiations were likely promoted by trophic specialization via diversification of their key foraging organ, the radula.
(2) Methods:
To investigate the molecular basis of radula diversification and its contribution to lineage divergence, we used tissue-specific transcriptomes of two sympatric Tylomelania sarasinorum ecomorphs.
(3) Results:
We show that ecomorphs are genetically divergent lineages with habitat-correlated abundances. Sequence divergence and the proportion of highly differentially expressed genes are significantly higher between radula transcriptomes compared to the mantle and foot. However, the same is not true when all differentially expressed genes or only non-synonymous SNPs are considered. Finally, putative homologs of some candidate genes for radula diversification (hh, arx, gbb) were also found to contribute to trophic specialization in cichlids and Darwin's finches.
(4) Conclusions:
Our results are in line with diversifying selection on the radula driving Tylomelania ecomorph divergence and indicate that some molecular pathways may be especially prone to adaptive diversification, even across phylogenetically distant animal groups.
Eastern Africa has been a prime target for scientific drilling because it is rich in key paleoanthropological sites as well as in paleolakes, containing valuable paleoclimatic information on evolutionary time scales. The Hominin Sites and Paleolakes Drilling Project (HSPDP) explores these paleolakes with the aim of reconstructing environmental conditions around critical episodes of hominin evolution. Identification of biological taxa based on their sedimentary ancient DNA (sedaDNA) traces can contribute to understand past ecological and climatological conditions of the living environment of our ancestors. However, sedaDNA recovery from tropical environments is challenging because high temperatures, UV irradiation, and desiccation result in highly degraded DNA. Consequently, most of the DNA fragments in tropical sediments are too short for PCR amplification. We analyzed sedaDNA in the upper 70 m of the composite sediment core of the HSPDP drill site at Chew Bahir for eukaryotic remnants. We first tested shotgun high throughput sequencing which leads to metagenomes dominated by bacterial DNA of the deep biosphere, while only a small fraction was derived from eukaryotic, and thus probably ancient, DNA. Subsequently, we performed cross-species hybridization capture of sedaDNA to enrich ancient DNA (aDNA) from eukaryotic remnants for paleoenvironmental analysis, using established barcoding genes (cox1 and rbcL for animals and plants, respectively) from 199 species that may have had relatives in the past biosphere at Chew Bahir. Metagenomes yielded after hybridization capture are richer in reads with similarity to cox1 and rbcL in comparison to metagenomes without prior hybridization capture. Taxonomic assignments of the reads from these hybridization capture metagenomes also yielded larger fractions of the eukaryotic domain. For reads assigned to cox1, inferred wet periods were associated with high inferred relative abundances of putative limnic organisms (gastropods, green algae), while inferred dry periods showed increased relative abundances for insects. These findings indicate that cross-species hybridization capture can be an effective approach to enhance the information content of sedaDNA in order to explore biosphere changes associated with past environmental conditions, enabling such analyses even under tropical conditions.
Eastern Africa has been a prime target for scientific drilling because it is rich in key paleoanthropological sites as well as in paleolakes, containing valuable paleoclimatic information on evolutionary time scales. The Hominin Sites and Paleolakes Drilling Project (HSPDP) explores these paleolakes with the aim of reconstructing environmental conditions around critical episodes of hominin evolution. Identification of biological taxa based on their sedimentary ancient DNA (sedaDNA) traces can contribute to understand past ecological and climatological conditions of the living environment of our ancestors. However, sedaDNA recovery from tropical environments is challenging because high temperatures, UV irradiation, and desiccation result in highly degraded DNA. Consequently, most of the DNA fragments in tropical sediments are too short for PCR amplification. We analyzed sedaDNA in the upper 70 m of the composite sediment core of the HSPDP drill site at Chew Bahir for eukaryotic remnants. We first tested shotgun high throughput sequencing which leads to metagenomes dominated by bacterial DNA of the deep biosphere, while only a small fraction was derived from eukaryotic, and thus probably ancient, DNA. Subsequently, we performed cross-species hybridization capture of sedaDNA to enrich ancient DNA (aDNA) from eukaryotic remnants for paleoenvironmental analysis, using established barcoding genes (cox1 and rbcL for animals and plants, respectively) from 199 species that may have had relatives in the past biosphere at Chew Bahir. Metagenomes yielded after hybridization capture are richer in reads with similarity to cox1 and rbcL in comparison to metagenomes without prior hybridization capture. Taxonomic assignments of the reads from these hybridization capture metagenomes also yielded larger fractions of the eukaryotic domain. For reads assigned to cox1, inferred wet periods were associated with high inferred relative abundances of putative limnic organisms (gastropods, green algae), while inferred dry periods showed increased relative abundances for insects. These findings indicate that cross-species hybridization capture can be an effective approach to enhance the information content of sedaDNA in order to explore biosphere changes associated with past environmental conditions, enabling such analyses even under tropical conditions.
Domestic cattle were brought to Spain by early settlers and agricultural societies. Due to missing Neolithic sites in the Spanish region of Galicia, very little is known about this process in this region. We sampled 18 cattle subfossils from different ages and different mountain caves in Galicia, of which 11 were subject to sequencing of the mitochondrial genome and phylogenetic analysis, to provide insight into the introduction of cattle to this region. We detected high similarity between samples from different time periods and were able to compare the time frame of the first domesticated cattle in Galicia to data from the connecting region of Cantabria to show a plausible connection between the Neolithization of these two regions. Our data shows a close relationship of the early domesticated cattle of Galicia and modern cow breeds and gives a general insight into cattle phylogeny. We conclude that settlers migrated to this region of Spain from Europe and introduced common European breeds to Galicia.
Domestic cattle were brought to Spain by early settlers and agricultural societies. Due to missing Neolithic sites in the Spanish region of Galicia, very little is known about this process in this region. We sampled 18 cattle subfossils from different ages and different mountain caves in Galicia, of which 11 were subject to sequencing of the mitochondrial genome and phylogenetic analysis, to provide insight into the introduction of cattle to this region. We detected high similarity between samples from different time periods and were able to compare the time frame of the first domesticated cattle in Galicia to data from the connecting region of Cantabria to show a plausible connection between the Neolithization of these two regions. Our data shows a close relationship of the early domesticated cattle of Galicia and modern cow breeds and gives a general insight into cattle phylogeny. We conclude that settlers migrated to this region of Spain from Europe and introduced common European breeds to Galicia.
Comparing mitogenomic timetrees for two African savannah primate genera (Chlorocebus and Papio)
(2020)
Objective
Plant carnivory is distributed across the tree of life and has evolved at least six times independently, but sequenced and annotated nuclear genomes of carnivorous plants are currently lacking. We have sequenced and structurally annotated the nuclear genome of the carnivorous Roridula gorgonias and that of a non-carnivorous relative, Madeira’s lily-of-the-valley-tree, Clethra arborea, both within the Ericales. This data adds an important resource to study the evolutionary genetics of plant carnivory across angiosperm lineages and also for functional and systematic aspects of plants within the Ericales.
Results
Our assemblies have total lengths of 284 Mbp (R. gorgonias) and 511 Mbp (C. arborea) and show high BUSCO scores of 84.2% and 89.5%, respectively. We used their predicted genes together with publicly available data from other Ericales’ genomes and transcriptomes to assemble a phylogenomic data set for the inference of a species tree. However, groups of orthologs showed a marked absence of species represented by a transcriptome. We discuss possible reasons and caution against combining predicted genes from genome- and transriptome-based assemblies.
Obtaining information about functional details of proteins of extinct species is of critical importance for a better understanding of the real-life appearance, behavior and ecology of these lost entries in the book of life. In this chapter, we discuss the possibilities to retrieve the necessary DNA sequence information from paleogenomic data obtained from fossil specimens, which can then be used to express and subsequently analyze the protein of interest. We discuss the problems specific to ancient DNA, including mis-coding lesions, short read length and incomplete paleogenome assemblies. Finally, we discuss an alternative, but currently rarely used approach, direct PCR amplification, which is especially useful for comparatively short proteins.
Xenikoudakis et al. report a partial mitochondrial genome of the extinct giant beaver Castoroides and estimate the origin of aquatic behavior in beavers to approximately 20 million years. This time estimate coincides with the extinction of terrestrial beavers and raises the question whether the two events had a common cause.
Consensify
(2020)
A standard practise in palaeogenome analysis is the conversion of mapped short read data into pseudohaploid sequences, frequently by selecting a single high-quality nucleotide at random from the stack of mapped reads. This controls for biases due to differential sequencing coverage, but it does not control for differential rates and types of sequencing error, which are frequently large and variable in datasets obtained from ancient samples. These errors have the potential to distort phylogenetic and population clustering analyses, and to mislead tests of admixture using D statistics. We introduce Consensify, a method for generating pseudohaploid sequences, which controls for biases resulting from differential sequencing coverage while greatly reducing error rates. The error correction is derived directly from the data itself, without the requirement for additional genomic resources or simplifying assumptions such as contemporaneous sampling. For phylogenetic and population clustering analysis, we find that Consensify is less affected by artefacts than methods based on single read sampling. For D statistics, Consensify is more resistant to false positives and appears to be less affected by biases resulting from different laboratory protocols than other frequently used methods. Although Consensify is developed with palaeogenomic data in mind, it is applicable for any low to medium coverage short read datasets. We predict that Consensify will be a useful tool for future studies of palaeogenomes.
Objective
Plant carnivory is distributed across the tree of life and has evolved at least six times independently, but sequenced and annotated nuclear genomes of carnivorous plants are currently lacking. We have sequenced and structurally annotated the nuclear genome of the carnivorous Roridula gorgonias and that of a non-carnivorous relative, Madeira’s lily-of-the-valley-tree, Clethra arborea, both within the Ericales. This data adds an important resource to study the evolutionary genetics of plant carnivory across angiosperm lineages and also for functional and systematic aspects of plants within the Ericales.
Results
Our assemblies have total lengths of 284 Mbp (R. gorgonias) and 511 Mbp (C. arborea) and show high BUSCO scores of 84.2% and 89.5%, respectively. We used their predicted genes together with publicly available data from other Ericales’ genomes and transcriptomes to assemble a phylogenomic data set for the inference of a species tree. However, groups of orthologs showed a marked absence of species represented by a transcriptome. We discuss possible reasons and caution against combining predicted genes from genome- and transriptome-based assemblies.
Consensify
(2020)
A standard practise in palaeogenome analysis is the conversion of mapped short read data into pseudohaploid sequences, frequently by selecting a single high-quality nucleotide at random from the stack of mapped reads. This controls for biases due to differential sequencing coverage, but it does not control for differential rates and types of sequencing error, which are frequently large and variable in datasets obtained from ancient samples. These errors have the potential to distort phylogenetic and population clustering analyses, and to mislead tests of admixture using D statistics. We introduce Consensify, a method for generating pseudohaploid sequences, which controls for biases resulting from differential sequencing coverage while greatly reducing error rates. The error correction is derived directly from the data itself, without the requirement for additional genomic resources or simplifying assumptions such as contemporaneous sampling. For phylogenetic and population clustering analysis, we find that Consensify is less affected by artefacts than methods based on single read sampling. For D statistics, Consensify is more resistant to false positives and appears to be less affected by biases resulting from different laboratory protocols than other frequently used methods. Although Consensify is developed with palaeogenomic data in mind, it is applicable for any low to medium coverage short read datasets. We predict that Consensify will be a useful tool for future studies of palaeogenomes.
Utilising a reconstructed ancestral mitochondrial genome of a clade to design hybridisation capture baits can provide the opportunity for recovering mitochondrial sequences from all its descendent and even sister lineages. This approach is useful for taxa with no extant close relatives, as is often the case for rare or extinct species, and is a viable approach for the analysis of historical museum specimens. Asiatic linsangs (genus Prionodon) exemplify this situation, being rare Southeast Asian carnivores for which little molecular data is available. Using ancestral capture we recover partial mitochondrial genome sequences for seven banded linsangs (P. linsang) from historical specimens, representing the first intraspecific genetic dataset for this species. We additionally assemble a high quality mitogenome for the banded linsang using shotgun sequencing for time-calibrated phylogenetic analysis. This reveals a deep divergence between the two Asiatic linsang species (P. linsang, P. pardicolor), with an estimated divergence of ~12 million years (Ma). Although our sample size precludes any robust interpretation of the population structure of the banded linsang, we recover two distinct matrilines with an estimated tMRCA of ~1 Ma. Our results can be used as a basis for further investigation of the Asiatic linsangs, and further demonstrate the utility of ancestral capture for studying divergent taxa without close relatives.
Utilising a reconstructed ancestral mitochondrial genome of a clade to design hybridisation capture baits can provide the opportunity for recovering mitochondrial sequences from all its descendent and even sister lineages. This approach is useful for taxa with no extant close relatives, as is often the case for rare or extinct species, and is a viable approach for the analysis of historical museum specimens. Asiatic linsangs (genus Prionodon) exemplify this situation, being rare Southeast Asian carnivores for which little molecular data is available. Using ancestral capture we recover partial mitochondrial genome sequences for seven banded linsangs (P. linsang) from historical specimens, representing the first intraspecific genetic dataset for this species. We additionally assemble a high quality mitogenome for the banded linsang using shotgun sequencing for time-calibrated phylogenetic analysis. This reveals a deep divergence between the two Asiatic linsang species (P. linsang, P. pardicolor), with an estimated divergence of ~12 million years (Ma). Although our sample size precludes any robust interpretation of the population structure of the banded linsang, we recover two distinct matrilines with an estimated tMRCA of ~1 Ma. Our results can be used as a basis for further investigation of the Asiatic linsangs, and further demonstrate the utility of ancestral capture for studying divergent taxa without close relatives.
Mitochondrial genomes of Late Pleistocene caballine horses from China belong to a separate clade
(2020)
There were several species of Equus in northern China during the Late Pleistocene, including Equus przewalskii and Equus dalianensis. A number of morphological studies have been carried out on E. przewalskii and E. dalianensis, but their evolutionary history is still unresolved. In this study, we retrieved near-complete mitochondrial genomes from E. dalianensis and E. przewalskii specimens excavated from Late Pleistocene strata in northeastern China. Phylogenetic analyses revealed that caballoid horses were divided into two subclades: the New World and the Old World caballine horse subclades. The Old World caballine horses comprise of two deep phylogenetic lineages, with modern and ancient Equus caballus and modern E. przewalskii forming lineage I, and the individuals in this study together with one Yakut specimen forming lineage II. Our results indicate that Chinese Late Pleistocene caballoid horses showed a closer relationship to other Eurasian caballine horses than that to Pleistocene horses from North America. In addition, phylogenetic analyses suggested a close relationship between E. dalianensis and the Chinese fossil E. przewalskii, in agreement with previous researches based on morphological analyses. Interestingly, E. dalianensis and the fossil E. przewalskii were intermixed rather than split into distinct lineages, suggesting either that gene flow existed between these two species or that morphology-based species assignment of palaeontological specimens is not always correct. Moreover, Bayesian analysis showed that the divergence time between the New World and the Old World caballoid horses was at 1.02 Ma (95% CI: 0.86-1.24 Ma), and the two Old World lineages (I & II) split at 0.88 Ma (95% CI: 0.69-1.13 Ma), which indicates that caballoid horses seem to have evolved into different populations in the Old World soon after they migrated from North America via the Bering Land Bridge. Finally, the TMRCA of E. dalianensis was estimated at 0.20 Ma (95% CI: 0.15-0.28 Ma), and it showed a relative low genetic diversity compared with other Equus species.
The great auk was once abundant and distributed across the North Atlantic. It is now extinct, having been heavily exploited for its eggs, meat, and feathers. We investigated the impact of human hunting on its demise by integrating genetic data, GPS-based ocean current data, and analyses of population viability. We sequenced complete mitochondrial genomes of 41 individuals from across the species' geographic range and reconstructed population structure and population dynamics throughout the Holocene. Taken together, our data do not provide any evidence that great auks were at risk of extinction prior to the onset of intensive human hunting in the early 16th century. In addition, our population viability analyses reveal that even if the great auk had not been under threat by environmental change, human hunting alone could have been sufficient to cause its extinction. Our results emphasise the vulnerability of even abundant and widespread species to intense and localised exploitation.
Domestic Bactrian camel (Camelus bactrianus) used to be one of the most important livestock species in Chinese history, as well as the major transport carrier on the ancient Silk Road. However, archeological studies on Chinese C. bactrianus are still limited, and molecular biology research on this species is mainly focused on modern specimens. In this study, we retrieved the complete mitochondrial genome from a C. bactrianus specimen, which was excavated from northwestern China and dated at 1290-1180 cal. Phylogenetic analyses using 18 mitochondrial genomes indicated that the C. bactrianus clade was divided into two maternal lineages. The majority of samples originating from Iran to Japan and Mongolia belong to subclade A1, while our sample together with two Mongolian individuals formed the much smaller subclade A2. Furthermore, the divergence time of these two maternal lineages was estimated as 165 Kya (95% credibility interval 117-222 Kya), this might indicate that several different evolutionary lineages were incorporated into the domestic gene pool during the initial domestication process. Bayesian skyline plot (BSP) analysis a slow increase in female effective population size of C. bactrianus from 5000 years ago, which to the beginning of domestication of C. bactrianus. The present study also revealed that there were extensive exchanges of genetic information among C. bactrianus populations in regions along the Silk Road.