Refine
Has Fulltext
- yes (13) (remove)
Document Type
- Postprint (13)
Language
- English (13)
Is part of the Bibliography
- yes (13)
Keywords
- ancient DNA (4)
- Genomics (2)
- Mitochondria (2)
- Carnivorous plant (1)
- Cattle (1)
- Chew Bahir (1)
- Clethra arborea (1)
- D statistics (1)
- DNA (1)
- DNA preservation (1)
Institute
- Institut für Biochemie und Biologie (13) (remove)
Simultaneous Barcode Sequencing of Diverse Museum Collection Specimens Using a Mixed RNA Bait Set
(2022)
A growing number of publications presenting results from sequencing natural history collection specimens reflect the importance of DNA sequence information from such samples. Ancient DNA extraction and library preparation methods in combination with target gene capture are a way of unlocking archival DNA, including from formalin-fixed wet-collection material. Here we report on an experiment, in which we used an RNA bait set containing baits from a wide taxonomic range of species for DNA hybridisation capture of nuclear and mitochondrial targets for analysing natural history collection specimens. The bait set used consists of 2,492 mitochondrial and 530 nuclear RNA baits and comprises specific barcode loci of diverse animal groups including both invertebrates and vertebrates. The baits allowed to capture DNA sequence information of target barcode loci from 84% of the 37 samples tested, with nuclear markers being captured more frequently and consensus sequences of these being more complete compared to mitochondrial markers. Samples from dry material had a higher rate of success than wet-collection specimens, although target sequence information could be captured from 50% of formalin-fixed samples. Our study illustrates how efforts to obtain barcode sequence information from natural history collection specimens may be combined and are a way of implementing barcoding inventories of scientific collection material.
Eastern Africa has been a prime target for scientific drilling because it is rich in key paleoanthropological sites as well as in paleolakes, containing valuable paleoclimatic information on evolutionary time scales. The Hominin Sites and Paleolakes Drilling Project (HSPDP) explores these paleolakes with the aim of reconstructing environmental conditions around critical episodes of hominin evolution. Identification of biological taxa based on their sedimentary ancient DNA (sedaDNA) traces can contribute to understand past ecological and climatological conditions of the living environment of our ancestors. However, sedaDNA recovery from tropical environments is challenging because high temperatures, UV irradiation, and desiccation result in highly degraded DNA. Consequently, most of the DNA fragments in tropical sediments are too short for PCR amplification. We analyzed sedaDNA in the upper 70 m of the composite sediment core of the HSPDP drill site at Chew Bahir for eukaryotic remnants. We first tested shotgun high throughput sequencing which leads to metagenomes dominated by bacterial DNA of the deep biosphere, while only a small fraction was derived from eukaryotic, and thus probably ancient, DNA. Subsequently, we performed cross-species hybridization capture of sedaDNA to enrich ancient DNA (aDNA) from eukaryotic remnants for paleoenvironmental analysis, using established barcoding genes (cox1 and rbcL for animals and plants, respectively) from 199 species that may have had relatives in the past biosphere at Chew Bahir. Metagenomes yielded after hybridization capture are richer in reads with similarity to cox1 and rbcL in comparison to metagenomes without prior hybridization capture. Taxonomic assignments of the reads from these hybridization capture metagenomes also yielded larger fractions of the eukaryotic domain. For reads assigned to cox1, inferred wet periods were associated with high inferred relative abundances of putative limnic organisms (gastropods, green algae), while inferred dry periods showed increased relative abundances for insects. These findings indicate that cross-species hybridization capture can be an effective approach to enhance the information content of sedaDNA in order to explore biosphere changes associated with past environmental conditions, enabling such analyses even under tropical conditions.
Domestic cattle were brought to Spain by early settlers and agricultural societies. Due to missing Neolithic sites in the Spanish region of Galicia, very little is known about this process in this region. We sampled 18 cattle subfossils from different ages and different mountain caves in Galicia, of which 11 were subject to sequencing of the mitochondrial genome and phylogenetic analysis, to provide insight into the introduction of cattle to this region. We detected high similarity between samples from different time periods and were able to compare the time frame of the first domesticated cattle in Galicia to data from the connecting region of Cantabria to show a plausible connection between the Neolithization of these two regions. Our data shows a close relationship of the early domesticated cattle of Galicia and modern cow breeds and gives a general insight into cattle phylogeny. We conclude that settlers migrated to this region of Spain from Europe and introduced common European breeds to Galicia.
Objective
Plant carnivory is distributed across the tree of life and has evolved at least six times independently, but sequenced and annotated nuclear genomes of carnivorous plants are currently lacking. We have sequenced and structurally annotated the nuclear genome of the carnivorous Roridula gorgonias and that of a non-carnivorous relative, Madeira’s lily-of-the-valley-tree, Clethra arborea, both within the Ericales. This data adds an important resource to study the evolutionary genetics of plant carnivory across angiosperm lineages and also for functional and systematic aspects of plants within the Ericales.
Results
Our assemblies have total lengths of 284 Mbp (R. gorgonias) and 511 Mbp (C. arborea) and show high BUSCO scores of 84.2% and 89.5%, respectively. We used their predicted genes together with publicly available data from other Ericales’ genomes and transcriptomes to assemble a phylogenomic data set for the inference of a species tree. However, groups of orthologs showed a marked absence of species represented by a transcriptome. We discuss possible reasons and caution against combining predicted genes from genome- and transriptome-based assemblies.
Consensify
(2020)
A standard practise in palaeogenome analysis is the conversion of mapped short read data into pseudohaploid sequences, frequently by selecting a single high-quality nucleotide at random from the stack of mapped reads. This controls for biases due to differential sequencing coverage, but it does not control for differential rates and types of sequencing error, which are frequently large and variable in datasets obtained from ancient samples. These errors have the potential to distort phylogenetic and population clustering analyses, and to mislead tests of admixture using D statistics. We introduce Consensify, a method for generating pseudohaploid sequences, which controls for biases resulting from differential sequencing coverage while greatly reducing error rates. The error correction is derived directly from the data itself, without the requirement for additional genomic resources or simplifying assumptions such as contemporaneous sampling. For phylogenetic and population clustering analysis, we find that Consensify is less affected by artefacts than methods based on single read sampling. For D statistics, Consensify is more resistant to false positives and appears to be less affected by biases resulting from different laboratory protocols than other frequently used methods. Although Consensify is developed with palaeogenomic data in mind, it is applicable for any low to medium coverage short read datasets. We predict that Consensify will be a useful tool for future studies of palaeogenomes.
Utilising a reconstructed ancestral mitochondrial genome of a clade to design hybridisation capture baits can provide the opportunity for recovering mitochondrial sequences from all its descendent and even sister lineages. This approach is useful for taxa with no extant close relatives, as is often the case for rare or extinct species, and is a viable approach for the analysis of historical museum specimens. Asiatic linsangs (genus Prionodon) exemplify this situation, being rare Southeast Asian carnivores for which little molecular data is available. Using ancestral capture we recover partial mitochondrial genome sequences for seven banded linsangs (P. linsang) from historical specimens, representing the first intraspecific genetic dataset for this species. We additionally assemble a high quality mitogenome for the banded linsang using shotgun sequencing for time-calibrated phylogenetic analysis. This reveals a deep divergence between the two Asiatic linsang species (P. linsang, P. pardicolor), with an estimated divergence of ~12 million years (Ma). Although our sample size precludes any robust interpretation of the population structure of the banded linsang, we recover two distinct matrilines with an estimated tMRCA of ~1 Ma. Our results can be used as a basis for further investigation of the Asiatic linsangs, and further demonstrate the utility of ancestral capture for studying divergent taxa without close relatives.
Taxonomy plays a central role in biological sciences. It provides a communication system for scientists as it aims to enable correct identification of the studied organisms. As a consequence, species descriptions should seek to include as much available information as possible at species level to follow an integrative concept of 'taxonomics'. Here, we describe the cryptic species Epimeria frankei sp. nov. from the North Sea, and also redescribe its sister species, Epimeria cornigera. The morphological information obtained is substantiated by DNA barcodes and complete nuclear 18S rRNA gene sequences. In addition, we provide, for the first time, full mitochondrial genome data as part of a metazoan species description for a holotype, as well as the neotype. This study represents the first successful implementation of the recently proposed concept of taxonomics, using data from high-throughput technologies for integrative taxonomic studies, allowing the highest level of confidence for both biodiversity and ecological research.
Background
Contiguous genome assemblies are a highly valued biological resource because of the higher number of completely annotated genes and genomic elements that are usable compared to fragmented draft genomes. Nonetheless, contiguity is difficult to obtain if only low coverage data and/or only distantly related reference genome assemblies are available.
Findings
In order to improve genome contiguity, we have developed Cross-Species Scaffolding—a new pipeline that imports long-range distance information directly into the de novo assembly process by constructing mate-pair libraries in silico.
Conclusions
We show how genome assembly metrics and gene prediction dramatically improve with our pipeline by assembling two primate genomes solely based on ∼30x coverage of shotgun sequencing data.
Background:
Skewed body size distributions and the high relative richness of small-bodied taxa are a fundamental
property of a wide range of animal clades. The evolutionary processes responsible for generating these distributions
are well described in vertebrate model systems but have yet to be explored in detail for other major terrestrial
clades. In this study, we explore the macro-evolutionary patterns of body size variation across families of Hexapoda
(insects and their close relatives), using recent advances in phylogenetic understanding, with an aim to investigate
the link between size and diversity within this ancient and highly diverse lineage.
Results:
The maximum, minimum and mean-log body lengths of hexapod families are all approximately log-normally
distributed, consistent with previous studies at lower taxonomic levels, and contrasting with skewed distributions
typical of vertebrate groups. After taking phylogeny and within-tip variation into account, we find no evidence for a
negative relationship between diversification rate and body size, suggesting decoupling of the forces controlling these
two traits. Likelihood-based modeling of the log-mean body size identifies distinct processes operating within
Holometabola and Diptera compared with other hexapod groups, consistent with accelerating rates of size evolution
within these clades, while as a whole, hexapod body size evolution is found to be dominated by neutral processes
including significant phylogenetic conservatism.
Conclusions:
Based on our findings we suggest that the use of models derived from well-studied but atypical clades,
such as vertebrates may lead to misleading conclusions when applied to other major terrestrial lineages. Our results
indicate that within hexapods, and within the limits of current systematic and phylogenetic knowledge, insect
diversification is generally unfettered by size-biased macro-evolutionary processes, and that these processes over large
timescales tend to converge on apparently neutral evolutionary processes. We also identify limitations on available
data within the clade and modeling approaches for the resolution of trees of higher taxa, the resolution of which may
collectively enhance our understanding of this key component of terrestrial ecosystems.
Faunal remains from Palaeolithic sites are important genetic sources to study preglacial and postglacial populations and to investigate the effect of climate change and human impact. Post mortem decay, resulting in fragmented and chemically modified DNA, is a key obstacle in ancient DNA analyses. In the absence of reliable methods to determine the presence of endogenous DNA in sub-fossil samples, temporal and spatial surveys of DNA survival on a regional scale may help to estimate the potential of faunal remains from a given time period and region. We therefore investigated PCR amplification success, PCR performance and post mortem damage in c. 47,000 to c. 12,000-year-old horse remains from 14 Palaeolithic sites along the Swiss Jura Mountains in relation to depositional context, tissue type, storage time and age, potentially influencing DNA preservation. The targeted 75 base pair mitochondrial DNA fragment could be amplified solely from equid remains from caves and not from any of the open dry and (temporary) wetland sites. Whether teeth are better than bones cannot be ultimately decided; however, both storage time after excavation and age significantly affect PCR amplification and performance, albeit not in a linear way. This is best explained by the—inevitable—heterogeneity of the data set. The extent of post mortem damage is not related to any of the potential impact factors. The results encourage comprehensive investigations of Palaeolithic cave sites, even from temperate regions.