publish.UP Suchen

High-quality whole-genome sequence of an abundant Holarctic odontocete, the harbour porpoise (Phocoena phocoena) (2018)

Autenrieth, Marijke ; Hartmann, Stefanie ; Lah, Ljerka ; Roos, Anna ; Dennis, Alice B. ; Tiedemann, Ralph

The harbour porpoise (Phocoena phocoena) is a highly mobile cetacean found across the Northern hemisphere. It occurs in coastal waters and inhabits basins that vary broadly in salinity, temperature and food availability. These diverse habitats could drive subtle differentiation among populations, but examination of this would be best conducted with a robust reference genome. Here, we report the first harbour porpoise genome, assembled de novo from an individual originating in the Kattegat Sea (Sweden). The genome is one of the most complete cetacean genomes currently available, with a total size of 2.39 Gb and 50% of the total length found in just 34 scaffolds. Using 122 of the longest scaffolds, we were able to show high levels of synteny with the genome of the domestic cattle (Bos taurus). Our draft annotation comprises 22,154 predicted genes, which we further annotated through matches to the NCBI nucleotide database, GO categorization and motif prediction. Within the predicted genes, we have confirmed the presence of >20 genes or gene families that have been associated with adaptive evolution in other cetaceans. Overall, this genome assembly and draft annotation represent a crucial addition to the genomic resources currently available for the study of porpoises and Phocoenidae evolution, phylogeny and conservation.

Partial genomic survival of cave bears in living brown bears (2018)

Although many large mammal species went extinct at the end of the Pleistocene epoch, their DNA may persist due to past episodes of interspecies admixture. However, direct empirical evidence of the persistence of ancient alleles remains scarce. Here, we present multifold coverage genomic data from four Late Pleistocene cave bears (Ursus spelaeus complex) and show that cave bears hybridized with brown bears (Ursus arctos) during the Pleistocene. We develop an approach to assess both the directionality and relative timing of gene flow. We find that segments of cave bear DNA still persist in the genomes of living brown bears, with cave bears contributing 0.9 to 2.4% of the genomes of all brown bears investigated. Our results show that even though extinction is typically considered as absolute, following admixture, fragments of the gene pool of extinct species can survive for tens of thousands of years in the genomes of extant recipient species.

Consensify (2020)

Barlow, Axel ; Hartmann, Stefanie ; Gonzalez, Javier ; Hofreiter, Michael ; Paijmans, Johanna L. A.

A standard practise in palaeogenome analysis is the conversion of mapped short read data into pseudohaploid sequences, frequently by selecting a single high-quality nucleotide at random from the stack of mapped reads. This controls for biases due to differential sequencing coverage, but it does not control for differential rates and types of sequencing error, which are frequently large and variable in datasets obtained from ancient samples. These errors have the potential to distort phylogenetic and population clustering analyses, and to mislead tests of admixture using D statistics. We introduce Consensify, a method for generating pseudohaploid sequences, which controls for biases resulting from differential sequencing coverage while greatly reducing error rates. The error correction is derived directly from the data itself, without the requirement for additional genomic resources or simplifying assumptions such as contemporaneous sampling. For phylogenetic and population clustering analysis, we find that Consensify is less affected by artefacts than methods based on single read sampling. For D statistics, Consensify is more resistant to false positives and appears to be less affected by biases resulting from different laboratory protocols than other frequently used methods. Although Consensify is developed with palaeogenomic data in mind, it is applicable for any low to medium coverage short read datasets. We predict that Consensify will be a useful tool for future studies of palaeogenomes.

Consensify (2020)

Barlow, Axel ; Hartmann, Stefanie ; Gonzalez, Javier ; Hofreiter, Michael ; Paijmans, Johanna L. A.

A standard practise in palaeogenome analysis is the conversion of mapped short read data into pseudohaploid sequences, frequently by selecting a single high-quality nucleotide at random from the stack of mapped reads. This controls for biases due to differential sequencing coverage, but it does not control for differential rates and types of sequencing error, which are frequently large and variable in datasets obtained from ancient samples. These errors have the potential to distort phylogenetic and population clustering analyses, and to mislead tests of admixture using D statistics. We introduce Consensify, a method for generating pseudohaploid sequences, which controls for biases resulting from differential sequencing coverage while greatly reducing error rates. The error correction is derived directly from the data itself, without the requirement for additional genomic resources or simplifying assumptions such as contemporaneous sampling. For phylogenetic and population clustering analysis, we find that Consensify is less affected by artefacts than methods based on single read sampling. For D statistics, Consensify is more resistant to false positives and appears to be less affected by biases resulting from different laboratory protocols than other frequently used methods. Although Consensify is developed with palaeogenomic data in mind, it is applicable for any low to medium coverage short read datasets. We predict that Consensify will be a useful tool for future studies of palaeogenomes.

Identification of sperm proteins as candidate biomarkers for the analysis of reproductive isolation in Mytilus: a case study for the enkurin locus (2012)

Bartel, Manuela ; Hartmann, Stefanie ; Lehmann, Karola ; Postel, Kai ; Quesada, Humberto ; Philipp, Eva E. R. ; Heilmann, Katja ; Micheel, Burkhard ; Stuckas, Heiko

Sperm proteins of the marine sessile mussels of the Mytilus edulis species complex are models to investigate reproductive isolation and speciation. This study aimed at identifying sperm proteins and their corresponding genes. This was aided by the use of monoclonal antibodies that preferentially bind to yet unknown sperm molecules. By identifying their target molecules, this approach identified proteins with relevance to Mytilus sperm function. This procedure identified 16 proteins, for example, enkurin, laminin, porin and heat shock proteins. The potential use of these proteins as genetic markers to study reproductive isolation is exemplified by analysing the enkurin locus. Enkurin evolution is driven by purifying selection, the locus displays high levels of intraspecific variation and species-specific alleles group in distinct phylogenetic clusters. These findings characterize enkurin as informative candidate biomarker for analyses of clinal variation and differential introgression in hybrid zones, for example, to understand determinants of reproductive isolation in Baltic Mytilus populations.

On the phylogenetic position of Myzostomida : can 77 genes get it wrong? (2009)

Bleidorn, Christoph ; Podsiadlowski, Lars ; Zhong, Min ; Eeckhaut, Igor ; Hartmann, Stefanie ; Halanych, Kenneth M. ; Tiedemann, Ralph

Background: Phylogenomic analyses recently became popular to address questions about deep metazoan phylogeny. Ribosomal proteins (RP) dominate many of these analyses or are, in some cases, the only genes included. Despite initial hopes, hylogenomic analyses including tens to hundreds of genes still fail to robustly place many bilaterian taxa. Results: Using the phylogenetic position of myzostomids as an example, we show that phylogenies derived from RP genes and mitochondrial genes produce incongruent results. Whereas the former support a position within a clade of platyzoan taxa, mitochondrial data recovers an annelid affinity, which is strongly supported by the gene order data and is congruent with morphology. Using hypothesis testing, our RP data significantly rejects the annelids affinity, whereas a platyzoan relationship is significantly rejected by the mitochondrial data. Conclusion: We conclude (i) that reliance of a set of markers belonging to a single class of macromolecular complexes might bias the analysis, and (ii) that concatenation of all available data might introduce conflicting signal into phylogenetic analyses. We therefore strongly recommend testing for data incongruence in phylogenomic analyses. Furthermore, judging all available data, we consider the annelid affinity hypothesis more plausible than a possible platyzoan affinity for myzostomids, and suspect long branch attraction is influencing the RP data. However, this hypothesis needs further confirmation by future analyses.

Permanent genetic resources added to molecular ecology resources database 1 April 2011-31 May 2011 (2011)

This article documents the addition of 92 microsatellite marker loci to the Molecular Ecology Resources Database. Loci were developed for the following species: Anopheles minimus, An. sinensis, An. dirus, Calephelis mutica, Lutjanus kasmira, Murella muralis and Orchestia montagui. These loci were cross-tested on the following species: Calephelis arizonensi, Calephelis borealis, Calephelis nemesis, Calephelis virginiensis and Lutjanus bengalensis.

Genome-Scale Phylogenetics inferring the plant tree of life from 18,896 gene trees (2011)

Burleigh, J. Gordon ; Bansal, Mukul S. ; Eulenstein, Oliver ; Hartmann, Stefanie ; Wehe, Andre ; Vision, Todd J.

Phylogenetic analyses using genome-scale data sets must confront incongruence among gene trees, which in plants is exacerbated by frequent gene duplications and losses. Gene tree parsimony (GTP) is a phylogenetic optimization criterion in which a species tree that minimizes the number of gene duplications induced among a set of gene trees is selected. The run time performance of previous implementations has limited its use on large-scale data sets. We used new software that incorporates recent algorithmic advances to examine the performance of GTP on a plant data set consisting of 18,896 gene trees containing 510,922 protein sequences from 136 plant taxa (giving a combined alignment length of >2.9 million characters). The relationships inferred from the GTP analysis were largely consistent with previous large-scale studies of backbone plant phylogeny and resolved some controversial nodes. The placement of taxa that were present in few gene trees generally varied the most among GTP bootstrap replicates. Excluding these taxa either before or after the GTP analysis revealed high levels of phylogenetic support across plants. The analyses supported magnoliids sister to a eudicot + monocot clade and did not support the eurosid I and II clades. This study presents a nuclear genomic perspective on the broad-scale phylogenic relationships among plants, and it demonstrates that nuclear genes with a history of duplication and loss can be phylogenetically informative for resolving the plant tree of life.

A hierarchical model for incomplete alignments in phylogenetic inference (2009)

Cheng, Fuxia ; Hartmann, Stefanie ; Gupta, Mayetri ; Ibrahim, Joseph G. ; Vision, Todd J.

Motivation: Full-length DNA and protein sequences that span the entire length of a gene are ideally used for multiple sequence alignments (MSAs) and the subsequent inference of their relationships. Frequently, however, MSAs contain a substantial amount of missing data. For example, expressed sequence tags (ESTs), which are partial sequences of expressed genes, are the predominant source of sequence data for many organisms. The patterns of missing data typical for EST-derived alignments greatly compromise the accuracy of estimated phylogenies. Results: We present a statistical method for inferring phylogenetic trees from EST-based incomplete MSA data. We propose a class of hierarchical models for modeling pairwise distances between the sequences, and develop a fully Bayesian approach for estimation of the model parameters. Once the distance matrix is estimated, the phylogenetic tree may be constructed by applying neighbor-joining (or any other algorithm of choice). We also show that maximizing the marginal likelihood from the Bayesian approach yields similar results to a pro. le likelihood estimation. The proposed methods are illustrated using simulated protein families, for which the true phylogeny is known, and one real protein family.

Functional insights from the GC-poor genomes of two aphid parasitoids, Aphidius ervi and Lysiphlebus fabarum (2020)

Background Parasitoid wasps have fascinating life cycles and play an important role in trophic networks, yet little is known about their genome content and function. Parasitoids that infect aphids are an important group with the potential for biological control. Their success depends on adapting to develop inside aphids and overcoming both host aphid defenses and their protective endosymbionts. Results We present the de novo genome assemblies, detailed annotation, and comparative analysis of two closely related parasitoid wasps that target pest aphids: Aphidius ervi and Lysiphlebus fabarum (Hymenoptera: Braconidae: Aphidiinae). The genomes are small (139 and 141 Mbp) and the most AT-rich reported thus far for any arthropod (GC content: 25.8 and 23.8%). This nucleotide bias is accompanied by skewed codon usage and is stronger in genes with adult-biased expression. AT-richness may be the consequence of reduced genome size, a near absence of DNA methylation, and energy efficiency. We identify missing desaturase genes, whose absence may underlie mimicry in the cuticular hydrocarbon profile of L. fabarum. We highlight key gene groups including those underlying venom composition, chemosensory perception, and sex determination, as well as potential losses in immune pathway genes. Conclusions These findings are of fundamental interest for insect evolution and biological control applications. They provide a strong foundation for further functional studies into coevolution between parasitoids and their hosts. Both genomes are available at https://bipaa.genouest.org.

Filtern

Volltext vorhanden

Autor*in

Erscheinungsjahr

Dokumenttyp

Sprache

Gehört zur Bibliographie

Schlagworte

Institut

40 Treffer