TY  - JOUR
A1  - Autenrieth, Marijke
A1  - Hartmann, Stefanie
A1  - Lah, Ljerka
A1  - Roos, Anna
A1  - Dennis, Alice B.
A1  - Tiedemann, Ralph
T1  - High-quality whole-genome sequence of an abundant Holarctic odontocete, the harbour porpoise (Phocoena phocoena)
JF  - Molecular ecology resources
N2  - The harbour porpoise (Phocoena phocoena) is a highly mobile cetacean found across the Northern hemisphere. It occurs in coastal waters and inhabits basins that vary broadly in salinity, temperature and food availability. These diverse habitats could drive subtle differentiation among populations, but examination of this would be best conducted with a robust reference genome. Here, we report the first harbour porpoise genome, assembled de novo from an individual originating in the Kattegat Sea (Sweden). The genome is one of the most complete cetacean genomes currently available, with a total size of 2.39 Gb and 50% of the total length found in just 34 scaffolds. Using 122 of the longest scaffolds, we were able to show high levels of synteny with the genome of the domestic cattle (Bos taurus). Our draft annotation comprises 22,154 predicted genes, which we further annotated through matches to the NCBI nucleotide database, GO categorization and motif prediction. Within the predicted genes, we have confirmed the presence of >20 genes or gene families that have been associated with adaptive evolution in other cetaceans. Overall, this genome assembly and draft annotation represent a crucial addition to the genomic resources currently available for the study of porpoises and Phocoenidae evolution, phylogeny and conservation.
KW  - cetaceans
KW  - genomics/proteomics
KW  - mammals
KW  - molecular evolution
Y1  - 2018
U6  - https://doi.org/10.1111/1755-0998.12932
SN  - 1755-098X
SN  - 1755-0998
VL  - 18
IS  - 6
SP  - 1469
EP  - 1481
PB  - Wiley
CY  - Hoboken
ER  - 
TY  - JOUR
A1  - Barlow, Axel
A1  - Cahill, James A.
A1  - Hartmann, Stefanie
A1  - Theunert, Christoph
A1  - Xenikoudakis, Georgios
A1  - Gonzalez-Fortes, Gloria M.
A1  - Paijmans, Johanna L. A.
A1  - Rabeder, Gernot
A1  - Frischauf, Christine
A1  - Garcia-Vazquez, Ana
A1  - Murtskhvaladze, Marine
A1  - Saarma, Urmas
A1  - Anijalg, Peeter
A1  - Skrbinsek, Tomaz
A1  - Bertorelle, Giorgio
A1  - Gasparian, Boris
A1  - Bar-Oz, Guy
A1  - Pinhasi, Ron
A1  - Slatkin, Montgomery
A1  - Dalen, Love
A1  - Shapiro, Beth
A1  - Hofreiter, Michael
T1  - Partial genomic survival of cave bears in living brown bears
JF  - Nature Ecology & Evolution
N2  - Although many large mammal species went extinct at the end of the Pleistocene epoch, their DNA may persist due to past episodes of interspecies admixture. However, direct empirical evidence of the persistence of ancient alleles remains scarce. Here, we present multifold coverage genomic data from four Late Pleistocene cave bears (Ursus spelaeus complex) and show that cave bears hybridized with brown bears (Ursus arctos) during the Pleistocene. We develop an approach to assess both the directionality and relative timing of gene flow. We find that segments of cave bear DNA still persist in the genomes of living brown bears, with cave bears contributing 0.9 to 2.4% of the genomes of all brown bears investigated. Our results show that even though extinction is typically considered as absolute, following admixture, fragments of the gene pool of extinct species can survive for tens of thousands of years in the genomes of extant recipient species.
Y1  - 2018
U6  - https://doi.org/10.1038/s41559-018-0654-8
SN  - 2397-334X
VL  - 2
IS  - 10
SP  - 1563
EP  - 1570
PB  - Nature Publ. Group
CY  - London
ER  - 
TY  - GEN
A1  - Barlow, Axel
A1  - Hartmann, Stefanie
A1  - Gonzalez, Javier
A1  - Hofreiter, Michael
A1  - Paijmans, Johanna L. A.
T1  - Consensify
BT  - a method for generating pseudohaploid genome sequences from palaeogenomic datasets with reduced error rates
T2  - Postprints der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe
N2  - A standard practise in palaeogenome analysis is the conversion of mapped short read data into pseudohaploid sequences, frequently by selecting a single high-quality nucleotide at random from the stack of mapped reads. This controls for biases due to differential sequencing coverage, but it does not control for differential rates and types of sequencing error, which are frequently large and variable in datasets obtained from ancient samples. These errors have the potential to distort phylogenetic and population clustering analyses, and to mislead tests of admixture using D statistics. We introduce Consensify, a method for generating pseudohaploid sequences, which controls for biases resulting from differential sequencing coverage while greatly reducing error rates. The error correction is derived directly from the data itself, without the requirement for additional genomic resources or simplifying assumptions such as contemporaneous sampling. For phylogenetic and population clustering analysis, we find that Consensify is less affected by artefacts than methods based on single read sampling. For D statistics, Consensify is more resistant to false positives and appears to be less affected by biases resulting from different laboratory protocols than other frequently used methods. Although Consensify is developed with palaeogenomic data in mind, it is applicable for any low to medium coverage short read datasets. We predict that Consensify will be a useful tool for future studies of palaeogenomes.
T3  - Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe - 1033 
KW  - palaeogenomics
KW  - ancient DNA
KW  - sequencing error
KW  - error reduction
KW  - D statistics
KW  - bioinformatics
Y1  - 2020
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus4-472521
SN  - 1866-8372
IS  - 1033
ER  - 
TY  - JOUR
A1  - Barlow, Axel
A1  - Hartmann, Stefanie
A1  - Gonzalez, Javier
A1  - Hofreiter, Michael
A1  - Paijmans, Johanna L. A.
T1  - Consensify
BT  - a method for generating pseudohaploid genome sequences from palaeogenomic datasets with reduced error rates
JF  - Genes / Molecular Diversity Preservation International
N2  - A standard practise in palaeogenome analysis is the conversion of mapped short read data into pseudohaploid sequences, frequently by selecting a single high-quality nucleotide at random from the stack of mapped reads. This controls for biases due to differential sequencing coverage, but it does not control for differential rates and types of sequencing error, which are frequently large and variable in datasets obtained from ancient samples. These errors have the potential to distort phylogenetic and population clustering analyses, and to mislead tests of admixture using D statistics. We introduce Consensify, a method for generating pseudohaploid sequences, which controls for biases resulting from differential sequencing coverage while greatly reducing error rates. The error correction is derived directly from the data itself, without the requirement for additional genomic resources or simplifying assumptions such as contemporaneous sampling. For phylogenetic and population clustering analysis, we find that Consensify is less affected by artefacts than methods based on single read sampling. For D statistics, Consensify is more resistant to false positives and appears to be less affected by biases resulting from different laboratory protocols than other frequently used methods. Although Consensify is developed with palaeogenomic data in mind, it is applicable for any low to medium coverage short read datasets. We predict that Consensify will be a useful tool for future studies of palaeogenomes.
KW  - palaeogenomics
KW  - ancient DNA
KW  - sequencing error
KW  - error reduction
KW  - D statistics
KW  - bioinformatics
Y1  - 2020
U6  - https://doi.org/10.3390/genes11010050
SN  - 2073-4425
VL  - 11
IS  - 1
PB  - MDPI
CY  - Basel
ER  - 
TY  - JOUR
A1  - Bartel, Manuela
A1  - Hartmann, Stefanie
A1  - Lehmann, Karola
A1  - Postel, Kai
A1  - Quesada, Humberto
A1  - Philipp, Eva E. R.
A1  - Heilmann, Katja
A1  - Micheel, Burkhard
A1  - Stuckas, Heiko
T1  - Identification of sperm proteins as candidate biomarkers for the analysis of reproductive isolation in Mytilus: a case study for the enkurin locus
JF  - Marine biology : international journal on life in oceans and coastal waters
N2  - Sperm proteins of the marine sessile mussels of the Mytilus edulis species complex are models to investigate reproductive isolation and speciation. This study aimed at identifying sperm proteins and their corresponding genes. This was aided by the use of monoclonal antibodies that preferentially bind to yet unknown sperm molecules. By identifying their target molecules, this approach identified proteins with relevance to Mytilus sperm function. This procedure identified 16 proteins, for example, enkurin, laminin, porin and heat shock proteins. The potential use of these proteins as genetic markers to study reproductive isolation is exemplified by analysing the enkurin locus. Enkurin evolution is driven by purifying selection, the locus displays high levels of intraspecific variation and species-specific alleles group in distinct phylogenetic clusters. These findings characterize enkurin as informative candidate biomarker for analyses of clinal variation and differential introgression in hybrid zones, for example, to understand determinants of reproductive isolation in Baltic Mytilus populations.
Y1  - 2012
U6  - https://doi.org/10.1007/s00227-012-2005-7
SN  - 0025-3162
VL  - 159
IS  - 10
SP  - 2195
EP  - 2207
PB  - Springer
CY  - New York
ER  - 
TY  - GEN
A1  - Bleidorn, Christoph
A1  - Podsiadlowski, Lars
A1  - Zhong, Min
A1  - Eeckhaut, Igor
A1  - Hartmann, Stefanie
A1  - Halanych, Kenneth M.
A1  - Tiedemann, Ralph
T1  - On the phylogenetic position of Myzostomida : can 77 genes get it wrong?
N2  - Background: Phylogenomic analyses recently became popular to address questions about deep metazoan phylogeny. Ribosomal proteins (RP) dominate many of these analyses or are, in some cases, the only genes included. Despite initial hopes, hylogenomic analyses including tens to hundreds of genes still fail to robustly place many bilaterian taxa. Results: Using the phylogenetic position of myzostomids as an example, we show that phylogenies derived from RP genes and mitochondrial genes produce incongruent results. Whereas the former support a position within a clade of platyzoan taxa, mitochondrial data recovers an annelid affinity, which is strongly supported by the gene order data and is congruent with morphology. Using hypothesis testing, our RP data significantly rejects the annelids affinity, whereas a platyzoan relationship is significantly rejected by the mitochondrial data. Conclusion: We conclude (i) that reliance of a set of markers belonging to a single class of macromolecular complexes might bias the analysis, and (ii) that concatenation of all available data might introduce conflicting signal into phylogenetic analyses. We therefore strongly recommend testing for data incongruence in phylogenomic analyses. Furthermore, judging all available data, we consider the annelid affinity hypothesis more plausible than a possible platyzoan affinity for myzostomids, and suspect long branch attraction is influencing the RP data. However, this hypothesis needs further confirmation by future analyses.
T3  - Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe - paper 123 
KW  - Cirriferum myzostomida
KW  - Mitochondrial genomes
KW  - Transfer-rna
KW  - Data sets
KW  - Sequence
Y1  - 2009
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus-44893
ER  - 
TY  - JOUR
A1  - Bonizzoni, Mariangela
A1  - Bourjea, Jerome
A1  - Chen, Bin
A1  - Crain, B. J.
A1  - Cui, Liwang
A1  - Fiorentino, V.
A1  - Hartmann, Stefanie
A1  - Hendricks, S.
A1  - Ketmaier, Valerio
A1  - Ma, Xiaoguang
A1  - Muths, Delphine
A1  - Pavesi, Laura
A1  - Pfautsch, Simone
A1  - Rieger, M. A.
A1  - Santonastaso, T.
A1  - Sattabongkot, Jetsumon
A1  - Taron, C. H.
A1  - Taron, D. J.
A1  - Tiedemann, Ralph
A1  - Yan, Guiyun
A1  - Zheng, Bin
A1  - Zhong, Daibin
T1  - Permanent genetic resources added to molecular ecology resources database 1 April 2011-31 May 2011
JF  - Molecular ecology resources
N2  - This article documents the addition of 92 microsatellite marker loci to the Molecular Ecology Resources Database. Loci were developed for the following species: Anopheles minimus, An. sinensis, An. dirus, Calephelis mutica, Lutjanus kasmira, Murella muralis and Orchestia montagui. These loci were cross-tested on the following species: Calephelis arizonensi, Calephelis borealis, Calephelis nemesis, Calephelis virginiensis and Lutjanus bengalensis.
Y1  - 2011
U6  - https://doi.org/10.1111/j.1755-0998.2011.03046.x
SN  - 1755-098X
VL  - 11
IS  - 5
SP  - 935
EP  - 936
PB  - Wiley-Blackwell
CY  - Malden
ER  - 
TY  - JOUR
A1  - Burleigh, J. Gordon
A1  - Bansal, Mukul S.
A1  - Eulenstein, Oliver
A1  - Hartmann, Stefanie
A1  - Wehe, Andre
A1  - Vision, Todd J.
T1  - Genome-Scale Phylogenetics inferring the plant tree of life from 18,896 gene trees
JF  - Systematic biology
N2  - Phylogenetic analyses using genome-scale data sets must confront incongruence among gene trees, which in plants is exacerbated by frequent gene duplications and losses. Gene tree parsimony (GTP) is a phylogenetic optimization criterion in which a species tree that minimizes the number of gene duplications induced among a set of gene trees is selected. The run time performance of previous implementations has limited its use on large-scale data sets. We used new software that incorporates recent algorithmic advances to examine the performance of GTP on a plant data set consisting of 18,896 gene trees containing 510,922 protein sequences from 136 plant taxa (giving a combined alignment length of >2.9 million characters). The relationships inferred from the GTP analysis were largely consistent with previous large-scale studies of backbone plant phylogeny and resolved some controversial nodes. The placement of taxa that were present in few gene trees generally varied the most among GTP bootstrap replicates. Excluding these taxa either before or after the GTP analysis revealed high levels of phylogenetic support across plants. The analyses supported magnoliids sister to a eudicot + monocot clade and did not support the eurosid I and II clades. This study presents a nuclear genomic perspective on the broad-scale phylogenic relationships among plants, and it demonstrates that nuclear genes with a history of duplication and loss can be phylogenetically informative for resolving the plant tree of life.
KW  - Gene tree-species tree reconciliation
KW  - gene tree parsimony
KW  - plant phylogeny
KW  - phylogenomics
Y1  - 2011
U6  - https://doi.org/10.1093/sysbio/syq072
SN  - 1063-5157
VL  - 60
IS  - 2
SP  - 117
EP  - 125
PB  - Oxford Univ. Press
CY  - Oxford
ER  - 
TY  - JOUR
A1  - Cheng, Fuxia
A1  - Hartmann, Stefanie
A1  - Gupta, Mayetri
A1  - Ibrahim, Joseph G.
A1  - Vision, Todd J.
T1  - A hierarchical model for incomplete alignments in phylogenetic inference
N2  - Motivation: Full-length DNA and protein sequences that span the entire length of a gene are ideally used for multiple sequence alignments (MSAs) and the subsequent inference of their relationships. Frequently, however, MSAs contain a substantial amount of missing data. For example, expressed sequence tags (ESTs), which are partial sequences of expressed genes, are the predominant source of sequence data for many organisms. The patterns of missing data typical for EST-derived alignments greatly compromise the accuracy of estimated phylogenies. Results: We present a statistical method for inferring phylogenetic trees from EST-based incomplete MSA data. We propose a class of hierarchical models for modeling pairwise distances between the sequences, and develop a fully Bayesian approach for estimation of the model parameters. Once the distance matrix is estimated, the phylogenetic tree may be constructed by applying neighbor-joining (or any other algorithm of choice). We also show that maximizing the marginal likelihood from the Bayesian approach yields similar results to a pro. le likelihood estimation. The proposed methods are illustrated using simulated protein families, for which the true phylogeny is known, and one real protein family.
Y1  - 2009
UR  - http://bioinformatics.oxfordjournals.org/
U6  - https://doi.org/10.1093/bioinformatics/btp015
SN  - 1367-4803
ER  - 
TY  - JOUR
A1  - Dennis, Alice B.
A1  - Ballesteros, Gabriel I.
A1  - Robin, Stéphanie
A1  - Schrader, Lukas
A1  - Bast, Jens
A1  - Berghöfer, Jan
A1  - Beukeboom, Leo W.
A1  - Belghazi, Maya
A1  - Bretaudeau, Anthony
A1  - Buellesbach, Jan
A1  - Cash, Elizabeth
A1  - Colinet, Dominique
A1  - Dumas, Zoé
A1  - Errbii, Mohammed
A1  - Falabella, Patrizia
A1  - Gatti, Jean-Luc
A1  - Geuverink, Elzemiek
A1  - Gibson, Joshua D.
A1  - Hertaeg, Corinne
A1  - Hartmann, Stefanie
A1  - Jacquin-Joly, Emmanuelle
A1  - Lammers, Mark
A1  - Lavandero, Blas I.
A1  - Lindenbaum, Ina
A1  - Massardier-Galata, Lauriane
A1  - Meslin, Camille
A1  - Montagné, Nicolas
A1  - Pak, Nina
A1  - Poirié, Marylène
A1  - Salvia, Rosanna
A1  - Smith, Chris R.
A1  - Tagu, Denis
A1  - Tares, Sophie
A1  - Vogel, Heiko
A1  - Schwander, Tanja
A1  - Simon, Jean-Christophe
A1  - Figueroa, Christian C.
A1  - Vorburger, Christoph
A1  - Legeai, Fabrice
A1  - Gadau, Jürgen
T1  - Functional insights from the GC-poor genomes of two aphid parasitoids, Aphidius ervi and Lysiphlebus fabarum
JF  - BMC Genomics
N2  - Background

Parasitoid wasps have fascinating life cycles and play an important role in trophic networks, yet little is known about their genome content and function. Parasitoids that infect aphids are an important group with the potential for biological control. Their success depends on adapting to develop inside aphids and overcoming both host aphid defenses and their protective endosymbionts.

Results

We present the de novo genome assemblies, detailed annotation, and comparative analysis of two closely related parasitoid wasps that target pest aphids: Aphidius ervi and Lysiphlebus fabarum (Hymenoptera: Braconidae: Aphidiinae). The genomes are small (139 and 141 Mbp) and the most AT-rich reported thus far for any arthropod (GC content: 25.8 and 23.8%). This nucleotide bias is accompanied by skewed codon usage and is stronger in genes with adult-biased expression. AT-richness may be the consequence of reduced genome size, a near absence of DNA methylation, and energy efficiency. We identify missing desaturase genes, whose absence may underlie mimicry in the cuticular hydrocarbon profile of L. fabarum. We highlight key gene groups including those underlying venom composition, chemosensory perception, and sex determination, as well as potential losses in immune pathway genes.

Conclusions

These findings are of fundamental interest for insect evolution and biological control applications. They provide a strong foundation for further functional studies into coevolution between parasitoids and their hosts. Both genomes are available at https://bipaa.genouest.org.
KW  - Parasitoid wasp
KW  - Aphid host
KW  - Aphidius ervi
KW  - Lysiphlebus fabarum
KW  - de novo genome assembly
KW  - DNA methylation loss
KW  - Chemosensory genes
KW  - Venom proteins
KW  - GC content
KW  - Toll and Imd pathways
Y1  - 2020
U6  - https://doi.org/10.1186/s12864-020-6764-0
SN  - 1471-2164
VL  - 21
PB  - BioMed Central
CY  - London
ER  -