TY - JOUR A1 - Autenrieth, Marijke A1 - Hartmann, Stefanie A1 - Lah, Ljerka A1 - Roos, Anna A1 - Dennis, Alice B. A1 - Tiedemann, Ralph T1 - High-quality whole-genome sequence of an abundant Holarctic odontocete, the harbour porpoise (Phocoena phocoena) JF - Molecular ecology resources N2 - The harbour porpoise (Phocoena phocoena) is a highly mobile cetacean found across the Northern hemisphere. It occurs in coastal waters and inhabits basins that vary broadly in salinity, temperature and food availability. These diverse habitats could drive subtle differentiation among populations, but examination of this would be best conducted with a robust reference genome. Here, we report the first harbour porpoise genome, assembled de novo from an individual originating in the Kattegat Sea (Sweden). The genome is one of the most complete cetacean genomes currently available, with a total size of 2.39 Gb and 50% of the total length found in just 34 scaffolds. Using 122 of the longest scaffolds, we were able to show high levels of synteny with the genome of the domestic cattle (Bos taurus). Our draft annotation comprises 22,154 predicted genes, which we further annotated through matches to the NCBI nucleotide database, GO categorization and motif prediction. Within the predicted genes, we have confirmed the presence of >20 genes or gene families that have been associated with adaptive evolution in other cetaceans. Overall, this genome assembly and draft annotation represent a crucial addition to the genomic resources currently available for the study of porpoises and Phocoenidae evolution, phylogeny and conservation. KW - cetaceans KW - genomics/proteomics KW - mammals KW - molecular evolution Y1 - 2018 U6 - https://doi.org/10.1111/1755-0998.12932 SN - 1755-098X SN - 1755-0998 VL - 18 IS - 6 SP - 1469 EP - 1481 PB - Wiley CY - Hoboken ER - TY - JOUR A1 - Barlow, Axel A1 - Cahill, James A. A1 - Hartmann, Stefanie A1 - Theunert, Christoph A1 - Xenikoudakis, Georgios A1 - Gonzalez-Fortes, Gloria M. A1 - Paijmans, Johanna L. A. A1 - Rabeder, Gernot A1 - Frischauf, Christine A1 - Garcia-Vazquez, Ana A1 - Murtskhvaladze, Marine A1 - Saarma, Urmas A1 - Anijalg, Peeter A1 - Skrbinsek, Tomaz A1 - Bertorelle, Giorgio A1 - Gasparian, Boris A1 - Bar-Oz, Guy A1 - Pinhasi, Ron A1 - Slatkin, Montgomery A1 - Dalen, Love A1 - Shapiro, Beth A1 - Hofreiter, Michael T1 - Partial genomic survival of cave bears in living brown bears JF - Nature Ecology & Evolution N2 - Although many large mammal species went extinct at the end of the Pleistocene epoch, their DNA may persist due to past episodes of interspecies admixture. However, direct empirical evidence of the persistence of ancient alleles remains scarce. Here, we present multifold coverage genomic data from four Late Pleistocene cave bears (Ursus spelaeus complex) and show that cave bears hybridized with brown bears (Ursus arctos) during the Pleistocene. We develop an approach to assess both the directionality and relative timing of gene flow. We find that segments of cave bear DNA still persist in the genomes of living brown bears, with cave bears contributing 0.9 to 2.4% of the genomes of all brown bears investigated. Our results show that even though extinction is typically considered as absolute, following admixture, fragments of the gene pool of extinct species can survive for tens of thousands of years in the genomes of extant recipient species. Y1 - 2018 U6 - https://doi.org/10.1038/s41559-018-0654-8 SN - 2397-334X VL - 2 IS - 10 SP - 1563 EP - 1570 PB - Nature Publ. Group CY - London ER - TY - GEN A1 - Barlow, Axel A1 - Hartmann, Stefanie A1 - Gonzalez, Javier A1 - Hofreiter, Michael A1 - Paijmans, Johanna L. A. T1 - Consensify BT - a method for generating pseudohaploid genome sequences from palaeogenomic datasets with reduced error rates T2 - Postprints der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe N2 - A standard practise in palaeogenome analysis is the conversion of mapped short read data into pseudohaploid sequences, frequently by selecting a single high-quality nucleotide at random from the stack of mapped reads. This controls for biases due to differential sequencing coverage, but it does not control for differential rates and types of sequencing error, which are frequently large and variable in datasets obtained from ancient samples. These errors have the potential to distort phylogenetic and population clustering analyses, and to mislead tests of admixture using D statistics. We introduce Consensify, a method for generating pseudohaploid sequences, which controls for biases resulting from differential sequencing coverage while greatly reducing error rates. The error correction is derived directly from the data itself, without the requirement for additional genomic resources or simplifying assumptions such as contemporaneous sampling. For phylogenetic and population clustering analysis, we find that Consensify is less affected by artefacts than methods based on single read sampling. For D statistics, Consensify is more resistant to false positives and appears to be less affected by biases resulting from different laboratory protocols than other frequently used methods. Although Consensify is developed with palaeogenomic data in mind, it is applicable for any low to medium coverage short read datasets. We predict that Consensify will be a useful tool for future studies of palaeogenomes. T3 - Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe - 1033 KW - palaeogenomics KW - ancient DNA KW - sequencing error KW - error reduction KW - D statistics KW - bioinformatics Y1 - 2020 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus4-472521 SN - 1866-8372 IS - 1033 ER - TY - JOUR A1 - Barlow, Axel A1 - Hartmann, Stefanie A1 - Gonzalez, Javier A1 - Hofreiter, Michael A1 - Paijmans, Johanna L. A. T1 - Consensify BT - a method for generating pseudohaploid genome sequences from palaeogenomic datasets with reduced error rates JF - Genes / Molecular Diversity Preservation International N2 - A standard practise in palaeogenome analysis is the conversion of mapped short read data into pseudohaploid sequences, frequently by selecting a single high-quality nucleotide at random from the stack of mapped reads. This controls for biases due to differential sequencing coverage, but it does not control for differential rates and types of sequencing error, which are frequently large and variable in datasets obtained from ancient samples. These errors have the potential to distort phylogenetic and population clustering analyses, and to mislead tests of admixture using D statistics. We introduce Consensify, a method for generating pseudohaploid sequences, which controls for biases resulting from differential sequencing coverage while greatly reducing error rates. The error correction is derived directly from the data itself, without the requirement for additional genomic resources or simplifying assumptions such as contemporaneous sampling. For phylogenetic and population clustering analysis, we find that Consensify is less affected by artefacts than methods based on single read sampling. For D statistics, Consensify is more resistant to false positives and appears to be less affected by biases resulting from different laboratory protocols than other frequently used methods. Although Consensify is developed with palaeogenomic data in mind, it is applicable for any low to medium coverage short read datasets. We predict that Consensify will be a useful tool for future studies of palaeogenomes. KW - palaeogenomics KW - ancient DNA KW - sequencing error KW - error reduction KW - D statistics KW - bioinformatics Y1 - 2020 U6 - https://doi.org/10.3390/genes11010050 SN - 2073-4425 VL - 11 IS - 1 PB - MDPI CY - Basel ER - TY - JOUR A1 - Bartel, Manuela A1 - Hartmann, Stefanie A1 - Lehmann, Karola A1 - Postel, Kai A1 - Quesada, Humberto A1 - Philipp, Eva E. R. A1 - Heilmann, Katja A1 - Micheel, Burkhard A1 - Stuckas, Heiko T1 - Identification of sperm proteins as candidate biomarkers for the analysis of reproductive isolation in Mytilus: a case study for the enkurin locus JF - Marine biology : international journal on life in oceans and coastal waters N2 - Sperm proteins of the marine sessile mussels of the Mytilus edulis species complex are models to investigate reproductive isolation and speciation. This study aimed at identifying sperm proteins and their corresponding genes. This was aided by the use of monoclonal antibodies that preferentially bind to yet unknown sperm molecules. By identifying their target molecules, this approach identified proteins with relevance to Mytilus sperm function. This procedure identified 16 proteins, for example, enkurin, laminin, porin and heat shock proteins. The potential use of these proteins as genetic markers to study reproductive isolation is exemplified by analysing the enkurin locus. Enkurin evolution is driven by purifying selection, the locus displays high levels of intraspecific variation and species-specific alleles group in distinct phylogenetic clusters. These findings characterize enkurin as informative candidate biomarker for analyses of clinal variation and differential introgression in hybrid zones, for example, to understand determinants of reproductive isolation in Baltic Mytilus populations. Y1 - 2012 U6 - https://doi.org/10.1007/s00227-012-2005-7 SN - 0025-3162 VL - 159 IS - 10 SP - 2195 EP - 2207 PB - Springer CY - New York ER - TY - GEN A1 - Bleidorn, Christoph A1 - Podsiadlowski, Lars A1 - Zhong, Min A1 - Eeckhaut, Igor A1 - Hartmann, Stefanie A1 - Halanych, Kenneth M. A1 - Tiedemann, Ralph T1 - On the phylogenetic position of Myzostomida : can 77 genes get it wrong? N2 - Background: Phylogenomic analyses recently became popular to address questions about deep metazoan phylogeny. Ribosomal proteins (RP) dominate many of these analyses or are, in some cases, the only genes included. Despite initial hopes, hylogenomic analyses including tens to hundreds of genes still fail to robustly place many bilaterian taxa. Results: Using the phylogenetic position of myzostomids as an example, we show that phylogenies derived from RP genes and mitochondrial genes produce incongruent results. Whereas the former support a position within a clade of platyzoan taxa, mitochondrial data recovers an annelid affinity, which is strongly supported by the gene order data and is congruent with morphology. Using hypothesis testing, our RP data significantly rejects the annelids affinity, whereas a platyzoan relationship is significantly rejected by the mitochondrial data. Conclusion: We conclude (i) that reliance of a set of markers belonging to a single class of macromolecular complexes might bias the analysis, and (ii) that concatenation of all available data might introduce conflicting signal into phylogenetic analyses. We therefore strongly recommend testing for data incongruence in phylogenomic analyses. Furthermore, judging all available data, we consider the annelid affinity hypothesis more plausible than a possible platyzoan affinity for myzostomids, and suspect long branch attraction is influencing the RP data. However, this hypothesis needs further confirmation by future analyses. T3 - Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe - paper 123 KW - Cirriferum myzostomida KW - Mitochondrial genomes KW - Transfer-rna KW - Data sets KW - Sequence Y1 - 2009 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus-44893 ER - TY - JOUR A1 - Bonizzoni, Mariangela A1 - Bourjea, Jerome A1 - Chen, Bin A1 - Crain, B. J. A1 - Cui, Liwang A1 - Fiorentino, V. A1 - Hartmann, Stefanie A1 - Hendricks, S. A1 - Ketmaier, Valerio A1 - Ma, Xiaoguang A1 - Muths, Delphine A1 - Pavesi, Laura A1 - Pfautsch, Simone A1 - Rieger, M. A. A1 - Santonastaso, T. A1 - Sattabongkot, Jetsumon A1 - Taron, C. H. A1 - Taron, D. J. A1 - Tiedemann, Ralph A1 - Yan, Guiyun A1 - Zheng, Bin A1 - Zhong, Daibin T1 - Permanent genetic resources added to molecular ecology resources database 1 April 2011-31 May 2011 JF - Molecular ecology resources N2 - This article documents the addition of 92 microsatellite marker loci to the Molecular Ecology Resources Database. Loci were developed for the following species: Anopheles minimus, An. sinensis, An. dirus, Calephelis mutica, Lutjanus kasmira, Murella muralis and Orchestia montagui. These loci were cross-tested on the following species: Calephelis arizonensi, Calephelis borealis, Calephelis nemesis, Calephelis virginiensis and Lutjanus bengalensis. Y1 - 2011 U6 - https://doi.org/10.1111/j.1755-0998.2011.03046.x SN - 1755-098X VL - 11 IS - 5 SP - 935 EP - 936 PB - Wiley-Blackwell CY - Malden ER - TY - JOUR A1 - Burleigh, J. Gordon A1 - Bansal, Mukul S. A1 - Eulenstein, Oliver A1 - Hartmann, Stefanie A1 - Wehe, Andre A1 - Vision, Todd J. T1 - Genome-Scale Phylogenetics inferring the plant tree of life from 18,896 gene trees JF - Systematic biology N2 - Phylogenetic analyses using genome-scale data sets must confront incongruence among gene trees, which in plants is exacerbated by frequent gene duplications and losses. Gene tree parsimony (GTP) is a phylogenetic optimization criterion in which a species tree that minimizes the number of gene duplications induced among a set of gene trees is selected. The run time performance of previous implementations has limited its use on large-scale data sets. We used new software that incorporates recent algorithmic advances to examine the performance of GTP on a plant data set consisting of 18,896 gene trees containing 510,922 protein sequences from 136 plant taxa (giving a combined alignment length of >2.9 million characters). The relationships inferred from the GTP analysis were largely consistent with previous large-scale studies of backbone plant phylogeny and resolved some controversial nodes. The placement of taxa that were present in few gene trees generally varied the most among GTP bootstrap replicates. Excluding these taxa either before or after the GTP analysis revealed high levels of phylogenetic support across plants. The analyses supported magnoliids sister to a eudicot + monocot clade and did not support the eurosid I and II clades. This study presents a nuclear genomic perspective on the broad-scale phylogenic relationships among plants, and it demonstrates that nuclear genes with a history of duplication and loss can be phylogenetically informative for resolving the plant tree of life. KW - Gene tree-species tree reconciliation KW - gene tree parsimony KW - plant phylogeny KW - phylogenomics Y1 - 2011 U6 - https://doi.org/10.1093/sysbio/syq072 SN - 1063-5157 VL - 60 IS - 2 SP - 117 EP - 125 PB - Oxford Univ. Press CY - Oxford ER - TY - JOUR A1 - Cheng, Fuxia A1 - Hartmann, Stefanie A1 - Gupta, Mayetri A1 - Ibrahim, Joseph G. A1 - Vision, Todd J. T1 - A hierarchical model for incomplete alignments in phylogenetic inference N2 - Motivation: Full-length DNA and protein sequences that span the entire length of a gene are ideally used for multiple sequence alignments (MSAs) and the subsequent inference of their relationships. Frequently, however, MSAs contain a substantial amount of missing data. For example, expressed sequence tags (ESTs), which are partial sequences of expressed genes, are the predominant source of sequence data for many organisms. The patterns of missing data typical for EST-derived alignments greatly compromise the accuracy of estimated phylogenies. Results: We present a statistical method for inferring phylogenetic trees from EST-based incomplete MSA data. We propose a class of hierarchical models for modeling pairwise distances between the sequences, and develop a fully Bayesian approach for estimation of the model parameters. Once the distance matrix is estimated, the phylogenetic tree may be constructed by applying neighbor-joining (or any other algorithm of choice). We also show that maximizing the marginal likelihood from the Bayesian approach yields similar results to a pro. le likelihood estimation. The proposed methods are illustrated using simulated protein families, for which the true phylogeny is known, and one real protein family. Y1 - 2009 UR - http://bioinformatics.oxfordjournals.org/ U6 - https://doi.org/10.1093/bioinformatics/btp015 SN - 1367-4803 ER - TY - JOUR A1 - Dennis, Alice B. A1 - Ballesteros, Gabriel I. A1 - Robin, Stéphanie A1 - Schrader, Lukas A1 - Bast, Jens A1 - Berghöfer, Jan A1 - Beukeboom, Leo W. A1 - Belghazi, Maya A1 - Bretaudeau, Anthony A1 - Buellesbach, Jan A1 - Cash, Elizabeth A1 - Colinet, Dominique A1 - Dumas, Zoé A1 - Errbii, Mohammed A1 - Falabella, Patrizia A1 - Gatti, Jean-Luc A1 - Geuverink, Elzemiek A1 - Gibson, Joshua D. A1 - Hertaeg, Corinne A1 - Hartmann, Stefanie A1 - Jacquin-Joly, Emmanuelle A1 - Lammers, Mark A1 - Lavandero, Blas I. A1 - Lindenbaum, Ina A1 - Massardier-Galata, Lauriane A1 - Meslin, Camille A1 - Montagné, Nicolas A1 - Pak, Nina A1 - Poirié, Marylène A1 - Salvia, Rosanna A1 - Smith, Chris R. A1 - Tagu, Denis A1 - Tares, Sophie A1 - Vogel, Heiko A1 - Schwander, Tanja A1 - Simon, Jean-Christophe A1 - Figueroa, Christian C. A1 - Vorburger, Christoph A1 - Legeai, Fabrice A1 - Gadau, Jürgen T1 - Functional insights from the GC-poor genomes of two aphid parasitoids, Aphidius ervi and Lysiphlebus fabarum JF - BMC Genomics N2 - Background Parasitoid wasps have fascinating life cycles and play an important role in trophic networks, yet little is known about their genome content and function. Parasitoids that infect aphids are an important group with the potential for biological control. Their success depends on adapting to develop inside aphids and overcoming both host aphid defenses and their protective endosymbionts. Results We present the de novo genome assemblies, detailed annotation, and comparative analysis of two closely related parasitoid wasps that target pest aphids: Aphidius ervi and Lysiphlebus fabarum (Hymenoptera: Braconidae: Aphidiinae). The genomes are small (139 and 141 Mbp) and the most AT-rich reported thus far for any arthropod (GC content: 25.8 and 23.8%). This nucleotide bias is accompanied by skewed codon usage and is stronger in genes with adult-biased expression. AT-richness may be the consequence of reduced genome size, a near absence of DNA methylation, and energy efficiency. We identify missing desaturase genes, whose absence may underlie mimicry in the cuticular hydrocarbon profile of L. fabarum. We highlight key gene groups including those underlying venom composition, chemosensory perception, and sex determination, as well as potential losses in immune pathway genes. Conclusions These findings are of fundamental interest for insect evolution and biological control applications. They provide a strong foundation for further functional studies into coevolution between parasitoids and their hosts. Both genomes are available at https://bipaa.genouest.org. KW - Parasitoid wasp KW - Aphid host KW - Aphidius ervi KW - Lysiphlebus fabarum KW - de novo genome assembly KW - DNA methylation loss KW - Chemosensory genes KW - Venom proteins KW - GC content KW - Toll and Imd pathways Y1 - 2020 U6 - https://doi.org/10.1186/s12864-020-6764-0 SN - 1471-2164 VL - 21 PB - BioMed Central CY - London ER -