TY - JOUR A1 - Autenrieth, Marijke A1 - Hartmann, Stefanie A1 - Lah, Ljerka A1 - Roos, Anna A1 - Dennis, Alice B. A1 - Tiedemann, Ralph T1 - High-quality whole-genome sequence of an abundant Holarctic odontocete, the harbour porpoise (Phocoena phocoena) JF - Molecular ecology resources N2 - The harbour porpoise (Phocoena phocoena) is a highly mobile cetacean found across the Northern hemisphere. It occurs in coastal waters and inhabits basins that vary broadly in salinity, temperature and food availability. These diverse habitats could drive subtle differentiation among populations, but examination of this would be best conducted with a robust reference genome. Here, we report the first harbour porpoise genome, assembled de novo from an individual originating in the Kattegat Sea (Sweden). The genome is one of the most complete cetacean genomes currently available, with a total size of 2.39 Gb and 50% of the total length found in just 34 scaffolds. Using 122 of the longest scaffolds, we were able to show high levels of synteny with the genome of the domestic cattle (Bos taurus). Our draft annotation comprises 22,154 predicted genes, which we further annotated through matches to the NCBI nucleotide database, GO categorization and motif prediction. Within the predicted genes, we have confirmed the presence of >20 genes or gene families that have been associated with adaptive evolution in other cetaceans. Overall, this genome assembly and draft annotation represent a crucial addition to the genomic resources currently available for the study of porpoises and Phocoenidae evolution, phylogeny and conservation. KW - cetaceans KW - genomics/proteomics KW - mammals KW - molecular evolution Y1 - 2018 U6 - https://doi.org/10.1111/1755-0998.12932 SN - 1755-098X SN - 1755-0998 VL - 18 IS - 6 SP - 1469 EP - 1481 PB - Wiley CY - Hoboken ER - TY - JOUR A1 - Barlow, Axel A1 - Cahill, James A. 
A1 - Hartmann, Stefanie A1 - Theunert, Christoph A1 - Xenikoudakis, Georgios A1 - Gonzalez-Fortes, Gloria M. A1 - Paijmans, Johanna L. A. A1 - Rabeder, Gernot A1 - Frischauf, Christine A1 - Garcia-Vazquez, Ana A1 - Murtskhvaladze, Marine A1 - Saarma, Urmas A1 - Anijalg, Peeter A1 - Skrbinsek, Tomaz A1 - Bertorelle, Giorgio A1 - Gasparian, Boris A1 - Bar-Oz, Guy A1 - Pinhasi, Ron A1 - Slatkin, Montgomery A1 - Dalen, Love A1 - Shapiro, Beth A1 - Hofreiter, Michael T1 - Partial genomic survival of cave bears in living brown bears JF - Nature Ecology & Evolution N2 - Although many large mammal species went extinct at the end of the Pleistocene epoch, their DNA may persist due to past episodes of interspecies admixture. However, direct empirical evidence of the persistence of ancient alleles remains scarce. Here, we present multifold coverage genomic data from four Late Pleistocene cave bears (Ursus spelaeus complex) and show that cave bears hybridized with brown bears (Ursus arctos) during the Pleistocene. We develop an approach to assess both the directionality and relative timing of gene flow. We find that segments of cave bear DNA still persist in the genomes of living brown bears, with cave bears contributing 0.9 to 2.4% of the genomes of all brown bears investigated. Our results show that even though extinction is typically considered as absolute, following admixture, fragments of the gene pool of extinct species can survive for tens of thousands of years in the genomes of extant recipient species. Y1 - 2018 U6 - https://doi.org/10.1038/s41559-018-0654-8 SN - 2397-334X VL - 2 IS - 10 SP - 1563 EP - 1570 PB - Nature Publ. Group CY - London ER - TY - GEN A1 - Barlow, Axel A1 - Hartmann, Stefanie A1 - Gonzalez, Javier A1 - Hofreiter, Michael A1 - Paijmans, Johanna L. A. 
T1 - Consensify BT - a method for generating pseudohaploid genome sequences from palaeogenomic datasets with reduced error rates T2 - Postprints der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe N2 - A standard practise in palaeogenome analysis is the conversion of mapped short read data into pseudohaploid sequences, frequently by selecting a single high-quality nucleotide at random from the stack of mapped reads. This controls for biases due to differential sequencing coverage, but it does not control for differential rates and types of sequencing error, which are frequently large and variable in datasets obtained from ancient samples. These errors have the potential to distort phylogenetic and population clustering analyses, and to mislead tests of admixture using D statistics. We introduce Consensify, a method for generating pseudohaploid sequences, which controls for biases resulting from differential sequencing coverage while greatly reducing error rates. The error correction is derived directly from the data itself, without the requirement for additional genomic resources or simplifying assumptions such as contemporaneous sampling. For phylogenetic and population clustering analysis, we find that Consensify is less affected by artefacts than methods based on single read sampling. For D statistics, Consensify is more resistant to false positives and appears to be less affected by biases resulting from different laboratory protocols than other frequently used methods. Although Consensify is developed with palaeogenomic data in mind, it is applicable for any low to medium coverage short read datasets. We predict that Consensify will be a useful tool for future studies of palaeogenomes. 
T3 - Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe - 1033 KW - palaeogenomics KW - ancient DNA KW - sequencing error KW - error reduction KW - D statistics KW - bioinformatics Y1 - 2020 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus4-472521 SN - 1866-8372 IS - 1033 ER - TY - JOUR A1 - Barlow, Axel A1 - Hartmann, Stefanie A1 - Gonzalez, Javier A1 - Hofreiter, Michael A1 - Paijmans, Johanna L. A. T1 - Consensify BT - a method for generating pseudohaploid genome sequences from palaeogenomic datasets with reduced error rates JF - Genes / Molecular Diversity Preservation International N2 - A standard practise in palaeogenome analysis is the conversion of mapped short read data into pseudohaploid sequences, frequently by selecting a single high-quality nucleotide at random from the stack of mapped reads. This controls for biases due to differential sequencing coverage, but it does not control for differential rates and types of sequencing error, which are frequently large and variable in datasets obtained from ancient samples. These errors have the potential to distort phylogenetic and population clustering analyses, and to mislead tests of admixture using D statistics. We introduce Consensify, a method for generating pseudohaploid sequences, which controls for biases resulting from differential sequencing coverage while greatly reducing error rates. The error correction is derived directly from the data itself, without the requirement for additional genomic resources or simplifying assumptions such as contemporaneous sampling. For phylogenetic and population clustering analysis, we find that Consensify is less affected by artefacts than methods based on single read sampling. For D statistics, Consensify is more resistant to false positives and appears to be less affected by biases resulting from different laboratory protocols than other frequently used methods. 
Although Consensify is developed with palaeogenomic data in mind, it is applicable for any low to medium coverage short read datasets. We predict that Consensify will be a useful tool for future studies of palaeogenomes. KW - palaeogenomics KW - ancient DNA KW - sequencing error KW - error reduction KW - D statistics KW - bioinformatics Y1 - 2020 U6 - https://doi.org/10.3390/genes11010050 SN - 2073-4425 VL - 11 IS - 1 PB - MDPI CY - Basel ER - TY - JOUR A1 - Bartel, Manuela A1 - Hartmann, Stefanie A1 - Lehmann, Karola A1 - Postel, Kai A1 - Quesada, Humberto A1 - Philipp, Eva E. R. A1 - Heilmann, Katja A1 - Micheel, Burkhard A1 - Stuckas, Heiko T1 - Identification of sperm proteins as candidate biomarkers for the analysis of reproductive isolation in Mytilus: a case study for the enkurin locus JF - Marine biology : international journal on life in oceans and coastal waters N2 - Sperm proteins of the marine sessile mussels of the Mytilus edulis species complex are models to investigate reproductive isolation and speciation. This study aimed at identifying sperm proteins and their corresponding genes. This was aided by the use of monoclonal antibodies that preferentially bind to yet unknown sperm molecules. By identifying their target molecules, this approach identified proteins with relevance to Mytilus sperm function. This procedure identified 16 proteins, for example, enkurin, laminin, porin and heat shock proteins. The potential use of these proteins as genetic markers to study reproductive isolation is exemplified by analysing the enkurin locus. Enkurin evolution is driven by purifying selection, the locus displays high levels of intraspecific variation and species-specific alleles group in distinct phylogenetic clusters. 
These findings characterize enkurin as an informative candidate biomarker for analyses of clinal variation and differential introgression in hybrid zones, for example, to understand determinants of reproductive isolation in Baltic Mytilus populations. Y1 - 2012 U6 - https://doi.org/10.1007/s00227-012-2005-7 SN - 0025-3162 VL - 159 IS - 10 SP - 2195 EP - 2207 PB - Springer CY - New York ER - TY - GEN A1 - Bleidorn, Christoph A1 - Podsiadlowski, Lars A1 - Zhong, Min A1 - Eeckhaut, Igor A1 - Hartmann, Stefanie A1 - Halanych, Kenneth M. A1 - Tiedemann, Ralph T1 - On the phylogenetic position of Myzostomida : can 77 genes get it wrong? N2 - Background: Phylogenomic analyses recently became popular to address questions about deep metazoan phylogeny. Ribosomal proteins (RP) dominate many of these analyses or are, in some cases, the only genes included. Despite initial hopes, phylogenomic analyses including tens to hundreds of genes still fail to robustly place many bilaterian taxa. Results: Using the phylogenetic position of myzostomids as an example, we show that phylogenies derived from RP genes and mitochondrial genes produce incongruent results. Whereas the former support a position within a clade of platyzoan taxa, mitochondrial data recovers an annelid affinity, which is strongly supported by the gene order data and is congruent with morphology. Using hypothesis testing, our RP data significantly rejects the annelid affinity, whereas a platyzoan relationship is significantly rejected by the mitochondrial data. Conclusion: We conclude (i) that reliance on a set of markers belonging to a single class of macromolecular complexes might bias the analysis, and (ii) that concatenation of all available data might introduce conflicting signal into phylogenetic analyses. We therefore strongly recommend testing for data incongruence in phylogenomic analyses. 
Furthermore, judging all available data, we consider the annelid affinity hypothesis more plausible than a possible platyzoan affinity for myzostomids, and suspect long branch attraction is influencing the RP data. However, this hypothesis needs further confirmation by future analyses. T3 - Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe - paper 123 KW - Cirriferum myzostomida KW - Mitochondrial genomes KW - Transfer-rna KW - Data sets KW - Sequence Y1 - 2009 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus-44893 ER - TY - JOUR A1 - Bonizzoni, Mariangela A1 - Bourjea, Jerome A1 - Chen, Bin A1 - Crain, B. J. A1 - Cui, Liwang A1 - Fiorentino, V. A1 - Hartmann, Stefanie A1 - Hendricks, S. A1 - Ketmaier, Valerio A1 - Ma, Xiaoguang A1 - Muths, Delphine A1 - Pavesi, Laura A1 - Pfautsch, Simone A1 - Rieger, M. A. A1 - Santonastaso, T. A1 - Sattabongkot, Jetsumon A1 - Taron, C. H. A1 - Taron, D. J. A1 - Tiedemann, Ralph A1 - Yan, Guiyun A1 - Zheng, Bin A1 - Zhong, Daibin T1 - Permanent genetic resources added to molecular ecology resources database 1 April 2011-31 May 2011 JF - Molecular ecology resources N2 - This article documents the addition of 92 microsatellite marker loci to the Molecular Ecology Resources Database. Loci were developed for the following species: Anopheles minimus, An. sinensis, An. dirus, Calephelis mutica, Lutjanus kasmira, Murella muralis and Orchestia montagui. These loci were cross-tested on the following species: Calephelis arizonensi, Calephelis borealis, Calephelis nemesis, Calephelis virginiensis and Lutjanus bengalensis. Y1 - 2011 U6 - https://doi.org/10.1111/j.1755-0998.2011.03046.x SN - 1755-098X VL - 11 IS - 5 SP - 935 EP - 936 PB - Wiley-Blackwell CY - Malden ER - TY - JOUR A1 - Burleigh, J. Gordon A1 - Bansal, Mukul S. A1 - Eulenstein, Oliver A1 - Hartmann, Stefanie A1 - Wehe, Andre A1 - Vision, Todd J. 
T1 - Genome-Scale Phylogenetics inferring the plant tree of life from 18,896 gene trees JF - Systematic biology N2 - Phylogenetic analyses using genome-scale data sets must confront incongruence among gene trees, which in plants is exacerbated by frequent gene duplications and losses. Gene tree parsimony (GTP) is a phylogenetic optimization criterion in which a species tree that minimizes the number of gene duplications induced among a set of gene trees is selected. The run time performance of previous implementations has limited its use on large-scale data sets. We used new software that incorporates recent algorithmic advances to examine the performance of GTP on a plant data set consisting of 18,896 gene trees containing 510,922 protein sequences from 136 plant taxa (giving a combined alignment length of >2.9 million characters). The relationships inferred from the GTP analysis were largely consistent with previous large-scale studies of backbone plant phylogeny and resolved some controversial nodes. The placement of taxa that were present in few gene trees generally varied the most among GTP bootstrap replicates. Excluding these taxa either before or after the GTP analysis revealed high levels of phylogenetic support across plants. The analyses supported magnoliids sister to a eudicot + monocot clade and did not support the eurosid I and II clades. This study presents a nuclear genomic perspective on the broad-scale phylogenic relationships among plants, and it demonstrates that nuclear genes with a history of duplication and loss can be phylogenetically informative for resolving the plant tree of life. KW - Gene tree-species tree reconciliation KW - gene tree parsimony KW - plant phylogeny KW - phylogenomics Y1 - 2011 U6 - https://doi.org/10.1093/sysbio/syq072 SN - 1063-5157 VL - 60 IS - 2 SP - 117 EP - 125 PB - Oxford Univ. Press CY - Oxford ER - TY - JOUR A1 - Cheng, Fuxia A1 - Hartmann, Stefanie A1 - Gupta, Mayetri A1 - Ibrahim, Joseph G. 
A1 - Vision, Todd J. T1 - A hierarchical model for incomplete alignments in phylogenetic inference N2 - Motivation: Full-length DNA and protein sequences that span the entire length of a gene are ideally used for multiple sequence alignments (MSAs) and the subsequent inference of their relationships. Frequently, however, MSAs contain a substantial amount of missing data. For example, expressed sequence tags (ESTs), which are partial sequences of expressed genes, are the predominant source of sequence data for many organisms. The patterns of missing data typical for EST-derived alignments greatly compromise the accuracy of estimated phylogenies. Results: We present a statistical method for inferring phylogenetic trees from EST-based incomplete MSA data. We propose a class of hierarchical models for modeling pairwise distances between the sequences, and develop a fully Bayesian approach for estimation of the model parameters. Once the distance matrix is estimated, the phylogenetic tree may be constructed by applying neighbor-joining (or any other algorithm of choice). We also show that maximizing the marginal likelihood from the Bayesian approach yields similar results to a profile likelihood estimation. The proposed methods are illustrated using simulated protein families, for which the true phylogeny is known, and one real protein family. Y1 - 2009 UR - http://bioinformatics.oxfordjournals.org/ U6 - https://doi.org/10.1093/bioinformatics/btp015 SN - 1367-4803 ER - TY - JOUR A1 - Dennis, Alice B. A1 - Ballesteros, Gabriel I. A1 - Robin, Stéphanie A1 - Schrader, Lukas A1 - Bast, Jens A1 - Berghöfer, Jan A1 - Beukeboom, Leo W. A1 - Belghazi, Maya A1 - Bretaudeau, Anthony A1 - Buellesbach, Jan A1 - Cash, Elizabeth A1 - Colinet, Dominique A1 - Dumas, Zoé A1 - Errbii, Mohammed A1 - Falabella, Patrizia A1 - Gatti, Jean-Luc A1 - Geuverink, Elzemiek A1 - Gibson, Joshua D. 
A1 - Hertaeg, Corinne A1 - Hartmann, Stefanie A1 - Jacquin-Joly, Emmanuelle A1 - Lammers, Mark A1 - Lavandero, Blas I. A1 - Lindenbaum, Ina A1 - Massardier-Galata, Lauriane A1 - Meslin, Camille A1 - Montagné, Nicolas A1 - Pak, Nina A1 - Poirié, Marylène A1 - Salvia, Rosanna A1 - Smith, Chris R. A1 - Tagu, Denis A1 - Tares, Sophie A1 - Vogel, Heiko A1 - Schwander, Tanja A1 - Simon, Jean-Christophe A1 - Figueroa, Christian C. A1 - Vorburger, Christoph A1 - Legeai, Fabrice A1 - Gadau, Jürgen T1 - Functional insights from the GC-poor genomes of two aphid parasitoids, Aphidius ervi and Lysiphlebus fabarum JF - BMC Genomics N2 - Background Parasitoid wasps have fascinating life cycles and play an important role in trophic networks, yet little is known about their genome content and function. Parasitoids that infect aphids are an important group with the potential for biological control. Their success depends on adapting to develop inside aphids and overcoming both host aphid defenses and their protective endosymbionts. Results We present the de novo genome assemblies, detailed annotation, and comparative analysis of two closely related parasitoid wasps that target pest aphids: Aphidius ervi and Lysiphlebus fabarum (Hymenoptera: Braconidae: Aphidiinae). The genomes are small (139 and 141 Mbp) and the most AT-rich reported thus far for any arthropod (GC content: 25.8 and 23.8%). This nucleotide bias is accompanied by skewed codon usage and is stronger in genes with adult-biased expression. AT-richness may be the consequence of reduced genome size, a near absence of DNA methylation, and energy efficiency. We identify missing desaturase genes, whose absence may underlie mimicry in the cuticular hydrocarbon profile of L. fabarum. We highlight key gene groups including those underlying venom composition, chemosensory perception, and sex determination, as well as potential losses in immune pathway genes. 
Conclusions These findings are of fundamental interest for insect evolution and biological control applications. They provide a strong foundation for further functional studies into coevolution between parasitoids and their hosts. Both genomes are available at https://bipaa.genouest.org. KW - Parasitoid wasp KW - Aphid host KW - Aphidius ervi KW - Lysiphlebus fabarum KW - de novo genome assembly KW - DNA methylation loss KW - Chemosensory genes KW - Venom proteins KW - GC content KW - Toll and Imd pathways Y1 - 2020 U6 - https://doi.org/10.1186/s12864-020-6764-0 SN - 1471-2164 VL - 21 PB - BioMed Central CY - London ER - TY - GEN A1 - Dennis, Alice B. A1 - Ballesteros, Gabriel I. A1 - Robin, Stéphanie A1 - Schrader, Lukas A1 - Bast, Jens A1 - Berghöfer, Jan A1 - Beukeboom, Leo W. A1 - Belghazi, Maya A1 - Bretaudeau, Anthony A1 - Buellesbach, Jan A1 - Cash, Elizabeth A1 - Colinet, Dominique A1 - Dumas, Zoé A1 - Errbii, Mohammed A1 - Falabella, Patrizia A1 - Gatti, Jean-Luc A1 - Geuverink, Elzemiek A1 - Gibson, Joshua D. A1 - Hertaeg, Corinne A1 - Hartmann, Stefanie A1 - Jacquin-Joly, Emmanuelle A1 - Lammers, Mark A1 - Lavandero, Blas I. A1 - Lindenbaum, Ina A1 - Massardier-Galata, Lauriane A1 - Meslin, Camille A1 - Montagné, Nicolas A1 - Pak, Nina A1 - Poirié, Marylène A1 - Salvia, Rosanna A1 - Smith, Chris R. A1 - Tagu, Denis A1 - Tares, Sophie A1 - Vogel, Heiko A1 - Schwander, Tanja A1 - Simon, Jean-Christophe A1 - Figueroa, Christian C. A1 - Vorburger, Christoph A1 - Legeai, Fabrice A1 - Gadau, Jürgen T1 - Functional insights from the GC-poor genomes of two aphid parasitoids, Aphidius ervi and Lysiphlebus fabarum T2 - Postprints der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe N2 - Background Parasitoid wasps have fascinating life cycles and play an important role in trophic networks, yet little is known about their genome content and function. Parasitoids that infect aphids are an important group with the potential for biological control. 
Their success depends on adapting to develop inside aphids and overcoming both host aphid defenses and their protective endosymbionts. Results We present the de novo genome assemblies, detailed annotation, and comparative analysis of two closely related parasitoid wasps that target pest aphids: Aphidius ervi and Lysiphlebus fabarum (Hymenoptera: Braconidae: Aphidiinae). The genomes are small (139 and 141 Mbp) and the most AT-rich reported thus far for any arthropod (GC content: 25.8 and 23.8%). This nucleotide bias is accompanied by skewed codon usage and is stronger in genes with adult-biased expression. AT-richness may be the consequence of reduced genome size, a near absence of DNA methylation, and energy efficiency. We identify missing desaturase genes, whose absence may underlie mimicry in the cuticular hydrocarbon profile of L. fabarum. We highlight key gene groups including those underlying venom composition, chemosensory perception, and sex determination, as well as potential losses in immune pathway genes. Conclusions These findings are of fundamental interest for insect evolution and biological control applications. They provide a strong foundation for further functional studies into coevolution between parasitoids and their hosts. Both genomes are available at https://bipaa.genouest.org. T3 - Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe - 989 KW - Parasitoid wasp KW - Aphid host KW - Aphidius ervi KW - GC content KW - de novo genome assembly KW - DNA methylation loss KW - Chemosensory genes KW - Toll and Imd pathways KW - Venom proteins KW - Lysiphlebus fabarum Y1 - 2020 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus4-476129 SN - 1866-8372 IS - 989 ER - TY - GEN A1 - Gurke, Marie A1 - Vidal-Gorosquieta, Amalia A1 - Paijmans, Johanna L. A. A1 - Wȩcek, Karolina A1 - Barlow, Axel A1 - González-Fortes, Gloria M. 
A1 - Hartmann, Stefanie A1 - Grandal-d’Anglade, Aurora A1 - Hofreiter, Michael T1 - Insight into the introduction of domestic cattle and the process of Neolithization to the Spanish region Galicia by genetic evidence T2 - Postprints der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe N2 - Domestic cattle were brought to Spain by early settlers and agricultural societies. Due to missing Neolithic sites in the Spanish region of Galicia, very little is known about this process in this region. We sampled 18 cattle subfossils from different ages and different mountain caves in Galicia, of which 11 were subject to sequencing of the mitochondrial genome and phylogenetic analysis, to provide insight into the introduction of cattle to this region. We detected high similarity between samples from different time periods and were able to compare the time frame of the first domesticated cattle in Galicia to data from the connecting region of Cantabria to show a plausible connection between the Neolithization of these two regions. Our data shows a close relationship of the early domesticated cattle of Galicia and modern cow breeds and gives a general insight into cattle phylogeny. We conclude that settlers migrated to this region of Spain from Europe and introduced common European breeds to Galicia. KW - Haplogroups KW - Mitochondria KW - Cattle KW - Genomics KW - Domestic animals KW - Livestock KW - Single nucleotide polymorphisms KW - Neolithic period Y1 - 2021 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus4-520875 SN - 1866-8372 IS - 4 ER - TY - JOUR A1 - Gurke, Marie A1 - Vidal-Gorosquieta, Amalia A1 - Paijmans, Johanna L. A. A1 - Wȩcek, Karolina A1 - Barlow, Axel A1 - González-Fortes, Gloria M. 
A1 - Hartmann, Stefanie A1 - Grandal-d’Anglade, Aurora A1 - Hofreiter, Michael T1 - Insight into the introduction of domestic cattle and the process of Neolithization to the Spanish region Galicia by genetic evidence JF - PLoS ONE N2 - Domestic cattle were brought to Spain by early settlers and agricultural societies. Due to missing Neolithic sites in the Spanish region of Galicia, very little is known about this process in this region. We sampled 18 cattle subfossils from different ages and different mountain caves in Galicia, of which 11 were subject to sequencing of the mitochondrial genome and phylogenetic analysis, to provide insight into the introduction of cattle to this region. We detected high similarity between samples from different time periods and were able to compare the time frame of the first domesticated cattle in Galicia to data from the connecting region of Cantabria to show a plausible connection between the Neolithization of these two regions. Our data shows a close relationship of the early domesticated cattle of Galicia and modern cow breeds and gives a general insight into cattle phylogeny. We conclude that settlers migrated to this region of Spain from Europe and introduced common European breeds to Galicia. KW - Haplogroups KW - Mitochondria KW - Cattle KW - Genomics KW - Domestic animals KW - Livestock KW - Single nucleotide polymorphisms KW - Neolithic period Y1 - 2020 U6 - https://doi.org/10.1371/journal.pone.0249537 SN - 1932-6203 VL - 16 IS - 4 PB - Public Library of Science CY - San Francisco ER - TY - THES A1 - Hartmann, Stefanie T1 - Phylogenomics: comparative genome analysis using large-scale gene family data Y1 - 2011 CY - Potsdam ER - TY - GEN A1 - Hartmann, Stefanie A1 - Hasenkamp, Natascha A1 - Mayer, Jens A1 - Michaux, Johan A1 - Morand, Serge A1 - Mazzoni, Camila J. A1 - Roca, Alfred L. A1 - Greenwood, Alex D. 
T1 - Endogenous murine leukemia retroviral variation across wild European and inbred strains of house mouse T2 - Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe N2 - Background: Endogenous murine leukemia retroviruses (MLVs) are high copy number proviral elements difficult to comprehensively characterize using standard low throughput sequencing approaches. However, high throughput approaches generate data that is challenging to process, interpret and present. Results: Next generation sequencing (NGS) data was generated for MLVs from two wild caught Mus musculus domesticus (from mainland France and Corsica) and for inbred laboratory mouse strains C3H, LP/J and SJL. Sequence reads were grouped using a novel sequence clustering approach as applied to retroviral sequences. A Markov cluster algorithm was employed, and the sequence reads were queried for matches to specific xenotropic (Xmv), polytropic (Pmv) and modified polytropic (Mpmv) viral reference sequences. Conclusions: Various MLV subtypes were more widespread than expected among the mice, which may be due to the higher coverage of NGS, or to the presence of similar sequence across many different proviral loci. The results did not correlate with variation in the major MLV receptor Xpr1, which can restrict exogenous MLVs, suggesting that endogenous MLV distribution may reflect gene flow more than past resistance to infection. T3 - Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe - 1329 KW - murine leukemia virus KW - endogenous retrovirus KW - Xpr1 KW - XMRV KW - genomic evolution KW - Markov cluster algorithm Y1 - 2015 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus4-431200 SN - 1866-8372 IS - 1329 ER - TY - JOUR A1 - Hartmann, Stefanie A1 - Hasenkamp, Natascha A1 - Mayer, Jens A1 - Michaux, Johan A1 - Morand, Serge A1 - Mazzoni, Camila J. A1 - Roca, Alfred L. A1 - Greenwood, Alex D. 
T1 - Endogenous murine leukemia retroviral variation across wild European and inbred strains of house mouse JF - BMC genomics N2 - Background: Endogenous murine leukemia retroviruses (MLVs) are high copy number proviral elements difficult to comprehensively characterize using standard low throughput sequencing approaches. However, high throughput approaches generate data that is challenging to process, interpret and present. Results: Next generation sequencing (NGS) data was generated for MLVs from two wild caught Mus musculus domesticus (from mainland France and Corsica) and for inbred laboratory mouse strains C3H, LP/J and SJL. Sequence reads were grouped using a novel sequence clustering approach as applied to retroviral sequences. A Markov cluster algorithm was employed, and the sequence reads were queried for matches to specific xenotropic (Xmv), polytropic (Pmv) and modified polytropic (Mpmv) viral reference sequences. Conclusions: Various MLV subtypes were more widespread than expected among the mice, which may be due to the higher coverage of NGS, or to the presence of similar sequence across many different proviral loci. The results did not correlate with variation in the major MLV receptor Xpr1, which can restrict exogenous MLVs, suggesting that endogenous MLV distribution may reflect gene flow more than past resistance to infection. KW - Murine leukemia virus KW - Endogenous retrovirus KW - Xpr1 KW - XMRV KW - Genomic evolution KW - Markov cluster algorithm Y1 - 2015 U6 - https://doi.org/10.1186/s12864-015-1766-z SN - 1471-2164 VL - 16 PB - BioMed Central CY - London ER - TY - JOUR A1 - Hartmann, Stefanie A1 - Helm, Conrad A1 - Nickel, Birgit A1 - Meyer, Matthias A1 - Struck, Torsten H. 
A1 - Tiedemann, Ralph A1 - Selbig, Joachim A1 - Bleidorn, Christoph T1 - Exploiting gene families for phylogenomic analysis of myzostomid transcriptome data JF - PLoS one N2 - Background: In trying to understand the evolutionary relationships of organisms, the current flood of sequence data offers great opportunities, but also reveals new challenges with regard to data quality, the selection of data for subsequent analysis, and the automation of steps that were once done manually for single-gene analyses. Even though genome or transcriptome data is available for representatives of most bilaterian phyla, some enigmatic taxa still have an uncertain position in the animal tree of life. This is especially true for myzostomids, a group of symbiotic (or parasitic) protostomes that are either placed with annelids or flatworms. Methodology: Based on similarity criteria, Illumina-based transcriptome sequences of one myzostomid were compared to protein sequences of one additional myzostomid and 29 reference metazoa and clustered into gene families. These families were then used to investigate the phylogenetic position of Myzostomida using different approaches: Alignments of 989 sequence families were concatenated, and the resulting superalignment was analyzed under a Maximum Likelihood criterion. We also used all 1,878 gene trees with at least one myzostomid sequence for a supertree approach: the individual gene trees were computed and then reconciled into a species tree using gene tree parsimony. Conclusions: Superalignments require strictly orthologous genes, and both the gene selection and the widely varying amount of data available for different taxa in our dataset may cause anomalous placements and low bootstrap support. In contrast, gene tree parsimony is designed to accommodate multilocus gene families and therefore allows a much more comprehensive data set to be analyzed. 
Results of this supertree approach showed a well-resolved phylogeny, in which myzostomids were part of the annelid radiation, and major bilaterian taxa were found to be monophyletic. Y1 - 2012 U6 - https://doi.org/10.1371/journal.pone.0029843 SN - 1932-6203 VL - 7 IS - 1 PB - PLoS CY - San Francisco ER - TY - GEN A1 - Hartmann, Stefanie A1 - Preick, Michaela A1 - Abelt, Silke A1 - Scheffel, André A1 - Hofreiter, Michael T1 - Annotated genome sequences of the carnivorous plant Roridula gorgonias and a non-carnivorous relative, Clethra arborea T2 - Postprints der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe N2 - Objective Plant carnivory is distributed across the tree of life and has evolved at least six times independently, but sequenced and annotated nuclear genomes of carnivorous plants are currently lacking. We have sequenced and structurally annotated the nuclear genome of the carnivorous Roridula gorgonias and that of a non-carnivorous relative, Madeira’s lily-of-the-valley-tree, Clethra arborea, both within the Ericales. This data adds an important resource to study the evolutionary genetics of plant carnivory across angiosperm lineages and also for functional and systematic aspects of plants within the Ericales. Results Our assemblies have total lengths of 284 Mbp (R. gorgonias) and 511 Mbp (C. arborea) and show high BUSCO scores of 84.2% and 89.5%, respectively. We used their predicted genes together with publicly available data from other Ericales’ genomes and transcriptomes to assemble a phylogenomic data set for the inference of a species tree. However, groups of orthologs showed a marked absence of species represented by a transcriptome. We discuss possible reasons and caution against combining predicted genes from genome- and transcriptome-based assemblies. 
T3 - Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe - 1141 KW - Carnivorous plant KW - Roridula gorgonias KW - Clethra arborea KW - Genome assembly KW - Transcriptome assembly KW - Phylogenomics KW - Orthologous Matrix (OMA) Project Y1 - 2021 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus4-503752 SN - 1866-8372 ER - TY - JOUR A1 - Hartmann, Stefanie A1 - Preick, Michaela A1 - Abelt, Silke A1 - Scheffel, André A1 - Hofreiter, Michael T1 - Annotated genome sequences of the carnivorous plant Roridula gorgonias and a non-carnivorous relative, Clethra arborea JF - BMC Research Notes N2 - Objective Plant carnivory is distributed across the tree of life and has evolved at least six times independently, but sequenced and annotated nuclear genomes of carnivorous plants are currently lacking. We have sequenced and structurally annotated the nuclear genome of the carnivorous Roridula gorgonias and that of a non-carnivorous relative, Madeira’s lily-of-the-valley-tree, Clethra arborea, both within the Ericales. This data adds an important resource to study the evolutionary genetics of plant carnivory across angiosperm lineages and also for functional and systematic aspects of plants within the Ericales. Results Our assemblies have total lengths of 284 Mbp (R. gorgonias) and 511 Mbp (C. arborea) and show high BUSCO scores of 84.2% and 89.5%, respectively. We used their predicted genes together with publicly available data from other Ericales’ genomes and transcriptomes to assemble a phylogenomic data set for the inference of a species tree. However, groups of orthologs showed a marked absence of species represented by a transcriptome. We discuss possible reasons and caution against combining predicted genes from genome- and transcriptome-based assemblies. 
KW - Carnivorous plant KW - Roridula gorgonias KW - Clethra arborea KW - Genome assembly KW - Transcriptome assembly KW - Phylogenomics KW - Orthologous Matrix (OMA) Project Y1 - 2020 U6 - https://doi.org/10.1186/s13104-020-05254-4 SN - 1756-0500 VL - 13 PB - BioMed Central CY - London ER - TY - BOOK A1 - Hartmann, Stefanie A1 - Selbig, Joachim T1 - Introductory Bioinformatics Y1 - 2009 SN - 978-3-8370-5189-6 PB - Books on Demand CY - Norderstedt ER -