TY - GEN
A1 - Barlow, Axel
A1 - Hartmann, Stefanie
A1 - Gonzalez, Javier
A1 - Hofreiter, Michael
A1 - Paijmans, Johanna L. A.
T1 - Consensify
BT - a method for generating pseudohaploid genome sequences from palaeogenomic datasets with reduced error rates
T2 - Postprints der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe
N2 - A standard practice in palaeogenome analysis is the conversion of mapped short read data into pseudohaploid sequences, frequently by selecting a single high-quality nucleotide at random from the stack of mapped reads. This controls for biases due to differential sequencing coverage, but it does not control for differential rates and types of sequencing error, which are frequently large and variable in datasets obtained from ancient samples. These errors have the potential to distort phylogenetic and population clustering analyses, and to mislead tests of admixture using D statistics. We introduce Consensify, a method for generating pseudohaploid sequences, which controls for biases resulting from differential sequencing coverage while greatly reducing error rates. The error correction is derived directly from the data itself, without the requirement for additional genomic resources or simplifying assumptions such as contemporaneous sampling. For phylogenetic and population clustering analysis, we find that Consensify is less affected by artefacts than methods based on single read sampling. For D statistics, Consensify is more resistant to false positives and appears to be less affected by biases resulting from different laboratory protocols than other frequently used methods. Although Consensify is developed with palaeogenomic data in mind, it is applicable for any low to medium coverage short read datasets. We predict that Consensify will be a useful tool for future studies of palaeogenomes.
T3 - Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe - 1033
KW - palaeogenomics
KW - ancient DNA
KW - sequencing error
KW - error reduction
KW - D statistics
KW - bioinformatics
Y1 - 2020
U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus4-472521
SN - 1866-8372
IS - 1033
ER -

TY - GEN
A1 - Taron, Ulrike H.
A1 - Lell, Moritz
A1 - Barlow, Axel
A1 - Paijmans, Johanna L. A.
T1 - Testing of Alignment Parameters for Ancient Samples
BT - Evaluating and Optimizing Mapping Parameters for Ancient Samples Using the TAPAS Tool
T2 - Genes
N2 - High-throughput sequence data retrieved from ancient or other degraded samples has led to unprecedented insights into the evolutionary history of many species, but the analysis of such sequences also poses specific computational challenges. The most commonly used approach involves mapping sequence reads to a reference genome. However, this process becomes increasingly challenging with an elevated genetic distance between target and reference or with the presence of contaminant sequences with high sequence similarity to the target species. The evaluation and testing of mapping efficiency and stringency are thus paramount for the reliable identification and analysis of ancient sequences. In this paper, we present ‘TAPAS’ (Testing of Alignment Parameters for Ancient Samples), a computational tool that enables the systematic testing of mapping tools for ancient data by simulating sequence data reflecting the properties of an ancient dataset and performing test runs using the mapping software and parameter settings of interest. We showcase TAPAS by using it to assess and improve mapping strategy for a degraded sample from a banded linsang (Prionodon linsang), for which no closely related reference is currently available. This enables a 1.8-fold increase of the number of mapped reads without sacrificing mapping specificity. The increase of mapped reads effectively reduces the need for additional sequencing, thus making more economical use of time, resources, and sample material.
T3 - Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe - 415
KW - ancient DNA
KW - short-read mapping
KW - palaeogenomics
KW - alignment sensitivity / specificity
Y1 - 2018
U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus4-409683
ER -

TY - GEN
A1 - Hofreiter, Michael
A1 - Paijmans, Johanna L. A.
A1 - Goodchild, Helen
A1 - Speller, Camilla F.
A1 - Barlow, Axel
A1 - Gonzalez-Fortes, Gloria M.
A1 - Thomas, Jessica A.
A1 - Ludwig, Arne
A1 - Collins, Matthew J.
T1 - The future of ancient DNA
BT - technical advances and conceptual shifts
T2 - Postprints der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe
N2 - Technological innovations such as next generation sequencing and DNA hybridisation enrichment have resulted in multi-fold increases in both the quantity of ancient DNA sequence data and the time depth for DNA retrieval. To date, over 30 ancient genomes have been sequenced, moving from 0.7x coverage (mammoth) in 2008 to more than 50x coverage (Neanderthal) in 2014. Studies of rapid evolutionary changes, such as the evolution and spread of pathogens and the genetic responses of hosts, or the genetics of domestication and climatic adaptation, are developing swiftly and the importance of palaeogenomics for investigating evolutionary processes during the last million years is likely to increase considerably. However, these new datasets require new methods of data processing and analysis, as well as conceptual changes in interpreting the results. In this review we highlight important areas of future technical and conceptual progress and discuss research topics in the rapidly growing field of palaeogenomics.
T3 - Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe - 908
KW - ancient DNA
KW - hybridisation capture
KW - multi-locus data
KW - next generation sequencing (NGS)
KW - palaeogenomics
KW - population genomics
Y1 - 2020
U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus4-438816
SN - 1866-8372
IS - 908
SP - 284
EP - 295
ER -