TY  - GEN
A1  - Barlow, Axel
A1  - Hartmann, Stefanie
A1  - Gonzalez, Javier
A1  - Hofreiter, Michael
A1  - Paijmans, Johanna L. A.
T1  - Consensify
BT  - a method for generating pseudohaploid genome sequences from palaeogenomic datasets with reduced error rates
T2  - Postprints der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe
N2  - A standard practise in palaeogenome analysis is the conversion of mapped short read data into pseudohaploid sequences, frequently by selecting a single high-quality nucleotide at random from the stack of mapped reads. This controls for biases due to differential sequencing coverage, but it does not control for differential rates and types of sequencing error, which are frequently large and variable in datasets obtained from ancient samples. These errors have the potential to distort phylogenetic and population clustering analyses, and to mislead tests of admixture using D statistics. We introduce Consensify, a method for generating pseudohaploid sequences, which controls for biases resulting from differential sequencing coverage while greatly reducing error rates. The error correction is derived directly from the data itself, without the requirement for additional genomic resources or simplifying assumptions such as contemporaneous sampling. For phylogenetic and population clustering analysis, we find that Consensify is less affected by artefacts than methods based on single read sampling. For D statistics, Consensify is more resistant to false positives and appears to be less affected by biases resulting from different laboratory protocols than other frequently used methods. Although Consensify is developed with palaeogenomic data in mind, it is applicable for any low to medium coverage short read datasets. We predict that Consensify will be a useful tool for future studies of palaeogenomes.
T3  - Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe - 1033 
KW  - palaeogenomics
KW  - ancient DNA
KW  - sequencing error
KW  - error reduction
KW  - D statistics
KW  - bioinformatics
Y1  - 2020
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus4-472521
SN  - 1866-8372
IS  - 1033
ER  - 
TY  - GEN
A1  - Taron, Ulrike H.
A1  - Lell, Moritz
A1  - Barlow, Axel
A1  - Paijmans, Johanna L. A.
T1  - Testing of Alignment Parameters for Ancient Samples
BT  - Evaluating and Optimizing Mapping Parameters for Ancient Samples Using the TAPAS Tool
T2  - Genes
N2  - High-throughput sequence data retrieved from ancient or other degraded samples has led to unprecedented insights into the evolutionary history of many species, but the analysis of such sequences also poses specific computational challenges. The most commonly used approach involves mapping sequence reads to a reference genome. However, this process becomes increasingly challenging with an elevated genetic distance between target and reference or with the presence of contaminant sequences with high sequence similarity to the target species. The evaluation and testing of mapping efficiency and stringency are thus paramount for the reliable identification and analysis of ancient sequences. In this paper, we present ‘TAPAS’, (Testing of Alignment Parameters for Ancient Samples), a computational tool that enables the systematic testing of mapping tools for ancient data by simulating sequence data reflecting the properties of an ancient dataset and performing test runs using the mapping software and parameter settings of interest. We showcase TAPAS by using it to assess and improve mapping strategy for a degraded sample from a banded linsang (Prionodon linsang), for which no closely related reference is currently available. This enables a 1.8-fold increase of the number of mapped reads without sacrificing mapping specificity. The increase of mapped reads effectively reduces the need for additional sequencing, thus making more economical use of time, resources, and sample material.
T3  - Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe - 415 
KW  - ancient DNA
KW  - short-read mapping
KW  - palaeogenomics
KW  - alignment sensitivity / specificity
Y1  - 2018
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus4-409683
ER  - 
TY  - GEN
A1  - Hofreiter, Michael
A1  - Paijmans, Johanna L. A.
A1  - Goodchild, Helen
A1  - Speller, Camilla F.
A1  - Barlow, Axel
A1  - Gonzalez-Fortes, Gloria M.
A1  - Thomas, Jessica A.
A1  - Ludwig, Arne
A1  - Collins, Matthew J.
T1  - The future of ancient DNA
BT  - technical advances and conceptual shifts
T2  - Postprints der Universität Potsdam : Mathematisch Naturwissenschaftliche Reihe
N2  - Technological innovations such as next generation sequencing and DNA hybridisation enrichment have resulted in multi-fold increases in both the quantity of ancient DNA sequence data and the time depth for DNA retrieval. To date, over 30 ancient genomes have been sequenced, moving from 0.7x coverage (mammoth) in 2008 to more than 50x coverage (Neanderthal) in 2014. Studies of rapid evolutionary changes, such as the evolution and spread of pathogens and the genetic responses of hosts, or the genetics of domestication and climatic adaptation, are developing swiftly and the importance of palaeogenomics for investigating evolutionary processes during the last million years is likely to increase considerably. However, these new datasets require new methods of data processing and analysis, as well as conceptual changes in interpreting the results. In this review we highlight important areas of future technical and conceptual progress and discuss research topics in the rapidly growing field of palaeogenomics.
T3  - Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe - 908 
KW  - ancient DNA
KW  - hybridisation capture
KW  - multi-locus data
KW  - next generation sequencing (NGS)
KW  - palaeogenomics
KW  - population genomics
Y1  - 2020
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus4-438816
SN  - 1866-8372
IS  - 908
SP  - 284
EP  - 295
ER  -