TY - JOUR A1 - Barlow, Axel A1 - Hartmann, Stefanie A1 - Gonzalez, Javier A1 - Hofreiter, Michael A1 - Paijmans, Johanna L. A. T1 - Consensify BT - a method for generating pseudohaploid genome sequences from palaeogenomic datasets with reduced error rates JF - Genes / Molecular Diversity Preservation International N2 - A standard practise in palaeogenome analysis is the conversion of mapped short read data into pseudohaploid sequences, frequently by selecting a single high-quality nucleotide at random from the stack of mapped reads. This controls for biases due to differential sequencing coverage, but it does not control for differential rates and types of sequencing error, which are frequently large and variable in datasets obtained from ancient samples. These errors have the potential to distort phylogenetic and population clustering analyses, and to mislead tests of admixture using D statistics. We introduce Consensify, a method for generating pseudohaploid sequences, which controls for biases resulting from differential sequencing coverage while greatly reducing error rates. The error correction is derived directly from the data itself, without the requirement for additional genomic resources or simplifying assumptions such as contemporaneous sampling. For phylogenetic and population clustering analysis, we find that Consensify is less affected by artefacts than methods based on single read sampling. For D statistics, Consensify is more resistant to false positives and appears to be less affected by biases resulting from different laboratory protocols than other frequently used methods. Although Consensify is developed with palaeogenomic data in mind, it is applicable for any low to medium coverage short read datasets. We predict that Consensify will be a useful tool for future studies of palaeogenomes. KW - palaeogenomics KW - ancient DNA KW - sequencing error KW - error reduction KW - D statistics KW - bioinformatics Y1 - 2020 U6 - https://doi.org/10.3390/genes11010050 SN - 2073-4425 VL - 11 IS - 1 PB - MDPI CY - Basel ER - TY - JOUR A1 - Alberti, Federica A1 - Gonzalez, Javier A1 - Paijmans, Johanna L. A. A1 - Basler, Nikolas A1 - Preick, Michaela A1 - Henneberger, Kirstin A1 - Trinks, Alexandra A1 - Rabeder, Gernot A1 - Conard, Nicholas J. A1 - Muenzel, Susanne C. A1 - Joger, Ulrich A1 - Fritsch, Guido A1 - Hildebrandt, Thomas A1 - Hofreiter, Michael A1 - Barlow, Axel T1 - Optimized DNA sampling of ancient bones using Computed Tomography scans JF - Molecular ecology resources N2 - The prevalence of contaminant microbial DNA in ancient bone samples represents the principal limiting factor for palaeogenomic studies, as it may comprise more than 99% of DNA molecules obtained. Efforts to exclude or reduce this contaminant fraction have been numerous but also variable in their success. Here, we present a simple but highly effective method to increase the relative proportion of endogenous molecules obtained from ancient bones. Using computed tomography (CT) scanning, we identify the densest region of a bone as optimal for sampling. This approach accurately identifies the densest internal regions of petrous bones, which are known to be a source of high-purity ancient DNA. For ancient long bones, CT scans reveal a high-density outermost layer, which has been routinely removed and discarded prior to DNA extraction. For almost all long bones investigated, we find that targeted sampling of this outermost layer provides an increase in endogenous DNA content over that obtained from softer, trabecular bone. This targeted sampling can produce as much as 50-fold increase in the proportion of endogenous DNA, providing a directly proportional reduction in sequencing costs for shotgun sequencing experiments. The observed increases in endogenous DNA proportion are not associated with any reduction in absolute endogenous molecule recovery. Although sampling the outermost layer can result in higher levels of human contamination, some bones were found to have more contamination associated with the internal bone structures. Our method is highly consistent, reproducible and applicable across a wide range of bone types, ages and species. We predict that this discovery will greatly extend the potential to study ancient populations and species in the genomics era. KW - ancient DNA KW - computer tomography KW - palaeogenomics KW - paleogenetics KW - petrous bone Y1 - 2018 U6 - https://doi.org/10.1111/1755-0998.12911 SN - 1755-098X SN - 1755-0998 VL - 18 IS - 6 SP - 1196 EP - 1208 PB - Wiley CY - Hoboken ER - TY - GEN A1 - Barlow, Axel A1 - Hartmann, Stefanie A1 - Gonzalez, Javier A1 - Hofreiter, Michael A1 - Paijmans, Johanna L. A. T1 - Consensify BT - a method for generating pseudohaploid genome sequences from palaeogenomic datasets with reduced error rates T2 - Postprints der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe N2 - A standard practise in palaeogenome analysis is the conversion of mapped short read data into pseudohaploid sequences, frequently by selecting a single high-quality nucleotide at random from the stack of mapped reads. This controls for biases due to differential sequencing coverage, but it does not control for differential rates and types of sequencing error, which are frequently large and variable in datasets obtained from ancient samples. These errors have the potential to distort phylogenetic and population clustering analyses, and to mislead tests of admixture using D statistics. We introduce Consensify, a method for generating pseudohaploid sequences, which controls for biases resulting from differential sequencing coverage while greatly reducing error rates. The error correction is derived directly from the data itself, without the requirement for additional genomic resources or simplifying assumptions such as contemporaneous sampling. For phylogenetic and population clustering analysis, we find that Consensify is less affected by artefacts than methods based on single read sampling. For D statistics, Consensify is more resistant to false positives and appears to be less affected by biases resulting from different laboratory protocols than other frequently used methods. Although Consensify is developed with palaeogenomic data in mind, it is applicable for any low to medium coverage short read datasets. We predict that Consensify will be a useful tool for future studies of palaeogenomes. T3 - Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe - 1033 KW - palaeogenomics KW - ancient DNA KW - sequencing error KW - error reduction KW - D statistics KW - bioinformatics Y1 - 2020 U6 - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus4-472521 SN - 1866-8372 IS - 1033 ER -