TY  - JOUR
A1  - Barlow, Axel
A1  - Hartmann, Stefanie
A1  - Gonzalez, Javier
A1  - Hofreiter, Michael
A1  - Paijmans, Johanna L. A.
T1  - Consensify
BT  - a method for generating pseudohaploid genome sequences from palaeogenomic datasets with reduced error rates
JF  - Genes / Molecular Diversity Preservation International
N2  - A standard practise in palaeogenome analysis is the conversion of mapped short read data into pseudohaploid sequences, frequently by selecting a single high-quality nucleotide at random from the stack of mapped reads. This controls for biases due to differential sequencing coverage, but it does not control for differential rates and types of sequencing error, which are frequently large and variable in datasets obtained from ancient samples. These errors have the potential to distort phylogenetic and population clustering analyses, and to mislead tests of admixture using D statistics. We introduce Consensify, a method for generating pseudohaploid sequences, which controls for biases resulting from differential sequencing coverage while greatly reducing error rates. The error correction is derived directly from the data itself, without the requirement for additional genomic resources or simplifying assumptions such as contemporaneous sampling. For phylogenetic and population clustering analysis, we find that Consensify is less affected by artefacts than methods based on single read sampling. For D statistics, Consensify is more resistant to false positives and appears to be less affected by biases resulting from different laboratory protocols than other frequently used methods. Although Consensify is developed with palaeogenomic data in mind, it is applicable for any low to medium coverage short read datasets. We predict that Consensify will be a useful tool for future studies of palaeogenomes.
KW  - palaeogenomics
KW  - ancient DNA
KW  - sequencing error
KW  - error reduction
KW  - D statistics
KW  - bioinformatics
Y1  - 2020
U6  - https://doi.org/10.3390/genes11010050
SN  - 2073-4425
VL  - 11
IS  - 1
PB  - MDPI
CY  - Basel
ER  - 
TY  - JOUR
A1  - Alberti, Federica
A1  - Gonzalez, Javier
A1  - Paijmans, Johanna L. A.
A1  - Basler, Nikolas
A1  - Preick, Michaela
A1  - Henneberger, Kirstin
A1  - Trinks, Alexandra
A1  - Rabeder, Gernot
A1  - Conard, Nicholas J.
A1  - Muenzel, Susanne C.
A1  - Joger, Ulrich
A1  - Fritsch, Guido
A1  - Hildebrandt, Thomas
A1  - Hofreiter, Michael
A1  - Barlow, Axel
T1  - Optimized DNA sampling of ancient bones using Computed Tomography scans
JF  - Molecular ecology resources
N2  - The prevalence of contaminant microbial DNA in ancient bone samples represents the principal limiting factor for palaeogenomic studies, as it may comprise more than 99% of DNA molecules obtained. Efforts to exclude or reduce this contaminant fraction have been numerous but also variable in their success. Here, we present a simple but highly effective method to increase the relative proportion of endogenous molecules obtained from ancient bones. Using computed tomography (CT) scanning, we identify the densest region of a bone as optimal for sampling. This approach accurately identifies the densest internal regions of petrous bones, which are known to be a source of high-purity ancient DNA. For ancient long bones, CT scans reveal a high-density outermost layer, which has been routinely removed and discarded prior to DNA extraction. For almost all long bones investigated, we find that targeted sampling of this outermost layer provides an increase in endogenous DNA content over that obtained from softer, trabecular bone. This targeted sampling can produce as much as 50-fold increase in the proportion of endogenous DNA, providing a directly proportional reduction in sequencing costs for shotgun sequencing experiments. The observed increases in endogenous DNA proportion are not associated with any reduction in absolute endogenous molecule recovery. Although sampling the outermost layer can result in higher levels of human contamination, some bones were found to have more contamination associated with the internal bone structures. Our method is highly consistent, reproducible and applicable across a wide range of bone types, ages and species. We predict that this discovery will greatly extend the potential to study ancient populations and species in the genomics era.
KW  - ancient DNA
KW  - computer tomography
KW  - palaeogenomics
KW  - paleogenetics
KW  - petrous bone
Y1  - 2018
U6  - https://doi.org/10.1111/1755-0998.12911
SN  - 1755-098X
SN  - 1755-0998
VL  - 18
IS  - 6
SP  - 1196
EP  - 1208
PB  - Wiley
CY  - Hoboken
ER  - 
TY  - GEN
A1  - Barlow, Axel
A1  - Hartmann, Stefanie
A1  - Gonzalez, Javier
A1  - Hofreiter, Michael
A1  - Paijmans, Johanna L. A.
T1  - Consensify
BT  - a method for generating pseudohaploid genome sequences from palaeogenomic datasets with reduced error rates
T2  - Postprints der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe
N2  - A standard practise in palaeogenome analysis is the conversion of mapped short read data into pseudohaploid sequences, frequently by selecting a single high-quality nucleotide at random from the stack of mapped reads. This controls for biases due to differential sequencing coverage, but it does not control for differential rates and types of sequencing error, which are frequently large and variable in datasets obtained from ancient samples. These errors have the potential to distort phylogenetic and population clustering analyses, and to mislead tests of admixture using D statistics. We introduce Consensify, a method for generating pseudohaploid sequences, which controls for biases resulting from differential sequencing coverage while greatly reducing error rates. The error correction is derived directly from the data itself, without the requirement for additional genomic resources or simplifying assumptions such as contemporaneous sampling. For phylogenetic and population clustering analysis, we find that Consensify is less affected by artefacts than methods based on single read sampling. For D statistics, Consensify is more resistant to false positives and appears to be less affected by biases resulting from different laboratory protocols than other frequently used methods. Although Consensify is developed with palaeogenomic data in mind, it is applicable for any low to medium coverage short read datasets. We predict that Consensify will be a useful tool for future studies of palaeogenomes.
T3  - Zweitveröffentlichungen der Universität Potsdam : Mathematisch-Naturwissenschaftliche Reihe - 1033 
KW  - palaeogenomics
KW  - ancient DNA
KW  - sequencing error
KW  - error reduction
KW  - D statistics
KW  - bioinformatics
Y1  - 2020
U6  - http://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:kobv:517-opus4-472521
SN  - 1866-8372
IS  - 1033
ER  -