• search hit 2 of 35
Back to Result List

Consensify

  • A standard practise in palaeogenome analysis is the conversion of mapped short read data into pseudohaploid sequences, frequently by selecting a single high-quality nucleotide at random from the stack of mapped reads. This controls for biases due to differential sequencing coverage, but it does not control for differential rates and types of sequencing error, which are frequently large and variable in datasets obtained from ancient samples. These errors have the potential to distort phylogenetic and population clustering analyses, and to mislead tests of admixture using D statistics. We introduce Consensify, a method for generating pseudohaploid sequences, which controls for biases resulting from differential sequencing coverage while greatly reducing error rates. The error correction is derived directly from the data itself, without the requirement for additional genomic resources or simplifying assumptions such as contemporaneous sampling. For phylogenetic and population clustering analysis, we find that Consensify is less affectedA standard practise in palaeogenome analysis is the conversion of mapped short read data into pseudohaploid sequences, frequently by selecting a single high-quality nucleotide at random from the stack of mapped reads. This controls for biases due to differential sequencing coverage, but it does not control for differential rates and types of sequencing error, which are frequently large and variable in datasets obtained from ancient samples. These errors have the potential to distort phylogenetic and population clustering analyses, and to mislead tests of admixture using D statistics. We introduce Consensify, a method for generating pseudohaploid sequences, which controls for biases resulting from differential sequencing coverage while greatly reducing error rates. The error correction is derived directly from the data itself, without the requirement for additional genomic resources or simplifying assumptions such as contemporaneous sampling. For phylogenetic and population clustering analysis, we find that Consensify is less affected by artefacts than methods based on single read sampling. For D statistics, Consensify is more resistant to false positives and appears to be less affected by biases resulting from different laboratory protocols than other frequently used methods. Although Consensify is developed with palaeogenomic data in mind, it is applicable for any low to medium coverage short read datasets. We predict that Consensify will be a useful tool for future studies of palaeogenomes.show moreshow less

Export metadata

Additional Services

Search Google Scholar Statistics
Metadaten
Author details:Axel BarlowORCiDGND, Stefanie HartmannORCiDGND, Javier Gonzalez, Michael HofreiterORCiDGND, Johanna L. A. PaijmansORCiDGND
DOI:https://doi.org/10.3390/genes11010050
ISSN:2073-4425
Pubmed ID:https://pubmed.ncbi.nlm.nih.gov/31906474
Title of parent work (English):Genes / Molecular Diversity Preservation International
Subtitle (English):a method for generating pseudohaploid genome sequences from palaeogenomic datasets with reduced error rates
Publisher:MDPI
Place of publishing:Basel
Publication type:Article
Language:English
Date of first publication:2020/01/02
Publication year:2020
Release date:2023/11/13
Tag:D statistics; ancient DNA; bioinformatics; error reduction; palaeogenomics; sequencing error
Volume:11
Issue:1
Article number:50
Number of pages:22
Funding institution:University of Leicester [M38DF64]; European Research Council (ERC); consolidator grant 'gene flow'European Research Council (ERC) [310763]
Organizational units:Mathematisch-Naturwissenschaftliche Fakultät / Institut für Biochemie und Biologie
DDC classification:5 Naturwissenschaften und Mathematik / 57 Biowissenschaften; Biologie / 570 Biowissenschaften; Biologie
Peer review:Referiert
Publishing method:Open Access / Gold Open-Access
DOAJ gelistet
License (German):License LogoCC-BY - Namensnennung 4.0 International
Accept ✔
This website uses technically necessary session cookies. By continuing to use the website, you agree to this. You can find our privacy policy here.