High-throughput sequence data retrieved from ancient or other degraded samples has led to unprecedented insights into the evolutionary history of many species, but the analysis of such sequences also poses specific computational challenges. The most commonly used approach involves mapping sequence reads to a reference genome. However, this process becomes increasingly challenging with an elevated genetic distance between target and reference or with the presence of contaminant sequences with high sequence similarity to the target species. The evaluation and testing of mapping efficiency and stringency are thus paramount for the reliable identification and analysis of ancient sequences. In this paper, we present TAPAS (Testing of Alignment Parameters for Ancient Samples), a computational tool that enables the systematic testing of mapping tools for ancient data by simulating sequence data reflecting the properties of an ancient dataset and performing test runs using the mapping software and parameter settings of interest. We showcase TAPAS by using it to assess and improve the mapping strategy for a degraded sample from a banded linsang (Prionodon linsang), for which no closely related reference is currently available. This enables a 1.8-fold increase in the number of mapped reads without sacrificing mapping specificity. The increase in mapped reads effectively reduces the need for additional sequencing, thus making more economical use of time, resources, and sample material.
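The simulate-then-test idea described in this abstract can be illustrated with a minimal Python sketch. It is not the TAPAS implementation, and the function names, the parameters (mean_len, min_len, max_deam, decay) and the fragment-length and damage models are all hypothetical choices made for illustration. The sketch assumes a simple damage model in which C-to-T substitutions cluster at the 5' end of a fragment and G-to-A substitutions at the 3' end, and it scores a mapping run by comparing reported positions with the known simulated origins.

```python
import random


def simulate_ancient_reads(reference, n_reads, mean_len=50, min_len=25,
                           max_deam=0.3, decay=0.3, seed=0):
    """Return (start, read) pairs of short, damaged fragments of `reference`.

    All defaults are illustrative assumptions, not values from the paper.
    """
    rng = random.Random(seed)
    reads = []
    for _ in range(n_reads):
        # Short fragments: a minimum length plus an exponential tail.
        extra = int(rng.expovariate(1.0 / (mean_len - min_len)))
        length = min(len(reference), min_len + extra)
        start = rng.randrange(len(reference) - length + 1)
        bases = list(reference[start:start + length])
        for i, base in enumerate(bases):
            # Damage probability decays with distance from each fragment end.
            p5 = max_deam * (1 - decay) ** i
            p3 = max_deam * (1 - decay) ** (length - 1 - i)
            if base == "C" and rng.random() < p5:
                bases[i] = "T"
            elif base == "G" and rng.random() < p3:
                bases[i] = "A"
        reads.append((start, "".join(bases)))
    return reads


def mapping_summary(true_starts, mapped_starts, tolerance=0):
    """Return (sensitivity, precision); None in mapped_starts means unmapped."""
    mapped = [(t, m) for t, m in zip(true_starts, mapped_starts)
              if m is not None]
    correct = sum(1 for t, m in mapped if abs(t - m) <= tolerance)
    sensitivity = correct / len(true_starts) if true_starts else 0.0
    precision = correct / len(mapped) if mapped else 0.0
    return sensitivity, precision
```

In such a setup, the simulated reads would be passed to the mapping software under each parameter setting of interest, and the reported positions fed to mapping_summary, so that settings can be compared on both the number of reads mapped and how many of them were placed correctly.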
The future of ancient DNA
(2015)
Technological innovations such as next generation sequencing and DNA hybridisation enrichment have resulted in multi-fold increases in both the quantity of ancient DNA sequence data and the time depth for DNA retrieval. To date, over 30 ancient genomes have been sequenced, moving from 0.7x coverage (mammoth) in 2008 to more than 50x coverage (Neanderthal) in 2014. Studies of rapid evolutionary changes, such as the evolution and spread of pathogens and the genetic responses of hosts, or the genetics of domestication and climatic adaptation, are developing swiftly and the importance of palaeogenomics for investigating evolutionary processes during the last million years is likely to increase considerably. However, these new datasets require new methods of data processing and analysis, as well as conceptual changes in interpreting the results. In this review we highlight important areas of future technical and conceptual progress and discuss research topics in the rapidly growing field of palaeogenomics.
The horse is a fascinating animal symbolizing power, beauty, strength and grace. Of all domesticated animal species, the horse has had the largest impact on the course of human history due to its importance for warfare and transportation. Studying the process of horse domestication contributes to our knowledge of the history of horses and even of our own species.
Research based on molecular methods has increasingly focused on the genetic basis of horse domestication. Mitochondrial DNA (mtDNA) analyses of modern and ancient horses detected immense maternal diversity, probably due to many mares that contributed to the domestic population. However, mtDNA does not provide an informative phylogeographic structure. In contrast, Y chromosome analyses displayed almost complete uniformity in modern stallions but relatively high diversity in a few ancient horses. Further molecular markers that seem to be well suited to infer the domestication history of horses or genetic and phenotypic changes during this process are loci associated with phenotypic traits.
This doctoral thesis consists of three different parts for which I analyzed various single nucleotide polymorphisms (SNPs) associated with coat color, locomotion or Y chromosomal variation of horses. These SNPs were genotyped in 350 ancient horses from the Chalcolithic (5,000 BC) to the Middle Ages (11th century). The distribution of the samples ranges from China to the Iberian Peninsula and Iceland. By applying multiplexed next-generation sequencing (NGS) I sequenced short amplicons covering the relevant positions: i) eight coat-color-associated mutations in six genes to deduce the coat color phenotype; ii) the so-called ‘Gait-keeper’ SNP in the DMRT3 gene to screen for the ability to amble; iii) 16 SNPs previously detected in ancient horses to infer the corresponding haplotype. Based on these data I investigated the occurrence and frequencies of alleles underlying the respective phenotypes as well as Y chromosome haplotypes at different times and regions. Also, selection coefficients for several Y chromosome lineages or phenotypes were estimated.
Concerning coat color differences in ancient horses, my work constitutes the most comprehensive study to date. I detected an increase in chestnut horses in the Middle Ages as well as differential selection for spotted and solid phenotypes over time, which reflects changing human preferences.
With regard to ambling horses, the corresponding allele was present in medieval English and Icelandic horses. Based on these results I argue that Norse settlers, who frequently invaded parts of Britain, brought ambling individuals to Iceland from the British Isles, which can be regarded as the origin of this trait. Moreover, these settlers appear to have selected for ambling in Icelandic horses.
Relating to the third trait, the paternal diversity, these findings represent the largest ancient dataset of Y chromosome variation in non-humans. I proved the existence of several Y chromosome haplotypes in early domestic horses. The decline of Y chromosome variation coincides with the movement of nomadic peoples from the Eurasian steppes and later with different breeding practices in the Roman period.
In conclusion, positive selection was estimated for several phenotypes/lineages in different regions or times, which indicates that these were preferred by humans. Furthermore, I could successfully infer the distribution and dispersal of horses in association with human movements and actions. Thereby, a better understanding of the influence of people on the changing appearance and genetic diversity of domestic horses could be gained. My results also emphasize the close relationship between ancient genetics and archeology or history, and that well-founded conclusions can be reached only when these disciplines are combined.
Leopard complex spotting is inherited by the incompletely dominant locus, LP, which also causes congenital stationary night blindness in homozygous horses. We investigated an associated single nucleotide polymorphism in the TRPM1 gene in 96 archaeological bones from 31 localities from Late Pleistocene (approx. 17 000 YBP) to medieval times. The first genetic evidence of LP spotting in Europe dates back to the Pleistocene. We tested for temporal changes in the LP associated allele frequency and estimated coefficients of selection by means of approximate Bayesian computation analyses. Our results show that at least some of the observed frequency changes are congruent with shifts in artificial selection pressure for the leopard complex spotting phenotype. In early domestic horses from Kirklareli-Kanligecit (Turkey) dating to 2700-2200 BC, a remarkably high number of leopard spotted horses (six of 10 individuals) was detected including one adult homozygote. However, LP seems to have largely disappeared during the late Bronze Age, suggesting selection against this phenotype in early domestic horses. During the Iron Age, LP reappeared, probably by reintroduction into the domestic gene pool from wild animals. This picture of alternating selective regimes might explain how genetic diversity was maintained in domestic animals despite selection for specific traits at different times.