Refine
Has Fulltext
- yes (3)
Year of publication
- 2020 (3) (remove)
Document Type
- Postprint (3) (remove)
Language
- English (3) (remove)
Is part of the Bibliography
- yes (3)
Keywords
- Carnivorous plant (1)
- Clethra arborea (1)
- D statistics (1)
- Genome assembly (1)
- Genomics (1)
- Mitochondria (1)
- Museum collections (1)
- Orthologous Matrix (OMA) Project (1)
- Paleogenetics (1)
- Phylogenetic analysis (1)
Institute
- Institut für Biochemie und Biologie (3) (remove)
Objective
Plant carnivory is distributed across the tree of life and has evolved at least six times independently, but sequenced and annotated nuclear genomes of carnivorous plants are currently lacking. We have sequenced and structurally annotated the nuclear genome of the carnivorous Roridula gorgonias and that of a non-carnivorous relative, Madeira’s lily-of-the-valley-tree, Clethra arborea, both within the Ericales. This data adds an important resource to study the evolutionary genetics of plant carnivory across angiosperm lineages and also for functional and systematic aspects of plants within the Ericales.
Results
Our assemblies have total lengths of 284 Mbp (R. gorgonias) and 511 Mbp (C. arborea) and show high BUSCO scores of 84.2% and 89.5%, respectively. We used their predicted genes together with publicly available data from other Ericales’ genomes and transcriptomes to assemble a phylogenomic data set for the inference of a species tree. However, groups of orthologs showed a marked absence of species represented by a transcriptome. We discuss possible reasons and caution against combining predicted genes from genome- and transriptome-based assemblies.
Consensify
(2020)
A standard practise in palaeogenome analysis is the conversion of mapped short read data into pseudohaploid sequences, frequently by selecting a single high-quality nucleotide at random from the stack of mapped reads. This controls for biases due to differential sequencing coverage, but it does not control for differential rates and types of sequencing error, which are frequently large and variable in datasets obtained from ancient samples. These errors have the potential to distort phylogenetic and population clustering analyses, and to mislead tests of admixture using D statistics. We introduce Consensify, a method for generating pseudohaploid sequences, which controls for biases resulting from differential sequencing coverage while greatly reducing error rates. The error correction is derived directly from the data itself, without the requirement for additional genomic resources or simplifying assumptions such as contemporaneous sampling. For phylogenetic and population clustering analysis, we find that Consensify is less affected by artefacts than methods based on single read sampling. For D statistics, Consensify is more resistant to false positives and appears to be less affected by biases resulting from different laboratory protocols than other frequently used methods. Although Consensify is developed with palaeogenomic data in mind, it is applicable for any low to medium coverage short read datasets. We predict that Consensify will be a useful tool for future studies of palaeogenomes.
Utilising a reconstructed ancestral mitochondrial genome of a clade to design hybridisation capture baits can provide the opportunity for recovering mitochondrial sequences from all its descendent and even sister lineages. This approach is useful for taxa with no extant close relatives, as is often the case for rare or extinct species, and is a viable approach for the analysis of historical museum specimens. Asiatic linsangs (genus Prionodon) exemplify this situation, being rare Southeast Asian carnivores for which little molecular data is available. Using ancestral capture we recover partial mitochondrial genome sequences for seven banded linsangs (P. linsang) from historical specimens, representing the first intraspecific genetic dataset for this species. We additionally assemble a high quality mitogenome for the banded linsang using shotgun sequencing for time-calibrated phylogenetic analysis. This reveals a deep divergence between the two Asiatic linsang species (P. linsang, P. pardicolor), with an estimated divergence of ~12 million years (Ma). Although our sample size precludes any robust interpretation of the population structure of the banded linsang, we recover two distinct matrilines with an estimated tMRCA of ~1 Ma. Our results can be used as a basis for further investigation of the Asiatic linsangs, and further demonstrate the utility of ancestral capture for studying divergent taxa without close relatives.