Refine
Language
- English (3)
Is part of the Bibliography
- yes (3)
Keywords
- genome assembly (2)
- in silico (2)
- mate-pairs (2)
- scaffolding (2)
- shotgun sequencing (2)
- adaptation (1)
- gene family evolution (1)
- genomics (1)
- mustelids (1)
- positive (1)
Institute
- Institut für Biochemie und Biologie (3) (remove)
Species of the mustelid subfamily Guloninae inhabit diverse habitats on multiple continents, and occupy a variety of ecological niches. They differ in feeding ecologies, reproductive strategies and morphological adaptations. To identify candidate loci associated with adaptations to their respective environments, we generated a de novo assembly of the tayra (Eira barbara), the earliest diverging species in the subfamily, and compared this with the genomes available for the wolverine (Gulo gulo) and the sable (Martes zibellina). Our comparative genomic analyses included searching for signs of positive selection, examining changes in gene family sizes and searching for species-specific structural variants. Among candidate loci associated with phenotypic traits, we observed many related to diet, body condition and reproduction. For example, for the tayra, which has an atypical gulonine reproductive strategy of aseasonal breeding, we observed species-specific changes in many pregnancy-related genes. For the wolverine, a circumpolar hypercarnivore that must cope with seasonal food scarcity, we observed many changes in genes associated with diet and body condition. All types of genomic variation examined (single nucleotide polymorphisms, gene family expansions, structural variants) contributed substantially to the identification of candidate loci. This argues strongly for consideration of variation other than single nucleotide polymorphisms in comparative genomics studies aiming to identify loci of adaptive significance.
Background
Contiguous genome assemblies are a highly valued biological resource because of the higher number of completely annotated genes and genomic elements that are usable compared to fragmented draft genomes. Nonetheless, contiguity is difficult to obtain if only low coverage data and/or only distantly related reference genome assemblies are available.
Findings
In order to improve genome contiguity, we have developed Cross-Species Scaffolding—a new pipeline that imports long-range distance information directly into the de novo assembly process by constructing mate-pair libraries in silico.
Conclusions
We show how genome assembly metrics and gene prediction dramatically improve with our pipeline by assembling two primate genomes solely based on ∼30x coverage of shotgun sequencing data.
Background
Contiguous genome assemblies are a highly valued biological resource because of the higher number of completely annotated genes and genomic elements that are usable compared to fragmented draft genomes. Nonetheless, contiguity is difficult to obtain if only low coverage data and/or only distantly related reference genome assemblies are available.
Findings
In order to improve genome contiguity, we have developed Cross-Species Scaffolding—a new pipeline that imports long-range distance information directly into the de novo assembly process by constructing mate-pair libraries in silico.
Conclusions
We show how genome assembly metrics and gene prediction dramatically improve with our pipeline by assembling two primate genomes solely based on ∼30x coverage of shotgun sequencing data.