86 research outputs found
The era of reference genomes in conservation genomics
Progress in genome sequencing now enables the large-scale generation of reference genomes. Various international initiatives aim to generate reference genomes representing global biodiversity. These genomes provide unique insights into genomic diversity and architecture, thereby enabling comprehensive analyses of population and functional genomics, and are expected to revolutionize conservation genomics.Peer reviewe
Widespread false gene gains caused by duplication errors in genome assemblies
Abstract
Background
False duplications in genome assemblies lead to false biological conclusions. We quantified false duplications in popularly used previous genome assemblies for platypus, zebra finch, and Annas Hummingbird, and their new counterparts of the same species generated by the Vertebrate Genomes Project, of which the Vertebrate Genomes Project pipeline attempted to eliminate false duplications through haplotype phasing and purging. These assemblies are among the first generated by the Vertebrate Genomes Project where there was a prior chromosomal level reference assembly to compare with.
Results
Whole genome alignments revealed that 4 to 16% of the sequences are falsely duplicated in the previous assemblies, impacting hundreds to thousands of genes. These lead to overestimated gene family expansions. The main source of the false duplications is heterotype duplications, where the haplotype sequences were relatively more divergent than other parts of the genome leading the assembly algorithms to classify them as separate genes or genomic regions. A minor source is sequencing errors. Ancient ATP nucleotide binding gene families have a higher prevalence of false duplications compared to other gene families. Although present in a smaller proportion, we observe false duplications remaining in the Vertebrate Genomes Project assemblies that can be identified and purged.
Conclusions
This study highlights the need for more advanced assembly methods that better separate haplotypes and sequence errors, and the need for cautious analyses on gene gains
The Mitogenome Relationships and Phylogeography of Barn Swallows (Hirundo rustica)
The barn swallow (Hirundo rustica) poses a number of fascinating scientific questions, including the taxonomic status of postulated subspecies. Here, we obtained and assessed the sequence variation of 411 complete mitogenomes, mainly from the European H. r. rustica, but other subspecies as well. In almost every case, we observed subspecies-specific haplogroups, which we employed together with estimated radiation times to postulate a model for the geographical and temporal worldwide spread of the species. The female barn swallow carrying the Hirundo rustica ancestral mitogenome left Africa (or its vicinity) around 280 thousand years ago (kya), and her descendants expanded first into Eurasia and then, at least 51â
kya, into the Americas, from where a relatively recent (<20â
kya) back migration to Asia took place. The exception to the haplogroup subspecies specificity is represented by the sedentary Levantine H. r. transitiva that extensively shares haplogroup A with the migratory European H. r. rustica and, to a lesser extent, haplogroup B with the Egyptian H. r. savignii. Our data indicate that rustica and transitiva most likely derive from a sedentary Levantine population source that split at the end of the Younger Dryas (YD) (11.7â
kya). Since then, however, transitiva received genetic inputs from and admixed with both the closely related rustica and the adjacent savignii. Demographic analyses confirm this species' strong link with climate fluctuations and human activities making it an excellent indicator for monitoring and assessing the impact of current global changes on wildlife
The era of reference genomes in conservation genomics
Progress in genome sequencing now enables the large-scale generation of reference genomes. Various international initiatives aim to generate reference genomes representing global biodiversity. These genomes provide unique insights into genomic diversity and architecture, thereby enabling comprehensive analyses of population and functional genomics, and are expected to revolutionize conservation genomics
Bridging the gap in African biodiversity genomics and bioinformatics:Open Institute of the African BioGenome Project:
The Open Institute of the African BioGenome Project empowers African scientists and institutions with the skill sets, capacity and infrastructure to advance scientific knowledge and innovation and drive economic growth
Population genomics of the critically endangered kÄkÄpĆ
Summary The kÄkÄpĆ is a flightless parrot endemic to New Zealand. Once common in the archipelago, only 201 individuals remain today, most of them descending from an isolated island population. We report the first genome-wide analyses of the species, including a high-quality genome assembly for kÄkÄpĆ, one of the first chromosome-level reference genomes sequenced by the Vertebrate Genomes Project (VGP). We also sequenced and analyzed 35 modern genomes from the sole surviving island population and 14 genomes from the extinct mainland population. While theory suggests that such a small population is likely to have accumulated deleterious mutations through genetic drift, our analyses on the impact of the long-term small population size in kÄkÄpĆ indicate that present-day island kÄkÄpĆ have a reduced number of harmful mutations compared to mainland individuals. We hypothesize that this reduced mutational load is due to the island population having been subjected to a combination of genetic drift and purging of deleterious mutations, through increased inbreeding and purifying selection, since its isolation from the mainland âŒ10,000 years ago. Our results provide evidence that small populations can survive even when isolated for hundreds of generations. This work provides key insights into kÄkÄpĆ breeding and recovery and more generally into the application of genetic tools in conservation efforts for endangered species
The swan genome and transcriptome, its not all black and white
BACKGROUND: The Australian black swan (Cygnus atratus) is an iconic species with contrasting plumage to that of the closely related northern hemisphere white swans. The relative geographic isolation of the black swan may have resulted in a limited immune repertoire and increased susceptibility to infectious diseases, notably infectious diseases from which Australia has been largely shielded. Unlike mallard ducks and the mute swan (Cygnus olor), the black swan is extremely sensitive to highly pathogenic avian influenza. Understanding this susceptibility has been impaired by the absence of any available swan genome and transcriptome information. RESULTS: Here, we generate the first chromosome-length black and mute swan genomes annotated with transcriptome data, all using long-read based pipelines generated for vertebrate species. We use these genomes and transcriptomes to show that unlike other wild waterfowl, black swans lack an expanded immune gene repertoire, lack a key viral pattern-recognition receptor in endothelial cells and mount a poorly controlled inflammatory response to highly pathogenic avian influenza. We also implicate genetic differences in SLC45A2 gene in the iconic plumage of the black swan. CONCLUSION: Together, these data suggest that the immune system of the black swan is such that should any avian viral infection become established in its native habitat, the black swan would be in a significant peril. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-022-02838-0
- âŠ