69 research outputs found
Genome dynamics of Bartonella grahamii in micro-populations of woodland rodents
<p>Abstract</p> <p>Background</p> <p>Rodents represent a high-risk reservoir for the emergence of new human pathogens. The recent completion of the 2.3 Mb genome of <it>Bartonella grahamii</it>, one of the most prevalent blood-borne bacteria in wild rodents, revealed a higher abundance of genes for host-cell interaction systems than in the genomes of closely related human pathogens. The sequence variability within the global <it>B. grahamii </it>population was recently investigated by multi locus sequence typing, but no study on the variability of putative host-cell interaction systems has been performed.</p> <p>Results</p> <p>To study the population dynamics of <it>B. grahamii</it>, we analyzed the genomic diversity on a whole-genome scale of 27 <it>B. grahamii </it>strains isolated from four different species of wild rodents in three geographic locations separated by less than 30 km. Even using highly variable spacer regions, only 3 sequence types were identified. This low sequence diversity contrasted with a high variability in genome content. Microarray comparative genome hybridizations identified genes for outer surface proteins, including a repeated region containing the <it>fha </it>gene for filamentous hemaggluttinin and a plasmid that encodes a type IV secretion system, as the most variable. The estimated generation times in liquid culture medium for a subset of strains ranged from 5 to 22 hours, but did not correlate with sequence type or presence/absence patterns of the <it>fha </it>gene or the plasmid.</p> <p>Conclusion</p> <p>Our study has revealed a geographic microstructure of <it>B. grahamii </it>in wild rodents. Despite near-identity in nucleotide sequence, major differences were observed in gene presence/absence patterns that did not segregate with host species. This suggests that genetically similar strains can infect a range of different hosts.</p
Combination of short-read, long-read, and optical mapping assemblies reveals large-scale tandem repeat arrays with population genetic implications
Accurate and contiguous genome assembly is key to a comprehensive understanding of the processes shaping genomic diversity and evolution. Yet, it is frequently constrained by constitutive heterochromatin, usually characterized by highly repetitive DNA. As a key feature of genome architecture associated with centromeric and subtelomeric regions, it locally influences meiotic recombination. In this study, we assess the impact of large tandem repeat arrays on the recombination rate landscape in an avian speciation model, the Eurasian crow. We assembled two high-quality genome references using single-molecule real-time sequencing (long-read assembly [LR]) and single-molecule optical maps (optical map assembly [OM]). A three-way comparison including the published short-read assembly (SR) constructed for the same individual allowed assessing assembly properties and pinpointing misassemblies. By combining information from all three assemblies, we characterized 36 previously unidentified large repetitive regions in the proximity of sequence assembly breakpoints, the majority of which contained complex arrays of a 14-kb satellite repeat or its 1.2-kb subunit. Using whole-genome population resequencing data, we estimated the population-scaled recombination rate (ρ) and found it to be significantly reduced in these regions. These findings are consistent with an effect of low recombination in regions adjacent to centromeric or subtelomeric heterochromatin and add to our understanding of the processes generating widespread heterogeneity in genetic diversity and differentiation along the genome. By combining three different technologies, our results highlight the importance of adding a layer of information on genome structure that is inaccessible to each approach independently
Identification of Colletotrichum species associated with anthracnose disease of coffee in Vietnam
Colletotrichum gloeosporioides, C. acutatum, C. capsici and C. boninense associated with anthracnose disease on coffee (Coffea spp.) in Vietnam were identified based on morphology and DNA analysis. Phylogenetic analysis of DNA sequences from the internal transcribed spacer region of nuclear rDNA and a portion of mitochondrial small subunit rRNA were concordant and allowed good separation of the taxa. We found several Colletotrichum isolates of unknown species and their taxonomic position remains unresolved. The majority of Vietnamese isolates belonged to C. gloeosporioides and they grouped together with the coffee berry disease (CBD) fungus, C. kahawae. However, C. kahawae could be distinguished from the Vietnamese C. gloeosporioides isolates based on ammonium tartrate utilization, growth rate and pathogenictity. C. gloeosporioides isolates were more pathogenic on detached green berries than isolates of the other species, i.e. C. acutatum, C capsici and C. boninense. Some of the C. gloeosporioides isolates produced slightly sunken lesion on green berries resembling CBD symptoms but it did not destroy the bean. We did not find any evidence of the presence of C. kahawae in Vietnam
The draft genome of the microscopic Nemertoderma westbladi sheds light on the evolution of Acoelomorpha genomes
Background: Xenacoelomorpha is a marine clade of microscopic worms that is an important model system for understanding the evolution of key bilaterian novelties, such as the excretory system. Nevertheless, Xenacoelomorpha genomics has been restricted to a few species that either can be cultured in the lab or are centimetres long. Thus far, no genomes are available for Nemertodermatida, one of the group’s main clades and whose origin has been dated more than 400 million years ago.Methods: DNA was extracted from a single specimen and sequenced with HiFi following the PacBio Ultra-Low DNA Input protocol. After genome assembly, decontamination, and annotation, the genome quality was benchmarked using two acoel genomes and one Illumina genome as reference. The gene content of three cnidarians, three acoelomorphs, four deuterostomes, and eight protostomes was clustered in orthogroups to make inferences of gene content evolution. Finally, we focused on the genes related to the ultrafiltration excretory system to compare patterns of presence/absence and gene architecture among these clades.Results: We present the first nemertodermatid genome sequenced from a single specimen of Nemertoderma westbladi. Although genome contiguity remains challenging (N50: 60 kb), it is very complete (BUSCO: 80.2%, Metazoa; 88.6%, Eukaryota) and the quality of the annotation allows fine-detail analyses of genome evolution. Acoelomorph genomes seem to be relatively conserved in terms of the percentage of repeats, number of genes, number of exons per gene and intron size. In addition, a high fraction of genes present in both protostomes and deuterostomes are absent in Acoelomorpha. Interestingly, we show that all genes related to the excretory system are present in Xenacoelomorpha except Osr, a key element in the development of these organs and whose acquisition seems to be interconnected with the origin of the specialised excretory system.Conclusion: Overall, these analyses highlight the potential of the Ultra-Low Input DNA protocol and HiFi to generate high-quality genomes from single animals, even for relatively large genomes, making it a feasible option for sequencing challenging taxa, which will be an exciting resource for comparative genomics analyses
Understanding fungal functional biodiversity during the mitigation of environmentally dispersed pentachlorophenol in cork oak forest soils
Pentachlorophenol (PCP) is globally dispersed and contamination of soil with this biocide adversely affects its functional biodiversity, particularly of fungi - key colonizers. Their functional role as a community is poorly understood, although a few pathways have been already elucidated in pure cultures. This constitutes here our main challenge - elucidate how fungi influence the pollutant mitigation processes in forest soils. Circumstantial evidence exists that cork oak forests in N. W. Tunisia - economically critical managed forests are likely to be contaminated with PCP, but the scientific evidence has previously been lacking. Our data illustrate significant forest contamination through the detection of undefined active sources of PCP. By solving the taxonomic diversity and the PCP-derived metabolomes of both the cultivable fungi and the fungal community, we demonstrate here that most strains (predominantly penicillia) participate in the pollutant biotic degradation. They form an array of degradation intermediates and by-products, including several hydroquinone, resorcinol and catechol derivatives, either chlorinated or not. The degradation pathway of the fungal community includes uncharacterized derivatives, e.g. tetrachloroguaiacol isomers. Our study highlights fungi key role in the mineralization and short lifetime of PCP in forest soils and provide novel tools to monitor its degradation in other fungi dominated food webs. © 2015 Society for Applied Microbiology and John Wiley & Sons Ltd
Ecological genomics in the Northern krill uncovers loci for local adaptation across ocean basins
29 pages, 4 figures, supplementary information https://doi.org/10.1038/s41467-024-50239-7.-- Data availability: The sequence data generated in this study have been deposited in the public European Nucleotide Archive (ENA) database under accession code PRJEB61785. The processed data and results are available at the SciLifeLab Data Repository at https://doi.org/10.17044/scilifelab.c.6626216. The genome assembly is available in ENA and NCBI under accession code GCA_964058975.1 and SNP datasets are available under accession code PRJEB77093 in the European Variation Archive (EVA). Subsets of the data are provided in the Supplementary Information. Source data is provided as a Source Data file. This study made use data from the following public databases: AlphaFold Protein Structure Database https://alphafold.ebi.ac.uk/; Climate Reanalyzer https://climatereanalyzer.org/; Dfam (Dfam_3.5) https://dfam.org/home; EggNOG (v5.0) http://eggnog5.embl.de/; FlyBase database (release FB2021_01) https://flybase.org/; GOrilla https://cbl-gorilla.cs.technion.ac.il/; GyDB2 https://gydb.org; HomeoDB https://homeodb.zoo.ox.ac.uk/; KrillDB2 https://krilldb2.bio.unipd.it/; MITOS2 http://mitos.bioinf.uni-leipzig.de/; NCBI Conserved Domain Database (CDD) https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi; NCBI Genome Database https://www.ncbi.nlm.nih.gov/genome/; NCBI RefSeq database (release 204) https://www.ncbi.nlm.nih.gov/refseq/; OrthoDB (v10.1) https://www.orthodb.org/; Pfam (release 34.0) http://pfam.xfam.org/; Repbase (RepBaseRepeatMaskerEdition 20181026) https://www.girinst.org/server/RepBase/; REXdb http://repeatexplorer.org/?page_id=918; ShinyGO 0.77 http://bioinformatics.sdstate.edu/go77/; SILVA rRNA database project (release 132) https://www.arb-silva.de/; The SWISS-MODEL Repository https://swissmodel.expasy.org/; TOPCONS https://topcons.cbr.su.se/; UniProtKB/Swiss-Prot https://www.uniprot.org/. Biological tissue from the reference specimen tissue is available in the LIB Biobank at Museum Koenig Bonn under accession ZFMK-TIS-82493. Three additional specimens are deposited under accessions ZFMK-TIS-82494 through ZFMK-TIS-82496. Source data are provided with this paper.-- Code availability: Public code is available at https://github.com/NBISweden/genecovr and https://github.com/andreaswallberg/Ecological-Genomics-Northern-Krill. A copy of the Github repositories is available on Zenodo: https://zenodo.org/doi/10.5281/zenodo.10827407Krill are vital as food for many marine animals but also impacted by global warming. To learn how they and other zooplankton may adapt to a warmer world we studied local adaptation in the widespread Northern krill (Meganyctiphanes norvegica). We assemble and characterize its large genome and compare genome-scale variation among 74 specimens from the colder Atlantic Ocean and warmer Mediterranean Sea. The 19 Gb genome likely evolved through proliferation of retrotransposons, now targeted for inactivation by extensive DNA methylation, and contains many duplicated genes associated with molting and vision. Analysis of 760 million SNPs indicates extensive homogenizing gene-flow among populations. Nevertheless, we detect signatures of adaptive divergence across hundreds of genes, implicated in photoreception, circadian regulation, reproduction and thermal tolerance, indicating polygenic adaptation to light and temperature. The top gene candidate for ecological adaptation was nrf-6, a lipid transporter with a Mediterranean variant that may contribute to early spring reproduction. Such variation could become increasingly important for fitness in Atlantic stocks. Our study underscores the widespread but uneven distribution of adaptive variation, necessitating characterization of genetic variation among natural zooplankton populations to understand their adaptive potential, predict risks and support ocean conservation in the face of climate changeOpen access funding provided by Uppsala University.With the institutional support of the ‘Severo Ochoa Centre of Excellence’ accreditation (CEX2019-000928-S)Peer reviewe
Standards recommendations for the Earth BioGenome Project
A global international initiative, such as the Earth BioGenome Project (EBP), requires both agreement and coordination on standards to ensure that the collective effort generates rapid progress toward its goals. To this end, the EBP initiated five technical standards committees comprising volunteer members from the global genomics scientific community: Sample Collection and Processing, Sequencing and Assembly, Annotation, Analysis, and IT and Informatics. The current versions of the resulting standards documents are available on the EBP website, with the recognition that opportunities, technologies, and challenges may improve or change in the future, requiring flexibility for the EBP to meet its goals. Here, we describe some highlights from the proposed standards, and areas where additional challenges will need to be met
Run-Off Replication of Host-Adaptability Genes Is Associated with Gene Transfer Agents in the Genome of Mouse-Infecting Bartonella grahamii
The genus Bartonella comprises facultative intracellular bacteria adapted to mammals, including previously recognized and emerging human pathogens. We report the 2,341,328 bp genome sequence of Bartonella grahamii, one of the most prevalent Bartonella species in wild rodents. Comparative genomics revealed that rodent-associated Bartonella species have higher copy numbers of genes for putative host-adaptability factors than the related human-specific pathogens. Many of these gene clusters are located in a highly dynamic region of 461 kb. Using hybridization to a microarray designed for the B. grahamii genome, we observed a massive, putatively phage-derived run-off replication of this region. We also identified a novel gene transfer agent, which packages the bacterial genome, with an over-representation of the amplified DNA, in 14 kb pieces. This is the first observation associating the products of run-off replication with a gene transfer agent. Because of the high concentration of gene clusters for host-adaptation proteins in the amplified region, and since the genes encoding the gene transfer agent and the phage origin are well conserved in Bartonella, we hypothesize that these systems are driven by selection. We propose that the coupling of run-off replication with gene transfer agents promotes diversification and rapid spread of host-adaptability factors, facilitating host shifts in Bartonella
The European Reference Genome Atlas: piloting a decentralised approach to equitable biodiversity genomics
A genomic database of all Earth’s eukaryotic species could contribute to many scientific discoveries; however, only a tiny fraction of species have genomic information available. In 2018, scientists across the world united under the Earth BioGenome Project (EBP), aiming to produce a database of high-quality reference genomes containing all ~1.5 million recognized eukaryotic species. As the European node of the EBP, the European Reference Genome Atlas (ERGA) sought to implement a new decentralised, equitable and inclusive model for producing reference genomes. For this, ERGA launched a Pilot Project establishing the first distributed reference genome production infrastructure and testing it on 98 eukaryotic species from 33 European countries. Here we outline the infrastructure and explore its effectiveness for scaling high-quality reference genome production, whilst considering equity and inclusion. The outcomes and lessons learned provide a solid foundation for ERGA while offering key learnings to other transnational, national genomic resource projects and the EBP.info:eu-repo/semantics/publishedVersio
The European Reference Genome Atlas: piloting a decentralised approach to equitable biodiversity genomics.
ABSTRACT: A global genome database of all of Earth’s species diversity could be a treasure trove of scientific discoveries. However, regardless of the major advances in genome sequencing technologies, only a tiny fraction of species have genomic information available. To contribute to a more complete planetary genomic database, scientists and institutions across the world have united under the Earth BioGenome Project (EBP), which plans to sequence and assemble high-quality reference genomes for all ∼1.5 million recognized eukaryotic species through a stepwise phased approach. As the initiative transitions into Phase II, where 150,000 species are to be sequenced in just four years, worldwide participation in the project will be fundamental to success. As the European node of the EBP, the European Reference Genome Atlas (ERGA) seeks to implement a new decentralised, accessible, equitable and inclusive model for producing high-quality reference genomes, which will inform EBP as it scales. To embark on this mission, ERGA launched a Pilot Project to establish a network across Europe to develop and test the first infrastructure of its kind for the coordinated and distributed reference genome production on 98 European eukaryotic species from sample providers across 33 European countries. Here we outline the process and challenges faced during the development of a pilot infrastructure for the production of reference genome resources, and explore the effectiveness of this approach in terms of high-quality reference genome production, considering also equity and inclusion. The outcomes and lessons learned during this pilot provide a solid foundation for ERGA while offering key learnings to other transnational and national genomic resource projects.info:eu-repo/semantics/publishedVersio
- …