28 research outputs found

    The Atlantic salmon genome provides insights into rediploidization

    Get PDF
    The whole-genome duplication 80 million years ago of the common ancestor of salmonids (salmonid-specific fourth vertebrate whole-genome duplication, Ss4R) provides unique opportunities to learn about the evolutionary fate of a duplicated vertebrate genome in 70 extant lineages. Here we present a high-quality genome assembly for Atlantic salmon (Salmo salar), and show that large genomic reorganizations, coinciding with bursts of transposon-mediated repeat expansions, were crucial for the post-Ss4R rediploidization process. Comparisons of duplicate gene expression patterns across a wide range of tissues with orthologous genes from a pre-Ss4R outgroup unexpectedly demonstrate far more instances of neofunctionalization than subfunctionalization. Surprisingly, we find that genes that were retained as duplicates after the teleost-specific whole-genome duplication 320 million years ago were not more likely to be retained after the Ss4R, and that the duplicate retention was not influenced to a great extent by the nature of the predicted protein interactions of the gene products. Finally, we demonstrate that the Atlantic salmon assembly can serve as a reference sequence for the study of other salmonids for a range of purposes.publishedVersio

    Genome Evolution of a Tertiary Dinoflagellate Plastid

    Get PDF
    The dinoflagellates have repeatedly replaced their ancestral peridinin-plastid by plastids derived from a variety of algal lineages ranging from green algae to diatoms. Here, we have characterized the genome of a dinoflagellate plastid of tertiary origin in order to understand the evolutionary processes that have shaped the organelle since it was acquired as a symbiont cell. To address this, the genome of the haptophyte-derived plastid in Karlodinium veneficum was analyzed by Sanger sequencing of library clones and 454 pyrosequencing of plastid enriched DNA fractions. The sequences were assembled into a single contig of 143 kb, encoding 70 proteins, 3 rRNAs and a nearly full set of tRNAs. Comparative genomics revealed massive rearrangements and gene losses compared to the haptophyte plastid; only a small fraction of the gene clusters usually found in haptophytes as well as other types of plastids are present in K. veneficum. Despite the reduced number of genes, the K. veneficum plastid genome has retained a large size due to expanded intergenic regions. Some of the plastid genes are highly diverged and may be pseudogenes or subject to RNA editing. Gene losses and rearrangements are also features of the genomes of the peridinin-containing plastids, apicomplexa and Chromera, suggesting that the evolutionary processes that once shaped these plastids have occurred at multiple independent occasions over the history of the Alveolata

    Dissemination of Cephalosporin Resistance Genes between Escherichia coli Strains from Farm Animals and Humans by Specific Plasmid Lineages

    Get PDF
    Third-generation cephalosporins are a class of β-lactam antibiotics that are often used for the treatment of human infections caused by Gram-negative bacteria, especially Escherichia coli. Worryingly, the incidence of human infections caused by third-generation cephalosporin-resistant E. coli is increasing worldwide. Recent studies have suggested that these E. coli strains, and their antibiotic resistance genes, can spread from food-producing animals, via the food-chain, to humans. However, these studies used traditional typing methods, which may not have provided sufficient resolution to reliably assess the relatedness of these strains. We therefore used whole-genome sequencing (WGS) to study the relatedness of cephalosporin-resistant E. coli from humans, chicken meat, poultry and pigs. One strain collection included pairs of human and poultry-associated strains that had previously been considered to be identical based on Multi-Locus Sequence Typing, plasmid typing and antibiotic resistance gene sequencing. The second collection included isolates from farmers and their pigs. WGS analysis revealed considerable heterogeneity between human and poultry-associated isolates. The most closely related pairs of strains from both sources carried 1263 Single-Nucleotide Polymorphisms (SNPs) per Mbp core genome. In contrast, epidemiologically linked strains from humans and pigs differed by only 1.8 SNPs per Mbp core genome. WGS-based plasmid reconstructions revealed three distinct plasmid lineages (IncI1- and IncK-type) that carried cephalosporin resistance genes of the Extended-Spectrum Beta-Lactamase (ESBL)- and AmpC-types. The plasmid backbones within each lineage were virtually identical and were shared by genetically unrelated human and animal isolates. Plasmid reconstructions from short-read sequencing data were validated by long-read DNA sequencing for two strains. Our findings failed to demonstrate evidence for recent clonal transmission of cephalosporin-resistant E. coli strains from poultry to humans, as has been suggested based on traditional, low-resolution typing methods. Instead, our data suggest that cephalosporin resistance genes are mainly disseminated in animals and humans via distinct plasmids

    Improved metagenome assemblies and taxonomic binning using long-read circular consensus sequence data

    Get PDF
    DNA assembly is a core methodological step in metagenomic pipelines used to study the structure and function within microbial communities. Here we investigate the utility of Pacific Biosciences long and high accuracy circular consensus sequencing (CCS) reads for metagenomic projects. We compared the application and performance of both PacBio CCS and Illumina HiSeq data with assembly and taxonomic binning algorithms using metagenomic samples representing a complex microbial community. Eight SMRT cells produced approximately 94 Mb of CCS reads from a biogas reactor microbiome sample that averaged 1319 nt in length and 99.7% accuracy. CCS data assembly generated a comparative number of large contigs greater than 1 kb, to those assembled from a ~190x larger HiSeq dataset (~18 Gb) produced from the same sample (i.e approximately 62% of total contigs). Hybrid assemblies using PacBio CCS and HiSeq contigs produced improvements in assembly statistics, including an increase in the average contig length and number of large contigs. The incorporation of CCS data produced significant enhancements in taxonomic binning and genome reconstruction of two dominant phylotypes, which assembled and binned poorly using HiSeq data alone. Collectively these results illustrate the value of PacBio CCS reads in certain metagenomics applications

    Complete genome and methylome analysis of Neisseria meningitidis associated with increased serogroup Y disease

    No full text
    Invasive meningococcal disease (IMD) due to serogroup Y&nbsp;Neisseria meningitidis&nbsp;emerged in Europe during the 2000s. Draft genomes of serogroup Y isolates in Sweden revealed that although the population structure of these isolates was similar to other serogroup Y isolates internationally, a distinct strain (YI) and more specifically a sublineage (1) of this strain was responsible for the increase of serogroup Y IMD in Sweden. We performed single molecule real-time (SMRT) sequencing on eight serogroup Y isolates from different sublineages to unravel the genetic and epigenetic factors delineating them, in order to understand the serogroup Y emergence. Extensive comparisons between the serogroup Y sublineages of all coding sequences, complex genomic regions, intergenic regions, and methylation motifs revealed small point mutations in genes mainly encoding hypothetical and metabolic proteins, and non-synonymous variants in genes involved in adhesion, iron acquisition, and endotoxin production. The methylation motif CACNNNNNTAC was only found in isolates of sublineage 2. Only seven genes were putatively differentially expressed, and another two genes encoding hypothetical proteins were only present in sublineage 2. These data suggest that the serogroup Y IMD increase in Sweden was most probably due to small changes in genes important for colonization and transmission.</p

    Adaptation to the High-Arctic island environment despite long-term reduced genetic variation in Svalbard reindeer

    No full text
    Summary: Typically much smaller in number than their mainland counterparts, island populations are ideal systems to investigate genetic threats to small populations. The Svalbard reindeer (Rangifer tarandus platyrhynchus) is an endemic subspecies that colonized the Svalbard archipelago ca. 6,000–8,000 years ago and now shows numerous physiological and morphological adaptations to its arctic habitat. Here, we report a de-novo chromosome-level assembly for Svalbard reindeer and analyze 133 reindeer genomes spanning Svalbard and most of the species’ Holarctic range, to examine the genomic consequences of long-term isolation and small population size in this insular subspecies. Empirical data, demographic reconstructions, and forward simulations show that long-term isolation and high inbreeding levels may have facilitated the reduction of highly deleterious—and to a lesser extent, moderately deleterious—variation. Our study indicates that long-term reduced genetic diversity did not preclude local adaptation to the High Arctic, suggesting that even severely bottlenecked populations can retain evolutionary potential
    corecore