243 research outputs found
The evolution of developmental programs : a case study in the gastropod mollusc Patella vulgata
At the interface between evolutionary biology and developmental biology is the so-called field of evolutionary developmental biology (evo-devo in short). This field asks how different adult animals (species) came into being by heritable changes during their embryonic development. One way to study this question is a comparative analysis of genes that are important for the development of different species. Many genes appear to be conserved during evolution, i.e. the same gene can have a comparable role during the development of very different species. This developmental genetic unity underlying very different animals has puzzled evolutionists as well as developmental biologists. It has led to two main questions:
-what aspects of gene function can be considered as conserved, thus evolutionary very ancient, perhaps even ancestral to metazoans (or bilaterians)?
-how can such different animals be generated from such similar sets of genes?
This study is involved with the phylum Mollusca (snails, slugs, octopuses and the like). Very little is known on the genetics of early development of molluscs. Data on the role of genes during the development of molluscan animals, therefore, is likely to provide new insights and will contribute to answering the abovementioned questions.
The study organism of this thesis is the gastropod mollusk Patella vulgata. Orthologs (gene homologs) of seven genes are described: snail, twist, orthodenticle, orthopedia, engrailed, dpp en hedgehog. The question asked is in this thesis is: what insights do we obtain when we broaden our comparative analysis of the role developmental genes play to molluscs?
The research on these seven genes has given a number of new insights. One is a better understanding on what role of these genes has been conserved during evolution. It is very likely that the common evolutionary ancestor to all animals already had these genes with these particular functions.
Despite the fact that many animals have genes in common, often with the same function, not all animals are alike. The research described in this thesis contributed to our understanding of this apparent paradox. A gene can have a certain function, which allows it to contribute to the development of very different structures or organs. For example, the engrailed gene is involved in generating boundaries between groups of cells. During development, cells expressing the engrailed gene form a compartment excluding other, neighbouring cells that form another compartment (and do not express engrailed). Both compartments become different parts of the adult organism. In the fruitfly, for example, engrailed plays a role in boundary formation between segments, the building blocks of the body of the fly. But also in molluscs, such as Patella, engrailed is involved in boundary formation, namely between cells that form the shell and the surrounding cells that do not contribute to shell formation.
The fact that such very different structures such as segments in flies and shells in snails use the same gene for their formation can thus be explained by the fact that engrailed is a boundary formation gene. This is the way evolution creates different animals from the same set of genes: by recombining similar building blocks in new ways, new structures can be formed, a phenomenon called tinkering (or bricolage)
Assessing diversity of the female urine microbiota by high throughput sequencing of 16S rDNA amplicons
<p>Abstract</p> <p>Background</p> <p>Urine within the urinary tract is commonly regarded as "sterile" in cultivation terms. Here, we present a comprehensive in-depth study of bacterial 16S rDNA sequences associated with urine from healthy females by means of culture-independent high-throughput sequencing techniques.</p> <p>Results</p> <p>Sequencing of the V1V2 and V6 regions of the 16S ribosomal RNA gene using the 454 GS FLX system was performed to characterize the possible bacterial composition in 8 culture-negative (<100,000 CFU/ml) healthy female urine specimens. Sequences were compared to 16S rRNA databases and showed significant diversity, with the predominant genera detected being <it>Lactobacillus</it>, <it>Prevotella </it>and <it>Gardnerella</it>. The bacterial profiles in the female urine samples studied were complex; considerable variation between individuals was observed and a common microbial signature was not evident. Notably, a significant amount of sequences belonging to bacteria with a known pathogenic potential was observed. The number of operational taxonomic units (OTUs) for individual samples varied substantially and was in the range of 20 - 500.</p> <p>Conclusions</p> <p>Normal female urine displays a noticeable and variable bacterial 16S rDNA sequence richness, which includes fastidious and anaerobic bacteria previously shown to be associated with female urogenital pathology.</p
A genome-wide analysis of nonribosomal peptide synthetase gene clusters and their peptides in a Planktothrix rubescens strain
<p>Abstract</p> <p>Background</p> <p>Cyanobacteria often produce several different oligopeptides, with unknown biological functions, by nonribosomal peptide synthetases (NRPS). Although some cyanobacterial NRPS gene cluster types are well described, the entire NRPS genomic content within a single cyanobacterial strain has never been investigated. Here we have combined a genome-wide analysis using massive parallel pyrosequencing ("454") and mass spectrometry screening of oligopeptides produced in the strain <it>Planktothrix rubescens </it>NIVA CYA 98 in order to identify all putative gene clusters for oligopeptides.</p> <p>Results</p> <p>Thirteen types of oligopeptides were uncovered by mass spectrometry (MS) analyses. Microcystin, cyanopeptolin and aeruginosin synthetases, highly similar to already characterized NRPS, were present in the genome. Two novel NRPS gene clusters were associated with production of anabaenopeptins and microginins, respectively. Sequence-depth of the genome and real-time PCR data revealed three copies of the microginin gene cluster. Since NRPS gene cluster candidates for microviridin and oscillatorin synthesis could not be found, putative (gene encoded) precursor peptide sequences to microviridin and oscillatorin were found in the genes <it>mdn</it>A and <it>osc</it>A, respectively. The genes flanking the microviridin and oscillatorin precursor genes encode putative modifying enzymes of the precursor oligopeptides. We therefore propose ribosomal pathways involving modifications and cyclisation for microviridin and oscillatorin. The microviridin, anabaenopeptin and cyanopeptolin gene clusters are situated in close proximity to each other, constituting an oligopeptide island.</p> <p>Conclusion</p> <p>Altogether seven nonribosomal peptide synthetase (NRPS) gene clusters and two gene clusters putatively encoding ribosomal oligopeptide biosynthetic pathways were revealed. Our results demonstrate that whole genome shotgun sequencing combined with MS-directed determination of oligopeptides successfully can identify NRPS gene clusters and the corresponding oligopeptides. The analyses suggest independent evolution of all NRPS gene clusters as functional units. Our data indicate that the <it>Planktothrix </it>genome displays evolution of dual pathways (NRPS and ribosomal) for production of oligopeptides in order to maximize the diversity of oligopeptides with similar but functional discrete bioactivities.</p
A High-Quality Assembly of the Nine-Spined Stickleback (Pungitius pungitius) Genome
The Gasterosteidae fish family hosts several species that are important models for eco-evolutionary, genetic, and genomic research. In particular, a wealth of genetic and genomic data has been generated for the three-spined stickleback (Gasterosteus aculeatus), the "ecology's supermodel," whereas the genomic resources for the nine-spined stickleback (Pungitius pungitius) have remained relatively scarce. Here, we report a high-quality chromosome-level genome assembly of P. pungitius consisting of 5,303 contigs (N50 = 1.2Mbp) with a total size of 521 Mbp. These contigs were mapped to 21 linkage groups using a high-density linkage map, yielding a final assembly with 98.5% BUSCO completeness. A total of 25,062 protein-coding genes were annotated, and about 23% of the assembly was found to consist of repetitive elements. A comprehensive analysis of repetitive elements uncovered centromere-specific tandem repeats and provided insights into the evolution of retrotransposons. A multigene phylogenetic analysis inferred a divergence time of about 26 million years ago (Ma) between nine- and three-spined sticklebacks, which is far older than the commonly assumed estimate of 13 Ma. Compared with the three-spined stickleback, we identified an additional duplication of several genes in the hemoglobin cluster. Sequencing data from populations adapted to different environments indicated potential copy number variations in hemoglobin genes. Furthermore, genome-wide synteny comparisons between three- and nine-spined sticklebacks identified chromosomal rearrangements underlying the karyotypic differences between the two species. The high-quality chromosome-scale assembly of the nine-spined stickleback genome obtained with long-read sequencing technology provides a crucial resource for comparative and population genomic investigations of stickleback fishes and teleosts.Peer reviewe
A novel totivirus and piscine reovirus (PRV) in Atlantic salmon (Salmo salar) with cardiomyopathy syndrome (CMS)
<p>Abstract</p> <p>Background</p> <p>Cardiomyopathy syndrome (CMS) is a severe disease affecting large farmed Atlantic salmon. Mortality often appears without prior clinical signs, typically shortly prior to slaughter. We recently reported the finding and the complete genomic sequence of a novel piscine reovirus (PRV), which is associated with another cardiac disease in Atlantic salmon; heart and skeletal muscle inflammation (HSMI). In the present work we have studied whether PRV or other infectious agents may be involved in the etiology of CMS.</p> <p>Results</p> <p>Using high throughput sequencing on heart samples from natural outbreaks of CMS and from fish experimentally challenged with material from fish diagnosed with CMS a high number of sequence reads identical to the PRV genome were identified. In addition, a sequence contig from a novel totivirus could also be constructed. Using RT-qPCR, levels of PRV in tissue samples were quantified and the totivirus was detected in all samples tested from CMS fish but not in controls. <it>In situ </it>hybridization supported this pattern indicating a possible association between CMS and the novel piscine totivirus.</p> <p>Conclusions</p> <p>Although causality for CMS in Atlantic salmon could not be proven for either of the two viruses, our results are compatible with a hypothesis where, in the experimental challenge studied, PRV behaves as an opportunist whereas the totivirus might be more directly linked with the development of CMS.</p
GSuite HyperBrowser: integrative analysis of dataset collections across the genome and epigenome
Background: Recent large-scale undertakings such as ENCODE and Roadmap Epigenomics have generated experimental data mapped to the human reference genome (as genomic tracks) representing a variety of functional elements across a large number of cell types. Despite the high potential value of these publicly available data for a broad variety of investigations, little attention has been given to the analytical methodology necessary for their widespread utilisation. Findings: We here present a first principled treatment of the analysis of collections of genomic tracks. We have developed novel computational and statistical methodology to permit comparative and confirmatory analyses across multiple and disparate data sources. We delineate a set of generic questions that are useful across a broad range of investigations and discuss the implications of choosing different statistical measures and null models. Examples include contrasting analyses across different tissues or diseases. The methodology has been implemented in a comprehensive open-source software system, the GSuite HyperBrowser. To make the functionality accessible to biologists, and to facilitate reproducible analysis, we have also developed a web-based interface providing an expertly guided and customizable way of utilizing the methodology. With this system, many novel biological questions can flexibly be posed and rapidly answered. Conclusions: Through a combination of streamlined data acquisition, interoperable representation of dataset collections, and customizable statistical analysis with guided setup and interpretation, the GSuite HyperBrowser represents a first comprehensive solution for integrative analysis of track collections across the genome and epigenome. The software is available at: https://hyperbrowser.uio.no.This work was supported by the Research Council of Norway (under grant agreements 221580, 218241, and 231217/F20), by the Norwegian Cancer Society (under grant agreements 71220’PR-2006-0433 and 3485238-2013), and by the South-Eastern Norway Regional Health Authority (under grant agreement 2014041).Peer Reviewe
Genome Evolution of a Tertiary Dinoflagellate Plastid
The dinoflagellates have repeatedly replaced their ancestral peridinin-plastid by plastids derived from a variety of algal lineages ranging from green algae to diatoms. Here, we have characterized the genome of a dinoflagellate plastid of tertiary origin in order to understand the evolutionary processes that have shaped the organelle since it was acquired as a symbiont cell. To address this, the genome of the haptophyte-derived plastid in Karlodinium veneficum was analyzed by Sanger sequencing of library clones and 454 pyrosequencing of plastid enriched DNA fractions. The sequences were assembled into a single contig of 143 kb, encoding 70 proteins, 3 rRNAs and a nearly full set of tRNAs. Comparative genomics revealed massive rearrangements and gene losses compared to the haptophyte plastid; only a small fraction of the gene clusters usually found in haptophytes as well as other types of plastids are present in K. veneficum. Despite the reduced number of genes, the K. veneficum plastid genome has retained a large size due to expanded intergenic regions. Some of the plastid genes are highly diverged and may be pseudogenes or subject to RNA editing. Gene losses and rearrangements are also features of the genomes of the peridinin-containing plastids, apicomplexa and Chromera, suggesting that the evolutionary processes that once shaped these plastids have occurred at multiple independent occasions over the history of the Alveolata
The Atlantic salmon genome provides insights into rediploidization
The whole-genome duplication 80 million years ago of the common ancestor of salmonids (salmonid-specific fourth vertebrate whole-genome duplication, Ss4R) provides unique opportunities to learn about the evolutionary fate of a duplicated vertebrate genome in 70 extant lineages. Here we present a high-quality genome assembly for Atlantic salmon (Salmo salar), and show that large genomic reorganizations, coinciding with bursts of transposon-mediated repeat expansions, were crucial for the post-Ss4R rediploidization process. Comparisons of duplicate gene expression patterns across a wide range of tissues with orthologous genes from a pre-Ss4R outgroup unexpectedly demonstrate far more instances of neofunctionalization than subfunctionalization. Surprisingly, we find that genes that were retained as duplicates after the teleost-specific whole-genome duplication 320 million years ago were not more likely to be retained after the Ss4R, and that the duplicate retention was not influenced to a great extent by the nature of the predicted protein interactions of the gene products. Finally, we demonstrate that the Atlantic salmon assembly can serve as a reference sequence for the study of other salmonids for a range of purposes.publishedVersio
Genome Fragmentation Is Not Confined to the Peridinin Plastid in Dinoflagellates
When plastids are transferred between eukaryote lineages through series of endosymbiosis, their environment changes dramatically. Comparison of dinoflagellate plastids that originated from different algal groups has revealed convergent evolution, suggesting that the host environment mainly influences the evolution of the newly acquired organelle. Recently the genome from the anomalously pigmented dinoflagellate Karlodinium veneficum plastid was uncovered as a conventional chromosome. To determine if this haptophyte-derived plastid contains additional chromosomal fragments that resemble the mini-circles of the peridin-containing plastids, we have investigated its genome by in-depth sequencing using 454 pyrosequencing technology, PCR and clone library analysis. Sequence analyses show several genes with significantly higher copy numbers than present in the chromosome. These genes are most likely extrachromosomal fragments, and the ones with highest copy numbers include genes encoding the chaperone DnaK(Hsp70), the rubisco large subunit (rbcL), and two tRNAs (trnE and trnM). In addition, some photosystem genes such as psaB, psaA, psbB and psbD are overrepresented. Most of the dnaK and rbcL sequences are found as shortened or fragmented gene sequences, typically missing the 3′-terminal portion. Both dnaK and rbcL are associated with a common sequence element consisting of about 120 bp of highly conserved AT-rich sequence followed by a trnE gene, possibly serving as a control region. Decatenation assays and Southern blot analysis indicate that the extrachromosomal plastid sequences do not have the same organization or lengths as the minicircles of the peridinin dinoflagellates. The fragmentation of the haptophyte-derived plastid genome K. veneficum suggests that it is likely a sign of a host-driven process shaping the plastid genomes of dinoflagellates
The khmer software package: enabling efficient nucleotide sequence analysis
The khmer package is a freely available software library for working efficiently with fixed length DNA words, or k-mers. khmer provides implementations of a probabilistic k-mer counting data structure, a compressible De Bruijn graph representation, De Bruijn graph partitioning, and digital normalization. khmer is implemented in C++ and Python, and is freely available under the BSD license at https://github.com/dib-lab/khmer/
- …