fastlin: an ultra-fast program for Mycobacterium tuberculosis complex lineage typing
SUMMARY: Fastlin is a bioinformatics tool designed for rapid Mycobacterium tuberculosis complex (MTBC) lineage typing. It utilizes an ultra-fast alignment-free approach to detect previously identified barcode single nucleotide polymorphisms associated with specific MTBC lineages. In a comprehensive benchmarking against existing tools, fastlin demonstrated high accuracy and significantly faster running times. AVAILABILITY AND IMPLEMENTATION: fastlin is freely available at https://github.com/rderelle/fastlin and can easily be installed using Conda
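As an illustration of the alignment-free idea described above, the following minimal sketch scans reads for k-mers that carry a barcode SNP and reports the best-supported lineage. It is not fastlin's actual implementation: the function name `type_lineage`, the `min_count` threshold, and the toy barcode table are all hypothetical and chosen only for illustration.

```python
from collections import Counter

def type_lineage(reads, barcodes, k, min_count=2):
    """Return the lineage whose barcode k-mers occur most often in the reads.

    `barcodes` maps a k-mer (assumed to contain a lineage-specific SNP)
    to a lineage label. Lineages seen fewer than `min_count` times are
    ignored; None is returned if no lineage passes that threshold.
    """
    counts = Counter()
    for read in reads:
        # Slide a window of length k along the read, no alignment needed.
        for i in range(len(read) - k + 1):
            lineage = barcodes.get(read[i:i + k])
            if lineage is not None:
                counts[lineage] += 1
    supported = {name: c for name, c in counts.items() if c >= min_count}
    return max(supported, key=supported.get) if supported else None

# Hypothetical toy data, not real MTBC barcodes.
toy_barcodes = {"ACGTA": "lineageA", "TTGCA": "lineageB"}
toy_reads = ["GGACGTAGG", "CCACGTACC", "TTGCAAA"]
result = type_lineage(toy_reads, toy_barcodes, k=5)
```

A dictionary lookup per k-mer keeps the cost linear in the total read length, which is the general reason alignment-free scans of this kind tend to be fast.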
Complementary approaches to understanding the plant circadian clock
Circadian clocks are oscillatory genetic networks that help organisms adapt
to the 24-hour day/night cycle. The clock of the green alga Ostreococcus tauri
is the simplest plant clock discovered so far. Its many advantages as an
experimental system facilitate the testing of computational predictions.
We present a model of the Ostreococcus clock in the stochastic process
algebra Bio-PEPA and exploit its mapping to different analysis techniques, such
as ordinary differential equations, stochastic simulation algorithms and
model-checking. The small number of molecules reported for this system tests
the limits of the continuous approximation underlying differential equations.
We investigate the difference between continuous-deterministic and
discrete-stochastic approaches. Stochastic simulation and model-checking allow
us to formulate new hypotheses on the system behaviour, such as the presence of
self-sustained oscillations in single cells under constant light conditions.
We investigate how to model the timing of dawn and dusk in the context of
model-checking, which we use to compute how the probability distributions of
key biochemical species change over time. These show that the relative
variation in expression level is smallest at the time of peak expression,
making peak time an optimal experimental phase marker. Building on these
analyses, we use approaches from evolutionary systems biology to investigate
how changes in the rate of mRNA degradation impact the phase of a key protein
likely to affect fitness. We explore how robust this circadian clock is towards
such potential mutational changes in its underlying biochemistry. Our work
shows that multiple approaches lead to a more complete understanding of the
clock
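The contrast between continuous-deterministic and discrete-stochastic descriptions mentioned above can be illustrated on a much simpler system than the Ostreococcus clock. The sketch below is a generic birth-death process (constant production, first-order degradation) simulated with the Gillespie stochastic simulation algorithm and compared with the steady state of the corresponding ordinary differential equation. The rate values and function names are assumptions for illustration and are not taken from the Bio-PEPA model.

```python
import random

def gillespie_birth_death(k_prod, k_deg, n0, t_end, seed=0):
    """Time-averaged molecule count from one Gillespie run.

    Reactions: production at rate k_prod, degradation at rate k_deg * n.
    """
    rng = random.Random(seed)
    t, n = 0.0, n0
    area = 0.0  # integral of n(t) dt, used for the time average
    while True:
        rate_prod = k_prod
        rate_deg = k_deg * n
        total = rate_prod + rate_deg
        dt = rng.expovariate(total)  # waiting time to the next reaction
        if t + dt >= t_end:
            area += n * (t_end - t)
            break
        area += n * dt
        t += dt
        # Choose which reaction fires, in proportion to its rate.
        if rng.random() * total < rate_prod:
            n += 1
        else:
            n -= 1
    return area / t_end

def deterministic_steady_state(k_prod, k_deg):
    """Steady state of dn/dt = k_prod - k_deg * n."""
    return k_prod / k_deg

# Illustrative rates only: about 10 molecules at steady state.
stochastic_mean = gillespie_birth_death(10.0, 1.0, n0=0, t_end=500.0)
ode_value = deterministic_steady_state(10.0, 1.0)
```

For this linear system the stochastic time average stays close to the ODE steady state, while individual trajectories fluctuate around it; with small molecule numbers and nonlinear feedback, as in a clock model, the two descriptions can differ more substantially.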
Robustness of circadian clocks to daylight fluctuations: hints from the picoeucaryote Ostreococcus tauri
The development of systemic approaches in biology has put emphasis on
identifying genetic modules whose behavior can be modeled accurately so as to
gain insight into their structure and function. However, most gene circuits in a
cell are under control of external signals and thus quantitative agreement
between experimental data and a mathematical model is difficult. Circadian
biology has been one notable exception: quantitative models of the internal
clock that orchestrates biological processes over the 24-hour diurnal cycle
have been constructed for a few organisms, from cyanobacteria to plants and
mammals. In most cases, a complex architecture with interlocked feedback loops
has been demonstrated. Here we present first modeling results for the circadian
clock of the green unicellular alga Ostreococcus tauri. Two plant-like clock
genes have been shown to play a central role in the Ostreococcus clock. We find
that their expression time profiles can be accurately reproduced by a minimal
model of a two-gene transcriptional feedback loop. Remarkably, best adjustment
of data recorded under light/dark alternation is obtained when assuming that
the oscillator is not coupled to the diurnal cycle. This suggests that coupling
to light is confined to specific time intervals and has no dynamical effect
when the oscillator is entrained by the diurnal cycle. This intriguing
property may reflect a strategy to minimize the impact of fluctuations in
daylight intensity on the core circadian oscillator, a type of perturbation
that has been rarely considered when assessing the robustness of circadian
clocks
3-D Ultrastructure of O. tauri: Electron Cryotomography of an Entire Eukaryotic Cell
The hallmark of eukaryotic cells is their segregation of key biological functions into discrete, membrane-bound organelles. Creating accurate models of their ultrastructural complexity has been difficult in part because of the limited resolution of light microscopy and the artifact-prone nature of conventional electron microscopy. Here we explored the potential of the emerging technology electron cryotomography to produce three-dimensional images of an entire eukaryotic cell in a near-native state. Ostreococcus tauri was chosen as the specimen because as a unicellular picoplankton with just one copy of each organelle, it is the smallest known eukaryote and was therefore likely to yield the highest resolution images. Whole cells were imaged at various stages of the cell cycle, yielding 3-D reconstructions of complete chloroplasts, mitochondria, endoplasmic reticula, Golgi bodies, peroxisomes, microtubules, and putative ribosome distributions in-situ. Surprisingly, the nucleus was seen to open long before mitosis, and while one microtubule (or two in some predivisional cells) was consistently present, no mitotic spindle was ever observed, prompting speculation that a single microtubule might be sufficient to segregate multiple chromosomes
Morphology, Genome Plasticity, and Phylogeny in the Genus Ostreococcus Reveal a Cryptic Species, O. mediterraneus sp. nov. (Mamiellales, Mamiellophyceae)
Coastal marine waters in many regions worldwide support abundant populations of extremely small (1-3 μm diameter) unicellular eukaryotic green algae, dominant taxa including several species in the class Mamiellophyceae. Their diminutive size conceals surprising levels of genetic diversity and defies classical species’ descriptions. We present a detailed analysis within the genus Ostreococcus and show that morphological characteristics cannot be used to describe diversity within this group. Karyotypic analyses of the best-characterized species O. tauri show it to carry two chromosomes that vary in size between individual clonal lines, probably an evolutionarily ancient feature that emerged before species’ divergences within the Mamiellales. By using a culturing technique specifically adapted to members of the genus Ostreococcus, we purified >30 clonal lines of a new species, Ostreococcus mediterraneus sp. nov., previously known as Ostreococcus clade D, that has been overlooked in several studies based on PCR-amplification of genetic markers from environment-extracted DNA. Phylogenetic analyses of the S-adenosylmethionine synthetase gene, and of the complete small subunit ribosomal RNA gene, including detailed comparisons of predicted ITS2 (internal transcribed spacer 2) secondary structures, clearly support that this is a separate species. In addition, karyotypic analyses reveal that the chromosomal location of its ribosomal RNA gene cluster differs from other Ostreococcus clades
Seamless, rapid, and accurate analyses of outbreak genomic data using split k-mer analysis.
Sequence variation observed in populations of pathogens can be used for important public health and evolutionary genomic analyses, especially outbreak analysis and transmission reconstruction. Identifying this variation is typically achieved by aligning sequence reads to a reference genome, but this approach is susceptible to reference biases and requires careful filtering of called genotypes. There is a need for tools that can process this growing volume of bacterial genome data, providing rapid results, but that remain simple so they can be used without highly trained bioinformaticians, expensive data analysis, and long-term storage and processing of large files. Here we describe split k-mer analysis (SKA2), a method that supports both reference-free and reference-based mapping to quickly and accurately genotype populations of bacteria using sequencing reads or genome assemblies. SKA2 is highly accurate for closely related samples, and in outbreak simulations, we show superior variant recall compared with reference-based methods, with no false positives. SKA2 can also accurately map variants to a reference and be used with recombination detection methods to rapidly reconstruct vertical evolutionary history. SKA2 is many times faster than comparable methods and can be used to add new genomes to an existing call set, allowing sequential use without the need to reanalyze entire collections. With an inherent absence of reference bias, high accuracy, and a robust implementation, SKA2 has the potential to become the tool of choice for genotyping bacteria. SKA2 is implemented in Rust and is freely available as open-source software
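The abstract does not spell out the underlying data structure. As a minimal sketch, assume a split k-mer is a pair of flanking sequences around a single middle base, so that two samples sharing the same flanks but differing in the middle base indicate a candidate SNP. The code below illustrates that idea only; it ignores reverse complements and is not SKA2's implementation. The function names and the `flank` parameter are hypothetical.

```python
def split_kmers(seq, flank=3):
    """Map (left flank, right flank) -> set of middle bases seen in seq."""
    table = {}
    width = 2 * flank + 1
    for i in range(len(seq) - width + 1):
        window = seq[i:i + width]
        key = (window[:flank], window[flank + 1:])
        table.setdefault(key, set()).add(window[flank])
    return table

def variant_sites(seq_a, seq_b, flank=3):
    """List (left flank, base in A, base in B, right flank) for candidate SNPs.

    Only flank pairs present in both sequences with a single, unambiguous
    middle base in each are compared.
    """
    a = split_kmers(seq_a, flank)
    b = split_kmers(seq_b, flank)
    sites = []
    for key in a.keys() & b.keys():
        if len(a[key]) == 1 and len(b[key]) == 1 and a[key] != b[key]:
            base_a = next(iter(a[key]))
            base_b = next(iter(b[key]))
            sites.append((key[0], base_a, base_b, key[1]))
    return sorted(sites)

# Toy sequences differing at one position (G versus A).
sites = variant_sites("ACGTACGGTTCA", "ACGTACAGTTCA")
```

Because the comparison is keyed on the flanks rather than on a position in a reference genome, no reference is needed to find the difference between the two toy sequences.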
Critical mutation rate has an exponential dependence on population size for eukaryotic-length genomes with crossover
The critical mutation rate (CMR) determines the shift between survival-of-the-fittest and survival of individuals with greater mutational robustness (“flattest”). We identify an inverse relationship between CMR and sequence length in an in silico system with a two-peak fitness landscape; CMR decreases to no more than five orders of magnitude above estimates of eukaryotic per base mutation rate. We confirm the CMR reduces exponentially at low population sizes, irrespective of peak radius and distance, and increases with the number of genetic crossovers. We also identify an inverse relationship between CMR and the number of genes, confirming that, for a similar number of genes to that for the plant Arabidopsis thaliana (25,000), the CMR is close to its known wild-type mutation rate; mutation rates for additional organisms were also found to be within one order of magnitude of the CMR. This is the first time such a simulation model has been assigned input and produced output within range for a given biological organism. The decrease in CMR with population size previously observed is maintained; there is potential for the model to influence understanding of populations undergoing bottleneck, stress, and conservation strategy for populations near extinction
Phylogenetic Relationships within the Opisthokonta Based on Phylogenomic Analyses of Conserved Single-Copy Protein Domains
Many of the eukaryotic phylogenomic analyses published to date were based on alignments of hundreds to thousands of genes. In such analyses, the most realistic evolutionary models currently available are often used to minimize the impact of systematic error. However, controversy remains over whether or not idiosyncratic gene family dynamics (i.e., gene duplications and losses) and incorrect orthology assignments are always appropriately taken into account. In this paper, we present an innovative strategy for overcoming orthology assignment problems. Rather than identifying and eliminating genes with paralogy problems, we have constructed a data set comprised exclusively of conserved single-copy protein domains that, unlike most of the commonly used phylogenomic data sets, should be less confounded by orthology misassignments. To evaluate the power of this approach, we performed maximum likelihood and Bayesian analyses to infer the evolutionary relationships within the opisthokonts (which include Metazoa, Fungi, and related unicellular lineages). We used this approach to test 1) whether Filasterea and Ichthyosporea form a clade, 2) the interrelationships of early-branching metazoans, and 3) the relationships among early-branching fungi. We also assessed the impact of some methods that are known to minimize systematic error, including reducing the distance between the outgroup and ingroup taxa or using the CAT evolutionary model. Overall, our analyses support the Filozoa hypothesis in which Ichthyosporea are the first holozoan lineage to emerge followed by Filasterea, Choanoflagellata, and Metazoa. Blastocladiomycota appears as a lineage separate from Chytridiomycota, although this result is not strongly supported. These results represent independent tests of previous phylogenetic hypotheses, highlighting the importance of sophisticated approaches for orthology assignment in phylogenomic analyses. © The Author 2011. 
Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved
A Comprehensive Classification and Evolutionary Analysis of Plant Homeobox Genes
The full complement of homeobox transcription factor sequences, including genes and pseudogenes, was determined from the analysis of 10 complete genomes from flowering plants, moss, Selaginella, unicellular green algae, and red algae. Our exhaustive genome-wide searches resulted in the discovery in each class of a greater number of homeobox genes than previously reported. All homeobox genes can be unambiguously classified by sequence evolutionary analysis into 14 distinct classes also characterized by conserved intron–exon structure and by unique codomain architectures. We identified many new genes belonging to previously defined classes (HD-ZIP I to IV, BEL, KNOX, PLINC, WOX). Other newly identified genes allowed us to characterize PHD, DDT, NDX, and LD genes as members of four new evolutionary classes and to define two additional classes, which we named SAWADEE and PINTOX. Our comprehensive analysis allowed us to identify several newly characterized conserved motifs, including novel zinc finger motifs in SAWADEE and DDT. Members of the BEL and KNOX classes were found in Chlorobionta (green plants) and in Rhodophyta. We found representatives of the DDT, WOX, and PINTOX classes only in green plants, including unicellular green algae, moss, and vascular plants. All 14 homeobox gene classes were represented in flowering plants, Selaginella, and moss, suggesting that they had already differentiated in the last common ancestor of moss and vascular plants