177 research outputs found

    Geometry Selects Highly Designable Structures

    Full text link
    By enumerating all sequences of length 20, we study the designability of structures in a two-dimensional Hydrophobic-Polar (HP) lattice model in a wide range of inter-monomer interaction parameters. We find that although the histogram of designability depends on interaction parameters, the set of highly designable structures is invariant. So in the HP lattice model the High Designability should be a purely geometrical feature. Our results suggest two geometrical properties for highly designable structures, they have maximum number of contacts and unique neighborhood vector representation. Also we show that contribution of perfectly stable sequences in designability of structures plays a major role to make them highly designable.Comment: 6 figure, To be appear in JC

    Connecting growth with gene expression: of noise and numbers.

    Get PDF
    Growth is a dynamic process whereby cells accumulate mass. Growth rates of single cells are connected to RNA and protein synthesis rates, and therefore with biomolecule numbers. Noise in gene expression depends on these numbers, and is thus linked with cellular growth. Whether these global attributes of the cell participate in gene regulation is still largely unexplored. New experimental and modelling studies suggest that systemic variations in biomolecule numbers can coordinate cellular processes, including growth itself, through global regulatory feedback that acts in addition to genetic regulatory networks. Here, we review these findings and speculate on possible implications of this less appreciated layer of gene regulation for cellular physiology and adaptation to changing environments

    A coarse-grained resource allocation model of carbon and nitrogen metabolism in unicellular microbes

    Get PDF
    Coarse-grained resource allocation models (C-GRAMs) are simple mathematical models of cell physiology, where large components of the macromolecular composition are abstracted into single entities. The dynamics and steady-state behaviour of such models provides insights on optimal allocation of cellular resources and have explained experimentally observed cellular growth laws, but current models do not account for the uptake of compound sources of carbon and nitrogen. Here, we formulate a C-GRAM with nitrogen and carbon pathways converging on biomass production, with parametrizations accounting for respirofermentative and purely respiratory growth. The model describes the effects of the uptake of sugars, ammonium and/or compound nutrients such as amino acids on the translational resource allocation towards proteome sectors that maximized the growth rate. It robustly recovers cellular growth laws including the Monod law and the ribosomal growth law. Furthermore, we show how the growth-maximizing balance between carbon uptake, recycling, and excretion depends on the nutrient environment. Lastly, we find a robust linear correlation between the ribosome fraction and the abundance of amino acid equivalents in the optimal cell, which supports the view that simple regulation of translational gene expression can enable cells to achieve an approximately optimal growth state

    Semi-supervised classification and visualisation of multi-view data

    Get PDF
    An increasing number of multi-view data are being published by studies in several fields. This type of data corresponds to multiple data-views, each representing a different aspect of the same set of samples. We have recently proposed multi-SNE, an extension of t-SNE, that produces a single visualisation of multi-view data. The multi-SNE approach provides low-dimensional embeddings of the samples, produced by being updated iteratively through the different data-views. Here, we further extend multi-SNE to a semi-supervised approach, that classifies unlabelled samples by regarding the labelling information as an extra data-view. We look deeper into the performance, limitations and strengths of multi-SNE and its extension, S-multi-SNE, by applying the two methods on various multi-view datasets with different challenges. We show that by including the labelling information, the projection of the samples improves drastically and it is accompanied by a strong classification performance

    Biosensor Architectures for High-Fidelity Reporting of Cellular Signaling

    Get PDF
    Understanding mechanisms of information processing in cellular signaling networks requires quantitative measurements of protein activities in living cells. Biosensors are molecular probes that have been developed to directly track the activity of specific signaling proteins and their use is revolutionizing our understanding of signal transduction. The use of biosensors relies on the assumption that their activity is linearly proportional to the activity of the signaling protein they have been engineered to track. We use mechanistic mathematical models of common biosensor architectures (single-chain FRET-based biosensors), which include both intramolecular and intermolecular reactions, to study the validity of the linearity assumption. As a result of the classic mechanism of zero-order ultrasensitivity, we find that biosensor activity can be highly nonlinear so that small changes in signaling protein activity can give rise to large changes in biosensor activity and vice versa. This nonlinearity is abolished in architectures that favor the formation of biosensor oligomers, but oligomeric biosensors produce complicated FRET states. Based on this finding, we show that high-fidelity reporting is possible when a single-chain intermolecular biosensor is used that cannot undergo intramolecular reactions and is restricted to forming dimers. We provide phase diagrams that compare various trade-offs, including observer effects, which further highlight the utility of biosensor architectures that favor intermolecular over intramolecular binding. We discuss challenges in calibrating and constructing biosensors and highlight the utility of mathematical models in designing novel probes for cellular signaling

    Analytical distributions for stochastic gene expression

    Full text link
    Gene expression is significantly stochastic making modeling of genetic networks challenging. We present an approximation that allows the calculation of not only the mean and variance but also the distribution of protein numbers. We assume that proteins decay substantially slower than their mRNA and confirm that many genes satisfy this relation using high-throughput data from budding yeast. For a two-stage model of gene expression, with transcription and translation as first-order reactions, we calculate the protein distribution for all times greater than several mRNA lifetimes and thus qualitatively predict the distribution of times for protein levels to first cross an arbitrary threshold. If in addition the promoter fluctuates between inactive and active states, we can find the steady-state protein distribution, which can be bimodal if promoter fluctuations are slow. We show that our assumptions imply that protein synthesis occurs in geometrically distributed bursts and allows mRNA to be eliminated from a master equation description. In general, we find that protein distributions are asymmetric and may be poorly characterized by their mean and variance. Through maximum likelihood methods, our expressions should therefore allow more quantitative comparisons with experimental data. More generally, we introduce a technique to derive a simpler, effective dynamics for a stochastic system by eliminating a fast variable.Comment: Supplementary information can be found on PNAS websit

    Fission yeast obeys a linear size law under nutrient titration

    Get PDF
    Steady-state cell size and geometry depend on growth conditions. Here, we use an experimental setup based on continuous culture and single-cell imaging to study how cell volume, length, width and surface-to-volume ratio vary across a range of growth conditions including nitrogen and carbon titration, the choice of nitrogen source, and translation inhibition. Overall, we find cell geometry is not fully determined by growth rate and depends on the specific mode of growth rate modulation. However, under nitrogen and carbon titrations, we observe that the cell volume and the growth rate follow the same linear scaling

    Modelling capture efficiency of single-cell RNA-sequencing data improves inference of transcriptome-wide burst kinetics

    Get PDF
    MOTIVATION: Gene expression is characterised by stochastic bursts of transcription that occur at brief and random periods of promoter activity. The kinetics of gene expression burstiness differs across the genome and is dependent on the promoter sequence, among other factors. Single-cell RNA sequencing (scRNA-seq) has made it possible to quantify the cell-to-cell variability in transcription at a global genome-wide level. However, scRNA-seq data is prone to technical variability, including low and variable capture efficiency of transcripts from individual cells. RESULTS: Here, we propose a novel mathematical theory for the observed variability in scRNA-seq data. Our method captures burst kinetics and variability in both the cell size and capture efficiency, which allows us to propose several likelihood-based and simulation-based methods for the inference of burst kinetics from scRNA-seq data. Using both synthetic and real data, we show that the simulation-based methods provide an accurate, robust and flexible tool for inferring burst kinetics from scRNA-seq data. In particular, in a supervised manner, a simulation-based inference method based on neural networks proves to be accurate and useful when applied to both allele and non-allele-specific scRNA-seq data. AVAILABILITY: The code for Neural Network and Approximate Bayesian Computation inference is available at https://github.com/WT215/nnRNA and https://github.com/WT215/Julia_ABC respectively. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online
    corecore