4,888 research outputs found
Recommended from our members
Revealing Dynamic Mechanisms of Cell Fate Decisions From Single-Cell Transcriptomic Data.
Cell fate decisions play a pivotal role in development, but technologies for dissecting them are limited. We developed a multifunction new method, Topographer, to construct a "quantitative" Waddington's landscape of single-cell transcriptomic data. This method is able to identify complex cell-state transition trajectories and to estimate complex cell-type dynamics characterized by fate and transition probabilities. It also infers both marker gene networks and their dynamic changes as well as dynamic characteristics of transcriptional bursting along the cell-state transition trajectories. Applying this method to single-cell RNA-seq data on the differentiation of primary human myoblasts, we not only identified three known cell types, but also estimated both their fate probabilities and transition probabilities among them. We found that the percent of genes expressed in a bursty manner is significantly higher at (or near) the branch point (~97%) than before or after branch (below 80%), and that both gene-gene and cell-cell correlation degrees are apparently lower near the branch point than away from the branching. Topographer allows revealing of cell fate mechanisms in a coherent way at three scales: cell lineage (macroscopic), gene network (mesoscopic), and gene expression (microscopic)
Stochastic neural network models for gene regulatory networks
Recent advances in gene-expression profiling technologies provide large amounts of gene expression data. This raises the possibility for a functional understanding of genome dynamics by means of mathematical modelling. As gene expression involves intrinsic noise, stochastic models are essential for better descriptions of gene regulatory networks. However, stochastic modelling for large scale gene expression data sets is still in the very early developmental stage. In this paper we present some stochastic models by introducing stochastic processes into neural network models that can describe intermediate regulation for large scale gene networks. Poisson random variables are used to represent chance events in the processes of synthesis and degradation. For expression data with normalized concentrations, exponential or normal random variables are used to realize fluctuations. Using a network with three genes, we show how to use stochastic simulations for studying robustness and stability properties of gene expression patterns under the influence of noise, and how to use stochastic models to predict statistical distributions of expression levels in population of cells. The discussion suggest that stochastic neural network models can give better description of gene regulatory networks and provide criteria for measuring the reasonableness o mathematical models
Quantitative model for inferring dynamic regulation of the tumour suppressor gene p53
Background: The availability of various "omics" datasets creates a prospect of performing the study of genome-wide genetic regulatory networks. However, one of the major challenges of using mathematical models to infer genetic regulation from microarray datasets is the lack of information for protein concentrations and activities. Most of the previous researches were based on an assumption that the mRNA levels of a gene are consistent with its protein activities, though it is not always the case. Therefore, a more sophisticated modelling framework together with the corresponding inference methods is needed to accurately estimate genetic regulation from "omics" datasets.
Results: This work developed a novel approach, which is based on a nonlinear mathematical model, to infer genetic regulation from microarray gene expression data. By using the p53 network as a test system, we used the nonlinear model to estimate the activities of transcription factor (TF) p53 from the expression levels of its target genes, and to identify the activation/inhibition status of p53 to its target genes. The predicted top 317 putative p53 target genes were supported by DNA sequence analysis. A comparison between our prediction and the other published predictions of p53 targets suggests that most of putative p53 targets may share a common depleted or enriched sequence signal on their upstream non-coding region.
Conclusions: The proposed quantitative model can not only be used to infer the regulatory relationship between TF and its down-stream genes, but also be applied to estimate the protein activities of TF from the expression levels of its target genes
Experimental design trade-offs for gene regulatory network inference: an in silico study of the yeast Saccharomyces cerevisiae cell cycle
Time-series of high throughput gene sequencing data intended for gene
regulatory network (GRN) inference are often short due to the high costs of
sampling cell systems. Moreover, experimentalists lack a set of quantitative
guidelines that prescribe the minimal number of samples required to infer a
reliable GRN model. We study the temporal resolution of data vs quality of GRN
inference in order to ultimately overcome this deficit. The evolution of a
Markovian jump process model for the Ras/cAMP/PKA pathway of proteins and
metabolites in the G1 phase of the Saccharomyces cerevisiae cell cycle is
sampled at a number of different rates. For each time-series we infer a linear
regression model of the GRN using the LASSO method. The inferred network
topology is evaluated in terms of the area under the precision-recall curve
AUPR. By plotting the AUPR against the number of samples, we show that the
trade-off has a, roughly speaking, sigmoid shape. An optimal number of samples
corresponds to values on the ridge of the sigmoid
A stochastic and dynamical view of pluripotency in mouse embryonic stem cells
Pluripotent embryonic stem cells are of paramount importance for biomedical
research thanks to their innate ability for self-renewal and differentiation
into all major cell lines. The fateful decision to exit or remain in the
pluripotent state is regulated by complex genetic regulatory network. Latest
advances in transcriptomics have made it possible to infer basic topologies of
pluripotency governing networks. The inferred network topologies, however, only
encode boolean information while remaining silent about the roles of dynamics
and molecular noise in gene expression. These features are widely considered
essential for functional decision making. Herein we developed a framework for
extending the boolean level networks into models accounting for individual
genetic switches and promoter architecture which allows mechanistic
interrogation of the roles of molecular noise, external signaling, and network
topology. We demonstrate the pluripotent state of the network to be a broad
attractor which is robust to variations of gene expression. Dynamics of exiting
the pluripotent state, on the other hand, is significantly influenced by the
molecular noise originating from genetic switching events which makes cells
more responsive to extracellular signals. Lastly we show that steady state
probability landscape can be significantly remodeled by global gene switching
rates alone which can be taken as a proxy for how global epigenetic
modifications exert control over stability of pluripotent states.Comment: 11 pages, 7 figure
A hierarchical Bayesian model for inference of copy number variants and their association to gene expression
A number of statistical models have been successfully developed for the
analysis of high-throughput data from a single source, but few methods are
available for integrating data from different sources. Here we focus on
integrating gene expression levels with comparative genomic hybridization (CGH)
array measurements collected on the same subjects. We specify a measurement
error model that relates the gene expression levels to latent copy number
states which, in turn, are related to the observed surrogate CGH measurements
via a hidden Markov model. We employ selection priors that exploit the
dependencies across adjacent copy number states and investigate MCMC stochastic
search techniques for posterior inference. Our approach results in a unified
modeling framework for simultaneously inferring copy number variants (CNV) and
identifying their significant associations with mRNA transcripts abundance. We
show performance on simulated data and illustrate an application to data from a
genomic study on human cancer cell lines.Comment: Published in at http://dx.doi.org/10.1214/13-AOAS705 the Annals of
Applied Statistics (http://www.imstat.org/aoas/) by the Institute of
Mathematical Statistics (http://www.imstat.org
Modelling capture efficiency of single-cell RNA-sequencing data improves inference of transcriptome-wide burst kinetics
Motivation:
Gene expression is characterized by stochastic bursts of transcription that occur at brief and random periods of promoter activity. The kinetics of gene expression burstiness differs across the genome and is dependent on the promoter sequence, among other factors. Single-cell RNA sequencing (scRNA-seq) has made it possible to quantify the cell-to-cell variability in transcription at a global genome-wide level. However, scRNA-seq data are prone to technical variability, including low and variable capture efficiency of transcripts from individual cells.
//
Results:
Here, we propose a novel mathematical theory for the observed variability in scRNA-seq data. Our method captures burst kinetics and variability in both the cell size and capture efficiency, which allows us to propose several likelihood-based and simulation-based methods for the inference of burst kinetics from scRNA-seq data. Using both synthetic and real data, we show that the simulation-based methods provide an accurate, robust and flexible tool for inferring burst kinetics from scRNA-seq data. In particular, in a supervised manner, a simulation-based inference method based on neural networks proves to be accurate and useful when applied to both allele and nonallele-specific scRNA-seq data.
//
Availability and implementation:
The code for Neural Network and Approximate Bayesian Computation inference is available at https://github.com/WT215/nnRNA and https://github.com/WT215/Julia_ABC, respectively
Mathematical modelling plant signalling networks
During the last two decades, molecular genetic studies and the completion of the sequencing of the Arabidopsis thaliana genome have increased knowledge of hormonal regulation in plants. These signal transduction pathways act in concert through gene regulatory and signalling networks whose main components have begun to be elucidated. Our understanding of the resulting cellular processes is hindered by the complex, and sometimes counter-intuitive, dynamics of the networks, which may be interconnected through feedback controls and cross-regulation. Mathematical modelling provides a valuable tool to investigate such dynamics and to perform in silico experiments that may not be easily carried out in a laboratory. In this article, we firstly review general methods for modelling gene and signalling networks and their application in plants. We then describe specific models of hormonal perception and cross-talk in plants. This sub-cellular analysis paves the way for more comprehensive mathematical studies of hormonal transport and signalling in a multi-scale setting
- …