16 research outputs found
A Bayesian approach to inferring chemical signal timing and amplitude in a temporal logic gate using the cell population distributional response
Stochastic gene expression poses an important challenge for engineering robust behaviors in a heterogeneous cell population. Cells address this challenge by operating on distributions of cellular responses generated by noisy processes. Similarly, a previously published temporal logic gate considers the distribution of responses across a cell population under chemical inducer pulsing events. The design uses a system of two integrases to engineer an E. coli strain with four DNA states that records the temporal order of two chemical signal events. The heterogeneous cell population response was used to infer the timing and duration of the two chemical signals for a small set of events. Here we use the temporal logic gate system to address the problem of extracting information about chemical signal events. We use the heterogeneous cell population response to infer whether any event has occurred or not and also to infer its properties such as timing and amplitude. Bayesian inference provides a natural framework to answer our questions about chemical signal occurrence, timing, and amplitude. We develop a probabilistic model that incorporates uncertainty in the how well our model captures the cell population and in how well a sample of measured cells represents the entire population. Using our probabilistic model and cell population measurements taken every five minutes on generated data, we ask how likely it was to observe the data for parameter values that describe square-shaped inducer pulses. We compare the likelihood functions associated with the probabilistic models for the event with the chemical signal pulses turned on versus turned off. Hence, we can determine whether an event of chemical induction of integrase expression has occurred or not. Using Markov Chain Monte Carlo, we sample the posterior distribution of chemical pulse parameters to identify likely pulses that produce the data measurements. We implement this method and obtain accurate results for detecting chemical inducer pulse timing, length, and amplitude. We can detect and identify chemical inducer pulses as short as half an hour, as well as all pulse amplitudes that fall under biologically relevant conditions
A Bayesian approach to inferring chemical signal timing and amplitude in a temporal logic gate using the cell population distributional response
Stochastic gene expression poses an important challenge for engineering robust behaviors in a heterogeneous cell population. Cells address this challenge by operating on distributions of cellular responses generated by noisy processes. Similarly, a previously published temporal logic gate considers the distribution of responses across a cell population under chemical inducer pulsing events. The design uses a system of two integrases to engineer an E. coli strain with four DNA states that records the temporal order of two chemical signal events. The heterogeneous cell population response was used to infer the timing and duration of the two chemical signals for a small set of events. Here we use the temporal logic gate system to address the problem of extracting information about chemical signal events. We use the heterogeneous cell population response to infer whether any event has occurred or not and also to infer its properties such as timing and amplitude. Bayesian inference provides a natural framework to answer our questions about chemical signal occurrence, timing, and amplitude. We develop a probabilistic model that incorporates uncertainty in the how well our model captures the cell population and in how well a sample of measured cells represents the entire population. Using our probabilistic model and cell population measurements taken every five minutes on generated data, we ask how likely it was to observe the data for parameter values that describe square-shaped inducer pulses. We compare the likelihood functions associated with the probabilistic models for the event with the chemical signal pulses turned on versus turned off. Hence, we can determine whether an event of chemical induction of integrase expression has occurred or not. Using Markov Chain Monte Carlo, we sample the posterior distribution of chemical pulse parameters to identify likely pulses that produce the data measurements. We implement this method and obtain accurate results for detecting chemical inducer pulse timing, length, and amplitude. We can detect and identify chemical inducer pulses as short as half an hour, as well as all pulse amplitudes that fall under biologically relevant conditions
Bayesian System Identification using auxiliary stochastic dynamical systems
Bayesian approaches to statistical inference and system identification became practical with the development of effective sampling methods like Markov Chain Monte Carlo (MCMC). However, because the size and complexity of inference problems has dramatically increased, improved MCMC methods are required. Dynamical systems based samplers are an effective extension of traditional MCMC methods. These samplers treat the posterior probability distribution as the potential energy function of a dynamical system, enabling them to better exploit the structure of the inference problem. We present an algorithm, Second-Order Langevin MCMC (SOL-MC), a stochastic dynamical system based MCMC algorithm, which uses the damped second-order Langevin stochastic differential equation (SDE) to sample a posterior distribution. We design the SDE such that the desired posterior probability distribution is its stationary distribution. Since this method is based upon an underlying dynamical system, we can utilize existing work to develop, implement, and optimize the sampler's performance. As such, we can choose parameters which speed up the convergence to the stationary distribution and reduce temporal state and energy correlations in the samples. We then apply this sampler to a system identification problem for a non-linear hysteretic structure model to investigate this method under globally identifiable and unidentifiable conditions
Bayesian System Identification using auxiliary stochastic dynamical systems
Bayesian approaches to statistical inference and system identification became practical with the development of effective sampling methods like Markov Chain Monte Carlo (MCMC). However, because the size and complexity of inference problems has dramatically increased, improved MCMC methods are required. Dynamical systems based samplers are an effective extension of traditional MCMC methods. These samplers treat the posterior probability distribution as the potential energy function of a dynamical system, enabling them to better exploit the structure of the inference problem. We present an algorithm, Second-Order Langevin MCMC (SOL-MC), a stochastic dynamical system based MCMC algorithm, which uses the damped second-order Langevin stochastic differential equation (SDE) to sample a posterior distribution. We design the SDE such that the desired posterior probability distribution is its stationary distribution. Since this method is based upon an underlying dynamical system, we can utilize existing work to develop, implement, and optimize the sampler's performance. As such, we can choose parameters which speed up the convergence to the stationary distribution and reduce temporal state and energy correlations in the samples. We then apply this sampler to a system identification problem for a non-linear hysteretic structure model to investigate this method under globally identifiable and unidentifiable conditions
Heterogeneity, correlations and financial contagion
We consider a model of contagion in financial networks recently introduced in [1], and we characterize the effect of a few features empirically observed in real networks on the stability of the system. Notably, we consider the effect of heterogeneous degree distributions, heterogeneous balance sheet size and degree correlations between banks. We study the probability of contagion conditional on the failure of a random bank, the most connected bank and the biggest bank, and we consider the effect of targeted policies aimed at increasing the capital requirements of a few banks with high connectivity or big balance sheets. Networks with heterogeneous degree distributions are shown to be more resilient to contagion triggered by the failure of a random bank, but more fragile with respect to contagion triggered by the failure of highly connected nodes. A power law distribution of balance sheet size is shown to induce an inefficient diversification that makes the system more prone to contagion events. A targeted policy aimed at reinforcing the stability of the biggest banks is shown to improve the stability of the system in the regime of high average degree. Finally, disassortative mixing, such as that observed in real banking networks, is shown to enhance the stability of the system.
Heterogeneity, correlations and financial contagion
We consider a model of contagion in financial networks recently introduced in the literature, and we characterize the effect of a few features empirically observed in real networks on the stability of the system. Notably, we consider the effect of heterogeneous degree distributions, heterogeneous balance sheet size and degree correlations between banks. We study the probability of contagion conditional on the failure of a random bank, the most connected bank and the biggest bank, and we consider the effect of targeted policies aimed at increasing the capital requirements of a few banks with high connectivity or big balance sheets. Networks with heterogeneous degree distributions are shown to be more resilient to contagion triggered by the failure of a random bank, but more fragile with respect to contagion triggered by the failure of highly connected nodes. A power law distribution of balance sheet size is shown to induce an inefficient diversification that makes the system more prone to contagion events. A targeted policy aimed at reinforcing the stability of the biggest banks is shown to improve the stability of the system in the regime of high average degree. Finally, disassortative mixing, such as that observed in real banking networks, is shown to enhance the stability of the system.