247 research outputs found
Statistical analysis of network data and evolution on GPUs: High-performance statistical computing
Network analysis typically involves as set of repetitive tasks that are particularly amenable to poor-man's parallelization. This is therefore an ideal application are for GPU architectures, which help to alleviate the tedium inherent to statistically sound analysis of network data. Here we will illustrate the use of GPUs in a range of applications, which include percolation processes on networks, the evolution of protein-protein interaction networks, and the fusion of different types of biomedical and disease data in the context of molecular interaction networks. We will pay particular attention to the numerical performance of different routines that are frequently invoked in network analysis problems. We conclude with a review over recent developments in the generation of random numbers that address the specific requirements posed by GPUs and high-performance computing needs
Bayesian design of synthetic biological systems
Here we introduce a new design framework for synthetic biology that exploits
the advantages of Bayesian model selection. We will argue that the difference
between inference and design is that in the former we try to reconstruct the
system that has given rise to the data that we observe, while in the latter, we
seek to construct the system that produces the data that we would like to
observe, i.e. the desired behavior. Our approach allows us to exploit methods
from Bayesian statistics, including efficient exploration of models spaces and
high-dimensional parameter spaces, and the ability to rank models with respect
to their ability to generate certain types of data. Bayesian model selection
furthermore automatically strikes a balance between complexity and (predictive
or explanatory) performance of mathematical models. In order to deal with the
complexities of molecular systems we employ an approximate Bayesian computation
scheme which only requires us to simulate from different competing models in
order to arrive at rational criteria for choosing between them. We illustrate
the advantages resulting from combining the design and modeling (or in-silico
prototyping) stages currently seen as separate in synthetic biology by
reference to deterministic and stochastic model systems exhibiting adaptive and
switch-like behavior, as well as bacterial two-component signaling systems.Comment: 36 pages, 16 figure
Great cities look small
Great cities connect people; failed cities isolate people. Despite the
fundamental importance of physical, face-to-face social-ties in the functioning
of cities, these connectivity networks are not explicitly observed in their
entirety. Attempts at estimating them often rely on unrealistic
over-simplifications such as the assumption of spatial homogeneity. Here we
propose a mathematical model of human interactions in terms of a local strategy
of maximising the number of beneficial connections attainable under the
constraint of limited individual travelling-time budgets. By incorporating
census and openly-available online multi-modal transport data, we are able to
characterise the connectivity of geometrically and topologically complex
cities. Beyond providing a candidate measure of greatness, this model allows
one to quantify and assess the impact of transport developments, population
growth, and other infrastructure and demographic changes on a city. Supported
by validations of GDP and HIV infection rates across United States metropolitan
areas, we illustrate the effect of changes in local and city-wide
connectivities by considering the economic impact of two contemporary inter-
and intra-city transport developments in the United Kingdom: High Speed Rail 2
and London Crossrail. This derivation of the model suggests that the scaling of
different urban indicators with population size has an explicitly mechanistic
origin.Comment: 19 pages, 8 figure
Parametric and non-parametric gradient matching for network inference:a comparison
Abstract Background Reverse engineering of gene regulatory networks from time series gene-expression data is a challenging problem, not only because of the vast sets of candidate interactions but also due to the stochastic nature of gene expression. We limit our analysis to nonlinear differential equation based inference methods. In order to avoid the computational cost of large-scale simulations, a two-step Gaussian process interpolation based gradient matching approach has been proposed to solve differential equations approximately. Results We apply a gradient matching inference approach to a large number of candidate models, including parametric differential equations or their corresponding non-parametric representations, we evaluate the network inference performance under various settings for different inference objectives. We use model averaging, based on the Bayesian Information Criterion (BIC), to combine the different inferences. The performance of different inference approaches is evaluated using area under the precision-recall curves. Conclusions We found that parametric methods can provide comparable, and often improved inference compared to non-parametric methods; the latter, however, require no kinetic information and are computationally more efficient
- …