927 research outputs found
Deep Reinforcement Learning for Control of Probabilistic Boolean Networks
Probabilistic Boolean Networks (PBNs) were introduced as a computational
model for the study of complex dynamical systems, such as Gene Regulatory
Networks (GRNs). Controllability in this context is the process of making
strategic interventions to the state of a network in order to drive it towards
some other state that exhibits favourable biological properties. In this paper
we study the ability of a Double Deep Q-Network with Prioritized Experience
Replay in learning control strategies within a finite number of time steps that
drive a PBN towards a target state, typically an attractor. The control method
is model-free and does not require knowledge of the network's underlying
dynamics, making it suitable for applications where inference of such dynamics
is intractable. We present extensive experiment results on two synthetic PBNs
and the PBN model constructed directly from gene-expression data of a study on
metastatic-melanoma
Deep Reinforcement Learning for Controlled Traversing of the Attractor Landscape of Boolean Models in the Context of Cellular Reprogramming
Cellular reprogramming can be used for both the prevention and cure of
different diseases. However, the efficiency of discovering reprogramming
strategies with classical wet-lab experiments is hindered by lengthy time
commitments and high costs. In this study, we develop a novel computational
framework based on deep reinforcement learning that facilitates the
identification of reprogramming strategies. For this aim, we formulate a
control problem in the context of cellular reprogramming for the frameworks of
BNs and PBNs under the asynchronous update mode. Furthermore, we introduce the
notion of a pseudo-attractor and a procedure for identification of
pseudo-attractor state during training. Finally, we devise a computational
framework for solving the control problem, which we test on a number of
different models
Recommended from our members
Statistical methods for the integrative analysis of single-cell multi-omics data
Single-cell profiling techniques have provided an unprecedented opportunity to study cellular heterogeneity at the molecular level. This represents a remarkable advance over traditional bulk sequencing methods, particularly to study lineage diversification and cell fate commitment events in heterogeneous biological processes. While the large majority of single-cell studies are focused on quantifying RNA expression, transcriptomic readouts provide only a single dimension of cellular heterogeneity. Recently, technological advances have enabled multiple biological layers to be probed in parallel one cell at a time, unveiling a powerful approach for investigating multiple dimensions of cellular heterogeneity. However, the increasing availability of multi-modal data sets needs to be accompanied by the development of suitable integrative strategies to fully exploit the data generated. In this thesis I worked in collaboration with different research groups to introduce innovative experimental and computational strategies for the integrative study of multi-omics at single-cell resolution.
The first contribution is the development of scNMT-seq, a protocol for the simultaneous profiling of RNA expression, DNA methylation and chromatin accessibility in single cells. I demonstrate how this assay provides a powerful approach for investigating regulatory relationships between the epigenome and the transcriptome within individual cells.
The second contribution is Multi-Omics Factor Analysis (MOFA), a statistical framework for the unsupervised integration of multi-omics data sets. MOFA is a Bayesian latent variable model that can be viewed as a statistically rigorous generalization of Principal Component Analysis to multi-omics data. The method provides a principled approach to retrieve, in an unsupervised manner, the underlying sources of sample heterogeneity while at the same time disentangling which axes of heterogeneity are shared across multiple modalities and which are specific to individual data modalities.
The third contribution is the generation of a comprehensive molecular roadmap of mouse gastrulation at single-cell resolution. We employed scNMT-seq to simultaneously profile RNA expression, DNA methylation and chromatin accessibility for hundreds of cells, spanning multiple time points from the exit from pluripotency to primary germ layer specification. Using MOFA, and other tools, I performed an integrative analysis of the multi-modal measurements, revealing novel insights into the role of the epigenome in regulating this key developmental process.
The fourth contribution is an extended formulation of the MOFA model tailored to the analysis of large-scale single-cell data with complex experimental designs. I extended the model to incorporate a flexible regularisation that enables the joint analysis of multiple omics as well as multiple sample groups (batches and/or experimental conditions). In addition, I implemented a GPU-accelerated stochastic variational inference framework, thus enabling the scalable analysis of potentially millions of samples
Computational Methods for the Analysis of Genomic Data and Biological Processes
In recent decades, new technologies have made remarkable progress in helping to understand biological systems. Rapid advances in genomic profiling techniques such as microarrays or high-performance sequencing have brought new opportunities and challenges in the fields of computational biology and bioinformatics. Such genetic sequencing techniques allow large amounts of data to be produced, whose analysis and cross-integration could provide a complete view of organisms. As a result, it is necessary to develop new techniques and algorithms that carry out an analysis of these data with reliability and efficiency. This Special Issue collected the latest advances in the field of computational methods for the analysis of gene expression data, and, in particular, the modeling of biological processes. Here we present eleven works selected to be published in this Special Issue due to their interest, quality, and originality
Evolution from the ground up with Amee – From basic concepts to explorative modeling
Evolutionary theory has been the foundation of biological research for about a century
now, yet over the past few decades, new discoveries and theoretical advances have rapidly
transformed our understanding of the evolutionary process. Foremost among them are
evolutionary developmental biology, epigenetic inheritance, and various forms of evolu-
tionarily relevant phenotypic plasticity, as well as cultural evolution, which ultimately led
to the conceptualization of an extended evolutionary synthesis. Starting from abstract
principles rooted in complexity theory, this thesis aims to provide a unified conceptual
understanding of any kind of evolution, biological or otherwise. This is used in the second
part to develop Amee, an agent-based model that unifies development, niche construction,
and phenotypic plasticity with natural selection based on a simulated ecology. Amee
is implemented in Utopia, which allows performant, integrated implementation and
simulation of arbitrary agent-based models. A phenomenological overview over Amee’s
capabilities is provided, ranging from the evolution of ecospecies down to the evolution
of metabolic networks and up to beyond-species-level biological organization, all of
which emerges autonomously from the basic dynamics. The interaction of development,
plasticity, and niche construction has been investigated, and it has been shown that while
expected natural phenomena can, in principle, arise, the accessible simulation time and
system size are too small to produce natural evo-devo phenomena and –structures. Amee thus can be used to simulate the evolution of a wide variety of processes
Enhanced understanding of protein glycosylation in CHO cells through computational tools and experimentation
Chinese hamster ovary (CHO) cells are the workhorse of the multibillion-dollar biopharmaceuticals industry. They have been extensively harnessed for recombinant protein synthesis, as they exhibit high titres and human-like post translational modifications (PTM), such as protein N-linked glycosylation. More specifically, N-linked glycosylation is a crucial PTM that includes the addition of an oligosaccharide in the backbone of the protein and strongly affects therapeutic efficacy and immunogenicity. In addition, the Quality by Design (QbD) paradigm that is broadly applied in academic research, necessitates a comprehensive understanding of the underlying biological relationships between the process parameters and the product quality attributes. To that end, computational tools have been vastly employed to elucidate cellular functions and predict the effect of process parameters on cell growth, product synthesis and quality. This thesis reports several advancements in the use of mathematical models for describing and optimizing bioprocesses. Firstly, a kinetic mathematical model describing CHO cell growth, metabolism, antibody synthesis and N-linked glycosylation was proposed, in order to capture the effect of galactose and uridine supplementation on cell growth and monoclonal antibody (mAb) glycosylation. Subsequently, the model was utilized to optimize galactosylation, a desired quality attribute of therapeutic mAbs. Following the QbD paradigm for ensuring product titre and quality, the kinetic model was subsequently used to identify an in silico Design Space (DS) that was also experimentally verified. An elaborate parameter estimation methodology was also developed in order to adapt the existing model to data from a newly introduced CHO cell line, without altering model structure. In an effort to reduce the burden of parameter estimation, the N-linked glycosylation submodel was replaced with an artificial neural network that was used as a standalone machine learning algorithm to predict the effect of feeding alterations and genetic engineering on the glycan distribution of several therapeutic proteins. In addition, a hybrid model configuration (HyGlycoM) incorporating the ANN-glycosylation model was also formulated to link extracellular process conditions to glycan distribution. The latter was found to outperform its fully kinetic equivalent when compared to experimental data. Finally, a comprehensive investigation of mAb galactosylation bottlenecks was carried out. Five fed-batch experiments with different concentrations of galactose and uridine supplemented throughout the culturing period, were carried out and were found to present similar mAb galactosylation. In order to identify the bottlenecks that limit galactosylation, further experimental analysis, including the investigation of glycans microheterogeneity of CHO host cell proteins (HCPs), was conducted. The experimental results were used to parameterize a novel and significant extension of the kinetic glycosylation model that simultaneously describes the N-linked glycosylation of both HCPs and the mAb product. Flux balance analysis was also used to analyse carbon and nitrogen metabolism using the experimental amino acid concentration profiles. In addition to the expression levels of the beta-1,4-galactosyltransferase enzyme, constraints imposed by the transport of the galactosylation sugar donor in the Golgi compartments and the consumption of resources towards HCPs glycosylation, were found to considerably influence mAb galactosylation.Open Acces
Adaptive networks for robotics and the emergence of reward anticipatory circuits
Currently the central challenge facing evolutionary robotics is to determine
how best to extend the range and complexity of behaviour supported by evolved
neural systems. Implicit in the work described in this thesis is the idea that this
might best be achieved through devising neural circuits (tractable to evolutionary
exploration) that exhibit complementary functional characteristics. We concentrate
on two problem domains; locomotion and sequence learning. For locomotion
we compare the use of GasNets and other adaptive networks. For sequence learning
we introduce a novel connectionist model inspired by the role of dopamine
in the basal ganglia (commonly interpreted as a form of reinforcement learning).
This connectionist approach relies upon a new neuron model inspired by notions
of energy efficient signalling. Two reward adaptive circuit variants were investigated.
These were applied respectively to two learning problems; where action
sequences are required to take place in a strict order, and secondly, where action
sequences are robust to intermediate arbitrary states. We conclude the thesis
by proposing a formal model of functional integration, encompassing locomotion
and sequence learning, extending ideas proposed by W. Ross Ashby.
A general model of the adaptive replicator is presented, incoporating subsystems
that are tuned to continuous variation and discrete or conditional events.
Comparisons are made with Ross W. Ashby's model of ultrastability and his
ideas on adaptive behaviour. This model is intended to support our assertion
that, GasNets (and similar networks) and reward adaptive circuits of the type
presented here, are intrinsically complementary. In conclusion we present some
ideas on how the co-evolution of GasNet and reward adaptive circuits might lead
us to significant improvements in the synthesis of agents capable of exhibiting
complex adaptive behaviour
Characterizing the Huntington's disease, Parkinson's disease, and pan-neurodegenerative gene expression signature with RNA sequencing
Huntington's disease (HD) and Parkinson's disease (PD) are devastating neurodegenerative disorders that are characterized pathologically by degeneration of neurons in the brain and clinically by loss of motor function and cognitive decline in mid to late life. The cause of neuronal degeneration in these diseases is unclear, but both are histologically marked by aggregation of specific proteins in specific brain regions. In HD, fragments of a mutant Huntingtin protein aggregate and cause medium spiny interneurons of the striatum to degenerate. In contrast, PD brains exhibit aggregation of toxic fragments of the alpha synuclein protein throughout the central nervous system and trigger degeneration of dopaminergic neurons in the substantia nigra. Considering the commonalities and differences between these diseases, identifying common biological patterns across HD and PD as well as signatures unique to each may provide significant insight into the molecular mechanisms underlying neurodegeneration as a general process. State-of-the-art high-throughput sequencing technology allows for unbiased, whole genome quantification of RNA molecules within a biological sample that can be used to assess the level of activity, or expression, of thousands of genes simultaneously. In this thesis, I present three studies characterizing the RNA expression profiles of post-mortem HD and PD subjects using high-throughput mRNA sequencing data sets. The first study describes an analysis of differential expression between HD individuals and neurologically normal controls that indicates a widespread increase in immune, neuroinflammatory, and developmental gene expression. The second study expands upon the first study by making methodological improvements and extends the differential expression analysis to include PD subjects, with the goal of comparing and contrasting HD and PD gene expression profiles. This study was designed to identify common mechanisms underlying the neurodegenerative phenotype, transcending those of each unique disease, and has revealed specific biological processes, in particular those related to NFkB inflammation, common to HD and PD. The last study describes a novel methodology for combining mRNA and miRNA expression that seeks to identify associations between mRNA-miRNA modules and continuous clinical variables of interest, including CAG repeat length and clinical age of onset in HD
- …