4,010 research outputs found
Systems approaches and algorithms for discovery of combinatorial therapies
Effective therapy of complex diseases requires control of highly non-linear
complex networks that remain incompletely characterized. In particular, drug
intervention can be seen as control of signaling in cellular networks.
Identification of control parameters presents an extreme challenge due to the
combinatorial explosion of control possibilities in combination therapy and to
the incomplete knowledge of the systems biology of cells. In this review paper
we describe the main current and proposed approaches to the design of
combinatorial therapies, including the empirical methods used now by clinicians
and alternative approaches suggested recently by several authors. New
approaches for designing combinations arising from systems biology are
described. We discuss in special detail the design of algorithms that identify
optimal control parameters in cellular networks based on a quantitative
characterization of control landscapes, maximizing utilization of incomplete
knowledge of the state and structure of intracellular networks. The use of new
technology for high-throughput measurements is key to these new approaches to
combination therapy and essential for the characterization of control
landscapes and implementation of the algorithms. Combinatorial optimization in
medical therapy is also compared with the combinatorial optimization of
engineering and materials science and similarities and differences are
delineated.Comment: 25 page
Multiscale metabolic modeling of C4 plants: connecting nonlinear genome-scale models to leaf-scale metabolism in developing maize leaves
C4 plants, such as maize, concentrate carbon dioxide in a specialized
compartment surrounding the veins of their leaves to improve the efficiency of
carbon dioxide assimilation. Nonlinear relationships between carbon dioxide and
oxygen levels and reaction rates are key to their physiology but cannot be
handled with standard techniques of constraint-based metabolic modeling. We
demonstrate that incorporating these relationships as constraints on reaction
rates and solving the resulting nonlinear optimization problem yields realistic
predictions of the response of C4 systems to environmental and biochemical
perturbations. Using a new genome-scale reconstruction of maize metabolism, we
build an 18000-reaction, nonlinearly constrained model describing mesophyll and
bundle sheath cells in 15 segments of the developing maize leaf, interacting
via metabolite exchange, and use RNA-seq and enzyme activity measurements to
predict spatial variation in metabolic state by a novel method that optimizes
correlation between fluxes and expression data. Though such correlations are
known to be weak in general, here the predicted fluxes achieve high correlation
with the data, successfully capture the experimentally observed base-to-tip
transition between carbon-importing tissue and carbon-exporting tissue, and
include a nonzero growth rate, in contrast to prior results from similar
methods in other systems. We suggest that developmental gradients may be
particularly suited to the inference of metabolic fluxes from expression data.Comment: 57 pages, 14 figures; submitted to PLoS Computational Biology; source
code available at http://github.com/ebogart/fluxtools and
http://github.com/ebogart/multiscale_c4_sourc
Towards a comprehensive modeling framework for studying glucose repression in yeast
The yeast Saccharomyces cerevisiae is an important model organism for human health and for industry applications as a cell factory. For both purposes, it has been an important organism for studying glucose repression. Glucose sensing and signaling is a complex biological system, where the SNF1 pathway is the main pathway responsible for glucose repression. However, it is highly interconnected with the cAMP-PKA, Snf3-Rgt2 and TOR pathways. To handle the complexity, mathematical modeling has successfully aided in elucidating the structure, mechanism, and dynamics of the pathway. In this thesis, I aim to elucidate what the effect of the interconnection of glucose repression with sensory and metabolic pathways in yeast is, specifically, how crosstalk influences the signaling cascade; what the main effects of nutrient signaling on the metabolism are and how those are affected by intrinsic stress, such as damage accumulation. Here, I have addressed these questions by developing new frameworks for mathematical modeling. A vector based method for Boolean representation of complex signaling events is presented. The method reduces the amount of necessary nodes and eases the interpretation of the Boolean states by separating different events that could alter the activity of a protein. This method was used to study how crosstalk influences the signaling cascade.To be able to represent a diverse biological network using methods suitable for respective pathways, we also developed two hybrid models. The first is demonstrating a framework to connect signaling pathways with metabolic networks, enabling the study of long-term signaling effects on the metabolism. The second hybrid model is demonstrating a framework to connect models of signaling and metabolism to growth and damage accumulation, enabling the study of how the long-term signaling effects on the metabolism influence the lifespan. This thesis represents a step towards comprehensive models of glucose repression. In addition, the methods and frameworks in this thesis can be applied and extended to other signaling pathways
Combining evolutionary algorithms with reaction rules towards focused molecular design
Designing novel small molecules with desirable properties and feasible synthesis continues to pose a significant challenge in drug discovery, particularly in the realm of natural products. Reaction-based gradient-free methods are promising approaches for designing new molecules as they ensure synthetic feasibility and provide potential synthesis paths. However, it is important to note that the novelty and diversity of the generated molecules highly depend on the availability of comprehensive reaction templates. To address this challenge, we introduce ReactEA, a new open-source evolutionary framework for computer-aided drug discovery that solely utilizes biochemical reaction rules. ReactEA optimizes molecular properties using a comprehensive set of 22,949 reaction rules, ensuring chemical validity and synthetic feasibility. ReactEA is versatile, as it can virtually optimize any objective function and track potential synthetic routes during the optimization process. To demonstrate its effectiveness, we apply ReactEA to various case studies, including the design of novel drug-like molecules and the optimization of pre-existing ligands. The results show that ReactEA consistently generates novel molecules with improved properties and reasonable synthetic routes, even for complex tasks such as improving binding affinity against the PARP1 enzyme when compared to existing inhibitors.Centre of Biological Engineering (CEB, University of Minho) for financial and equipment support. Portuguese Foundation for Science and Technology (FCT) under the scope of the strategic funding of UIDB/04469/2020 unit and through a Ph.D. scholarship awarded to João Correia (SFRH/BD/144314/2019). European Commission through the project SHIKIFACTORY100 - Modular cell factories for the production of 100 compounds from the shikimate pathway (Reference 814408).info:eu-repo/semantics/publishedVersio
METABOLIC MODELING AND OMICS-INTEGRATIVE ANALYSIS OF SINGLE AND MULTI-ORGANISM SYSTEMS: DISCOVERY AND REDESIGN
Computations and modeling have emerged as indispensable tools that drive the process of understanding, discovery, and redesign of biological systems. With the accelerating pace of genome sequencing and annotation information generation, the development of computational pipelines for the rapid reconstruction of high-quality genome-scale metabolic networks has received significant attention. These models provide a rich tapestry for computational tools to quantitatively assess the metabolic phenotypes for various systems-level studies and to develop engineering interventions at the DNA, RNA, or enzymatic level by careful tuning in the biophysical modeling frameworks. in silico genome-scale metabolic modeling algorithms based on the concept of optimization, along with the incorporation of multi-level omics information, provides a diverse array of toolboxes for new discovery in the metabolism of living organisms (which includes single-cell microbes, plants, animals, and microbial ecosystems) and allows for the reprogramming of metabolism for desired output(s). Throughout my doctoral research, I used genome-scale metabolic models and omics-integrative analysis tools to study how microbes, plants, animal, and microbial ecosystems respond or adapt to diverse environmental cues, and how to leverage the knowledge gleaned from that to answer important biological questions. Each chapter in this dissertation will provide a detailed description of the methodology, results, and conclusions from one specific research project. The research works presented in this dissertation represent important foundational advance in Systems Biology and are crucial for sustainable development in food, pharmaceuticals and bioproduction of the future.
Advisor: Rajib Sah
Robust Algorithms for Detecting Hidden Structure in Biological Data
Biological data, such as molecular abundance measurements and protein
sequences, harbor complex hidden structure that reflects its underlying
biological mechanisms. For example, high-throughput abundance measurements
provide a snapshot the global state of a living cell, while homologous
protein sequences encode the residue-level logic of the proteins\u27 function
and provide a snapshot of the evolutionary trajectory of the protein family.
In this work I describe algorithmic approaches and analysis software I
developed for uncovering hidden structure in both kinds of data.
Clustering is an unsurpervised machine learning technique commonly used
to map the structure of data collected in high-throughput experiments,
such as quantification of gene expression by DNA microarrays or
short-read sequencing. Clustering algorithms always yield a partitioning
of the data, but relying on a single partitioning solution can lead to
spurious conclusions. In particular, noise in the data can cause objects
to fall into the same cluster by chance rather than due to meaningful
association. In the first part of this thesis I demonstrate approaches to
clustering data robustly in the presence of noise and apply robust clustering
to analyze the transcriptional response to injury in a neuron cell.
In the second part of this thesis I describe identifying hidden specificity
determining residues (SDPs) from alignments of protein sequences descended
through gene duplication from a common ancestor (paralogs) and apply the
approach to identify numerous putative SDPs in bacterial transcription
factors in the LacI family. Finally, I describe and demonstrate a new
algorithm for reconstructing the history of duplications by which paralogs
descended from their common ancestor. This algorithm addresses the
complexity of such reconstruction due to indeterminate or erroneous
homology assignments made by sequence alignment algorithms and to the
vast prevalence of divergence through speciation over divergence through
gene duplication in protein evolution
Flux sampling is a powerful tool to study metabolism under changing environmental conditions
The development of high-throughput ‘omic techniques has sparked a rising interest in genome-scale metabolic models, with applications ranging from disease diagnostics to crop adaptation. Efficient and accurate methods are required to analyze large metabolic networks. Flux sampling can be used to explore the feasible flux solutions in metabolic networks by generating probability distributions of steady-state reaction fluxes. Unlike other methods, flux sampling can be used without assuming a particular cellular objective. We have undertaken a rigorous comparison of several sampling algorithms and concluded that the coordinate hit-and-run with rounding (CHRR) algorithm is the most efficient based on both run-time and multiple convergence diagnostics. We demonstrate the power of CHRR by using it to study the metabolic changes that underlie photosynthetic acclimation to cold of Arabidopsis thaliana plant leaves. In combination with experimental measurements, we show how the regulated interplay between diurnal starch and organic acid accumulation defines the plant acclimation process. We confirm fumarate accumulation as a requirement for cold acclimation and further predict γ–aminobutyric acid to have a key role in metabolic signaling under cold conditions. These results demonstrate how flux sampling can be used to analyze the feasible flux solutions across changing environmental conditions, whereas eliminating the need to make assumptions which introduce observer bias
- …