1,222 research outputs found
The poly-omics of ageing through individual-based metabolic modelling
Abstract Background Ageing can be classified in two different ways, chronological ageing and biological ageing. While chronological age is a measure of the time that has passed since birth, biological (also known as transcriptomic) ageing is defined by how time and the environment affect an individual in comparison to other individuals of the same chronological age. Recent research studies have shown that transcriptomic age is associated with certain genes, and that each of those genes has an effect size. Using these effect sizes we can calculate the transcriptomic age of an individual from their age-associated gene expression levels. The limitation of this approach is that it does not consider how these changes in gene expression affect the metabolism of individuals and hence their observable cellular phenotype. Results We propose a method based on poly-omic constraint-based models and machine learning in order to further the understanding of transcriptomic ageing. We use normalised CD4 T-cell gene expression data from peripheral blood mononuclear cells in 499 healthy individuals to create individual metabolic models. These models are then combined with a transcriptomic age predictor and chronological age to provide new insights into the differences between transcriptomic and chronological ageing. As a result, we propose a novel metabolic age predictor. Conclusions We show that our poly-omic predictors provide a more detailed analysis of transcriptomic ageing compared to gene-based approaches, and represent a basis for furthering our knowledge of the ageing mechanisms in human cells
Machine and deep learning meet genome-scale metabolic modeling
Omic data analysis is steadily growing as a driver of basic and applied molecular biology research. Core to the interpretation of complex and heterogeneous biological phenotypes are computational approaches in the fields of statistics and machine learning. In parallel, constraint-based metabolic modeling has established itself as the main tool to investigate large-scale relationships between genotype, phenotype, and environment. The development and application of these methodological frameworks have occurred independently for the most part, whereas the potential of their integration for biological, biomedical, and biotechnological research is less known. Here, we describe how machine learning and constraint-based modeling can be combined, reviewing recent works at the intersection of both domains and discussing the mathematical and practical aspects involved. We overlap systematic classifications from both frameworks, making them accessible to nonexperts. Finally, we delineate potential future scenarios, propose new joint theoretical frameworks, and suggest concrete points of investigation for this joint subfield. A multiview approach merging experimental and knowledge-driven omic data through machine learning methods can incorporate key mechanistic information in an otherwise biologically-agnostic learning process
Genome-scale models as a vehicle for knowledge transfer from microbial to mammalian cell systems
With the plethora of omics data becoming available for mammalian cell and, increasingly, human cell systems, Genome-scale metabolic models (GEMs) have emerged as a useful tool for their organisation and analysis. The systems biology community has developed an array of tools for the solution, interrogation and customisation of GEMs as well as algorithms that enable the design of cells with desired phenotypes based on the multi-omics information contained in these models. However, these tools have largely found application in microbial cells systems, which benefit from smaller model size and ease of experimentation. Herein, we discuss the major outstanding challenges in the use of GEMs as a vehicle for accurately analysing data for mammalian cell systems and transferring methodologies that would enable their use to design strains and processes. We provide insights on the opportunities and limitations of applying GEMs to human cell systems for advancing our understanding of health and disease. We further propose their integration with data-driven tools and their enrichment with cellular functions beyond metabolism, which would, in theory, more accurately describe how resources are allocated intracellularly
The era of big data: Genome-scale modelling meets machine learning
With omics data being generated at an unprecedented rate, genome-scale modelling has become pivotal in its organisation and analysis. However, machine learning methods have been gaining ground in cases where knowledge is insufficient to represent the mechanisms underlying such data or as a means for data curation prior to attempting mechanistic modelling. We discuss the latest advances in genome-scale modelling and the development of optimisation algorithms for network and error reduction, intracellular constraining and applications to strain design. We further review applications of supervised and unsupervised machine learning methods to omics datasets from microbial and mammalian cell systems and present efforts to harness the potential of both modelling approaches through hybrid modelling
Recommended from our members
Metabolic influences on ageing in Caenorhabditis elegans: A time series multi-omics and metabolic modelling study
Ageing presents one of the most fundamental public health challenges of our time. Progress in living standards, combating infectious disease and promoting safety, and general nutritional availability, has led to an increase in lifespan across the developed world. However, this has been accompanied by an increase in the duration of late-life frailty and associated conditions affecting health. Metabolism is known to be a key mediator of ageing across the diversity of living species. Many of the pathways that extend lifespan and promote healthspan are known to be metabolic and relate to managing the balance of energy availability to optimise resource usage and survival during times of scarcity.
The model organism Caenorhabditis elegans, a small transparent nematode worm that ordinarily lives in the soil and eats bacteria, is one of the most common organisms used in the study of ageing as it is easy to culture in laboratory conditions and has a short lifespan of around three weeks under normal conditions. In this thesis, I analyse in detail the metabolic changes that occur during ageing in C. elegans, using a multi-omics metabolomics and transcriptomics time series of measurements in three C. elegans strains, and mathematical modelling.
Whole-genome metabolic models are representations of all the metabolic reactions taking place within an organism together with their metabolic inputs and outputs, and enzymatic catalysts. I describe the development and validation of a community-wide shared whole-genome metabolic model for C. elegans. Using this model together with measured gene expression levels for each enzyme that catalyses a reaction, it is possible to predict intracellular reaction fluxes using a method called Flux Balance Analysis (FBA). I describe a novel method for the integration of metabolomics data with FBA, and the results of a comparative analysis of the resulting fluxes in normal wild-type ageing. I then go on to describe the differences to a germline-free strain that is long-lived and metabolically different.
Finally, I have used the model to probe the metabolic flexibility and evidence for trans-omics bidirectional regulation between the transcriptomic and metabolomic layers
Multimodal regularised linear models with flux balance analysis for mechanistic integration of omics data
Motivation: High-throughput biological data, thanks to technological advances, have become cheaper to collect, leading to the availability of vast amounts of omic data of different types. In parallel, the in silico reconstruction and modeling of metabolic systems is now acknowledged as a key tool to complement experimental data on a large scale. The integration of these model- and data-driven information is therefore emerging as a new challenge in systems biology, with no clear guidance on how to better take advantage of the inherent multisource and multiomic nature of these data types while preserving mechanistic interpretation. Results: Here, we investigate different regularization techniques for high-dimensional data derived from the integration of gene expression profiles with metabolic flux data, extracted from strain-specific metabolic models, to improve cellular growth rate predictions. To this end, we propose ad-hoc extensions of previous regularization frameworks including group, view-specific and principal component regularization and experimentally compare them using data from 1143 Saccharomyces cerevisiae strains. We observe a divergence between methods in terms of regression accuracy and integration effectiveness based on the type of regularization employed. In multiomic regression tasks, when learning from experimental and model-generated omic data, our results demonstrate the competitiveness and ease of interpretation of multimodal regularized linear models compared to data-hungry methods based on neural networks. Availability and implementation: All data, models and code produced in this work are available on GitHub at https://github.com/Angione-Lab/HybridGroupIPFLasso_pc2Lasso. Supplementary information: Supplementary data are available at Bioinformatics online
- …