17 research outputs found
LGEM: a first-order logic framework for automated improvement of metabolic network models through abduction
Scientific discovery in biology is difficult due to the complexity of the
systems involved and the expense of obtaining high quality experimental data.
Automated techniques are a promising way to make scientific discoveries at the
scale and pace required to model large biological systems. A key problem for
21st century biology is to build a computational model of the eukaryotic cell.
The yeast Saccharomyces cerevisiae is the best understood eukaryote, and
genome-scale metabolic models (GEMs) are rich sources of background knowledge
that we can use as a basis for automated inference and investigation.
We present LGEM+, a system for automated abductive improvement of GEMs
consisting of: a compartmentalised first-order logic framework for describing
biochemical pathways (using curated GEMs as the expert knowledge source); and a
two-stage hypothesis abduction procedure.
We demonstrate that deductive inference on logical theories created using
LGEM+, using the automated theorem prover iProver, can predict growth/no-growth
of S. cerevisiae strains in minimal media. LGEM+ proposed 2094 unique candidate
hypotheses for model improvement. We assess the value of the generated
hypotheses using two criteria: (a) genome-wide single-gene essentiality
prediction, and (b) constraint of flux-balance analysis (FBA) simulations. For
(b) we developed an algorithm to integrate FBA with the logic model. We rank
and filter the hypotheses using these assessments. We intend to test these
hypotheses using the robot scientist Genesis, which is based around chemostat
cultivation and high-throughput metabolomics.Comment: 15 pages, one figure, two tables, two algorithm
Identification and characterisation of two high-affinity glucose transporters from the spoilage yeast Brettanomyces bruxellensis
The yeast Brettanomyces bruxellensis (syn. Dekkera bruxellensis) is an emerging and undesirable contaminant in industrial low-sugar ethanol fermentations that employ the yeast Saccharomyces cerevisiae. High-affinity glucose import in B. bruxellensis has been proposed to be the mechanism by which this yeast can outcompete S. cerevisiae. The present study describes the characterization of two B. bruxellensis genes (BHT1 and BHT3) believed to encode putative high-affinity glucose transporters. In vitro-generated transcripts of both genes as well as the S. cerevisiae HXT7 high-affinity glucose transporter were injected into Xenopus laevis oocytes and subsequent glucose uptake rates were assayed using 14C-labelled glucose. At 0.1 mM glucose, Bht1p was shown to transport glucose five times faster than Hxt7p. pH affected the rate of glucose transport by Bht1p and Bht3p, indicating an active glucose transport mechanism that involves proton symport. These results suggest a possible role for BHT1 and BHT3 in the competitive ability of B. bruxellensis
Recommended from our members
Interpreting protein abundance in Saccharomyces cerevisiae through relational learning.
Funder: Knut and Alice Wallenberg Foundation; DOI: https://doi.org/10.13039/501100004063MOTIVATION: Proteomic profiles reflect the functional readout of the physiological state of an organism. An increased understanding of what controls and defines protein abundances is of high scientific interest. Saccharomyces cerevisiae is a well-studied model organism, and there is a large amount of structured knowledge on yeast systems biology in databases such as the Saccharomyces Genome Database, and highly curated genome-scale metabolic models like Yeast8. These datasets, the result of decades of experiments, are abundant in information, and adhere to semantically meaningful ontologies. RESULTS: By representing this knowledge in an expressive Datalog database we generated data descriptors using relational learning that, when combined with supervised machine learning, enables us to predict protein abundances in an explainable manner. We learnt predictive relationships between protein abundances, function and phenotype; such as α-amino acid accumulations and deviations in chronological lifespan. We further demonstrate the power of this methodology on the proteins His4 and Ilv2, connecting qualitative biological concepts to quantified abundances. AVAILABILITY AND IMPLEMENTATION: All data and processing scripts are available at the following Github repository: https://github.com/DanielBrunnsaker/ProtPredict
LGEM+: A First-Order Logic Framework for Automated Improvement of Metabolic Network Models Through Abduction
Scientific discovery in biology is difficult due to the complexity of the systems involved and the expense of obtaining high quality experimental data. Automated techniques are a promising way to make scientific discoveries at the scale and pace required to model large biological systems. A key problem for 21st century biology is to build a computational model of the eukaryotic cell. The yeast Saccharomyces cerevisiae is the best understood eukaryote, and genome-scale metabolic models (GEMs) are rich sources of background knowledge that we can use as a basis for automated inference and investigation.We present LGEM+, a system for automated abductive improvement of GEMs consisting of: a compartmentalised first-order logic framework for describing biochemical pathways (using curated GEMs as the expert knowledge source); and a two-stage hypothesis abduction procedure.We demonstrate that deductive inference on logical theories created using LGEM+, using the automated theorem prover iProver, can predict growth/no-growth of S. cerevisiae strains in minimal media. LGEM+ proposed 2094 unique candidate hypotheses for model improvement. We assess the value of the generated hypotheses using two criteria: (a) genome-wide single-gene essentiality prediction, and (b) constraint of flux-balance analysis (FBA) simulations. For (b) we developed an algorithm to integrate FBA with the logic model. We rank and filter the hypotheses using these assessments. We intend to test these hypotheses using the robot scientist Genesis, which is based around chemostat cultivation and high-throughput metabolomics
Genome-scale model of Rhodotorula toruloides metabolism.
The basidiomycete red yeast Rhodotorula toruloides is a promising platform organism for production of biooils. We present rhto-GEM, the first genome-scale model (GEM) of R. toruloides metabolism, that was largely reconstructed using RAVEN toolbox. The model includes 852 genes, 2,731 reactions, and 2,277 metabolites, while lipid metabolism is described using the SLIMEr formalism allowing direct integration of lipid class and acyl chain experimental distribution data. The simulation results confirmed that the R. toruloides model provides valid growth predictions on glucose, xylose, and glycerol, while prediction of genetic engineering targets to increase production of linolenic acid, triacylglycerols, and carotenoids identified genes-some of which have previously been engineered to successfully increase production. This renders rtho-GEM valuable for future studies to improve the production of other oleochemicals of industrial relevance including value-added fatty acids and carotenoids, in addition to facilitate system-wide omics-data analysis in R. toruloides. Expanding the portfolio of GEMs for lipid-accumulating fungi contributes to both understanding of metabolic mechanisms of the oleaginous phenotype but also uncover particularities of the lipid production machinery in R. toruloides
Recommended from our members
High-throughput metabolomics for the design and validation of a diauxic shift model.
Funder: The Wallenberg AI, Autonomous Systems and Software ProgramSaccharomyces cerevisiae is a very well studied organism, yet ∼20% of its proteins remain poorly characterized. Moreover, recent studies seem to indicate that the pace of functional discovery is slow. Previous work has implied that the most probable path forward is via not only automation but fully autonomous systems in which active learning is applied to guide high-throughput experimentation. Development of tools and methods for these types of systems is of paramount importance. In this study we use constrained dynamical flux balance analysis (dFBA) to select ten regulatory deletant strains that are likely to have previously unexplored connections to the diauxic shift. We then analyzed these deletant strains using untargeted metabolomics, generating profiles which were then subsequently investigated to better understand the consequences of the gene deletions in the metabolic reconfiguration of the diauxic shift. We show that metabolic profiles can be utilised to not only gaining insight into cellular transformations such as the diauxic shift, but also on regulatory roles and biological consequences of regulatory gene deletion. We also conclude that untargeted metabolomics is a useful tool for guidance in high-throughput model improvement, and is a fast, sensitive and informative approach appropriate for future large-scale functional analyses of genes. Moreover, it is well-suited for automated approaches due to relative simplicity of processing and the potential to make massively high-throughput
Recommended from our members
High-throughput metabolomics for the design and validation of a diauxic shift model.
Acknowledgements: This work was partially supported by the Wallenberg AI, Autonomous Systems and Software Program (WASP) funded by the Knut and Alice Wallenberg Foundation, and the Swedish Research Council Formas (2020-01690).Funder: The Wallenberg AI, Autonomous Systems and Software Program (WASP)Saccharomyces cerevisiae is a very well studied organism, yet ∼20% of its proteins remain poorly characterized. Moreover, recent studies seem to indicate that the pace of functional discovery is slow. Previous work has implied that the most probable path forward is via not only automation but fully autonomous systems in which active learning is applied to guide high-throughput experimentation. Development of tools and methods for these types of systems is of paramount importance. In this study we use constrained dynamical flux balance analysis (dFBA) to select ten regulatory deletant strains that are likely to have previously unexplored connections to the diauxic shift. We then analyzed these deletant strains using untargeted metabolomics, generating profiles which were then subsequently investigated to better understand the consequences of the gene deletions in the metabolic reconfiguration of the diauxic shift. We show that metabolic profiles can be utilised to not only gaining insight into cellular transformations such as the diauxic shift, but also on regulatory roles and biological consequences of regulatory gene deletion. We also conclude that untargeted metabolomics is a useful tool for guidance in high-throughput model improvement, and is a fast, sensitive and informative approach appropriate for future large-scale functional analyses of genes. Moreover, it is well-suited for automated approaches due to relative simplicity of processing and the potential to make massively high-throughput
High-throughput metabolomics for the design and validation of a diauxic shift model
Abstract Saccharomyces cerevisiae is a very well studied organism, yet ∼20% of its proteins remain poorly characterized. Moreover, recent studies seem to indicate that the pace of functional discovery is slow. Previous work has implied that the most probable path forward is via not only automation but fully autonomous systems in which active learning is applied to guide high-throughput experimentation. Development of tools and methods for these types of systems is of paramount importance. In this study we use constrained dynamical flux balance analysis (dFBA) to select ten regulatory deletant strains that are likely to have previously unexplored connections to the diauxic shift. We then analyzed these deletant strains using untargeted metabolomics, generating profiles which were then subsequently investigated to better understand the consequences of the gene deletions in the metabolic reconfiguration of the diauxic shift. We show that metabolic profiles can be utilised to not only gaining insight into cellular transformations such as the diauxic shift, but also on regulatory roles and biological consequences of regulatory gene deletion. We also conclude that untargeted metabolomics is a useful tool for guidance in high-throughput model improvement, and is a fast, sensitive and informative approach appropriate for future large-scale functional analyses of genes. Moreover, it is well-suited for automated approaches due to relative simplicity of processing and the potential to make massively high-throughput
Chromosomal genome assembly of the ethanol production strain CBS 11270 indicates a highly dynamic genome structure in the yeast species Brettanomyces bruxellensis
Here, we present the genome of the industrial ethanol production strain Brettanomyces bruxellensis CBS 11270. The nuclear genome was found to be diploid, containing four chromosomes with sizes of ranging from 2.2 to 4.0 Mbp. A 75 Kbp mitochondrial genome was also identified. Comparing the homologous chromosomes, we detected that 0.32% of nucleotides were polymorphic, i.e. formed single nucleotide polymorphisms (SNPs), 40.6% of them were found in coding regions (i.e. 0.13% of all nucleotides formed SNPs and were in coding regions). In addition, 8,538 indels were found. The total number of protein coding genes was 4897, of them, 4,284 were annotated on chromosomes; and the mitochondrial genome contained 18 protein coding genes. Additionally, 595 genes, which were annotated, were on contigs not associated with chromosomes. A number of genes was duplicated, most of them as tandem repeats, including a six-gene cluster located on chromosome 3. There were also examples of interchromosomal gene duplications, including a duplication of a six-gene cluster, which was found on both chromosomes 1 and 4. Gene copy number analysis suggested loss of heterozygosity for 372 genes. This may reflect adaptation to relatively harsh but constant conditions of continuous fermentation. Analysis of gene topology showed that most of these losses occurred in clusters of more than one gene, the largest cluster comprising 33 genes. Comparative analysis against the wine isolate CBS 2499 revealed 88,534 SNPs and 8,133 indels. Moreover, when the scaffolds of the CBS 2499 genome assembly were aligned against the chromosomes of CBS 11270, many of them aligned completely, some have chunks aligned to different chromosomes, and some were in fact rearranged. Our findings indicate a highly dynamic genome within the species B. bruxellensis and a tendency towards reduction of gene number in long-term continuous cultivation
Expression of <i>D. bruxellensis</i> genes that do not have homologs in <i>S. cerevisiae</i>.
a<p>RPKM is a measure of gene expression level expressed as number of reads per 1-K base pairs and million mapped reads.</p