Search CORE

28 research outputs found

The thermodynamic landscape of carbon redox biochemistry

Author: Aspuru-Guzik Alán
Goldford Joshua
Jinich Adrian
Noor Elad
Ren Haniu
Sanchez-Lengeling Benjamin
Sanders Jacob
Segre Daniel
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 10/01/2018
Field of study

Redox biochemistry plays a key role in the transduction of chemical energy in all living systems. Observed redox reactions in metabolic networks represent only a minuscule fraction of the space of all possible redox reactions. Here we ask what distinguishes observed, natural redox biochemistry from the space of all possible redox reactions between natural and non-natural compounds. We generate the set of all possible biochemical redox reactions involving linear chain molecules with a fixed numbers of carbon atoms. Using cheminformatics and quantum chemistry tools we analyze the physicochemical and thermodynamic properties of natural and non-natural compounds and reactions. We find that among all compounds, aldose sugars are the ones with the highest possible number of connections (reductions and oxidations) to other molecules. Natural metabolites are significantly enriched in carboxylic acid functional groups and depleted in carbonyls, and have significantly higher solubilities than non-natural compounds. Upon constructing a thermodynamic landscape for the full set of reactions as a function of pH and of steady-state redox cofactor potential, we find that, over this whole range of conditions, natural metabolites have significantly lower energies than the non-natural compounds. For the set of 4-carbon compounds, we generate a Pourbaix phase diagram to determine which metabolites are local energetic minima in the landscape as a function of pH and redox potential. Our results suggest that, across a set of conditions, succinate and butyrate are local minima and would thus tend to accumulate at equilibrium. Our work suggests that metabolic compounds could have been selected for thermodynamic stability, and yields insight into thermodynamic and design principles governing nature’s metabolic redox reactions.https://www.biorxiv.org/content/10.1101/245811v1Othe

Crossref

Boston University Institutional Repository (OpenBU)

Recommended from our members

Quantum Chemical Approach to Estimating the Thermodynamics of Metabolic Reactions

Author: Aspuru-Guzik Alan
Dunn Ian
Even Arren Bar
Jinich Adrian
Noor Elad
Olivares-Amaya Roberto
Rappoport Dmitrij
Sanchez-Lengeling Benjamin
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Thermodynamics plays an increasingly important role in modeling and engineering metabolism. We present the first nonempirical computational method for estimating standard Gibbs reaction energies of metabolic reactions based on quantum chemistry, which can help fill in the gaps in the existing thermodynamic data. When applied to a test set of reactions from core metabolism, the quantum chemical approach is comparable in accuracy to group contribution methods for isomerization and group transfer reactions and for reactions not including multiply charged anions. The errors in standard Gibbs reaction energy estimates are correlated with the charges of the participating molecules. The quantum chemical approach is amenable to systematic improvements and holds potential for providing thermodynamic data for all of metabolism.Chemistry and Chemical Biolog

Harvard University - DASH

PubMed Central

MPG.PuRe

On scientific understanding with artificial intelligence

Author: Aldeghi Matteo
Aspuru-Guzik Alán
Cervera-Lierta Alba
Friederich Pascal
Gomes Gabriel dos Passos
Guo Si Yue
Häse Florian
Jinich Adrian
Krenn Mario
Nigam AkshatKumar
Pollice Robert
Yao Zhenpeng
Publication venue
Publication date: 04/04/2022
Field of study

Imagine an oracle that correctly predicts the outcome of every particle physics experiment, the products of every chemical reaction, or the function of every protein. Such an oracle would revolutionize science and technology as we know them. However, as scientists, we would not be satisfied with the oracle itself. We want more. We want to comprehend how the oracle conceived these predictions. This feat, denoted as scientific understanding, has frequently been recognized as the essential aim of science. Now, the ever-growing power of computers and artificial intelligence poses one ultimate question: How can advanced artificial systems contribute to scientific understanding or achieve it autonomously? We are convinced that this is not a mere technical question but lies at the core of science. Therefore, here we set out to answer where we are and where we can go from here. We first seek advice from the philosophy of science to understand scientific understanding. Then we review the current state of the art, both from literature and by collecting dozens of anecdotes from scientists about how they acquired new conceptual understanding with the help of computers. Those combined insights help us to define three dimensions of android-assisted scientific understanding: The android as a I) computational microscope, II) resource of inspiration and the ultimate, not yet existent III) agent of understanding. For each dimension, we explain new avenues to push beyond the status quo and unleash the full power of artificial intelligence's contribution to the central aim of science. We hope our perspective inspires and focuses research towards androids that get new scientific understanding and ultimately bring us closer to true artificial scientists.Comment: 13 pages, 3 figures, comments welcome

arXiv.org e-Print Archive

PubMed Central

MPG.PuRe

The Mycobacterium tuberculosis transposon sequencing database (MtbTnDB): a large-scale guide to genetic conditional essentiality [preprint]

Author: DeJesus Michael A.
Ehrt Sabine
Flores-Bautista Emanuel
Ioerger Thomas R.
Jinich Adrian
Rhee Kyu
Rock Jeremy M.
Sassetti Christopher M.
Schnappinger Dirk
Smith Clare M.
Zaveri Anisha
Publication venue: eScholarship@UMassChan
Publication date: 06/03/2021
Field of study

Characterization of gene essentiality across different conditions is a useful approach for predicting gene function. Transposon sequencing (TnSeq) is a powerful means of generating genome-wide profiles of essentiality and has been used extensively in Mycobacterium tuberculosis (Mtb) genetic research. Over the past two decades, dozens of TnSeq screens have been published, yielding valuable insights into the biology of Mtb in vitro, inside macrophages, and in model host organisms. However, these Mtb TnSeq profiles are distributed across dozens of research papers within supplementary materials, which makes querying them cumbersome and assembling a complete and consistent synthesis of existing data challenging. Here, we address this problem by building a central repository of publicly available TnSeq screens performed in M. tuberculosis, which we call the Mtb transposon sequencing database (MtbTnDB). The MtbTnDB encompasses 64 published and unpublished TnSeq screens, and is standardized, open-access, and allows users easy access to data, visualizations, and functional predictions through an interactive web-app (www.mtbtndb.app). We also present evidence that (i) genes in the same genomic neighborhood tend to have similar TnSeq profiles, and (ii) clusters of genes with similar TnSeq profiles tend to be enriched for genes belonging to the same functional categories. Finally, we test and evaluate machine learning models trained on TnSeq profiles to guide functional annotation of orphan genes in Mtb. In addition to facilitating the exploration of conditional genetic essentiality in this important human pathogen via a centralized TnSeq data repository, the MtbTnDB will enable hypothesis generation and the extraction of meaningful patterns by facilitating the comparison of datasets across conditions. This will provide a basis for insights into the functional organization of Mtb genes as well as gene function prediction

eScholarship@UMMS

Caltech Authors

Recommended from our members

Lead candidates for high-performance organic photovoltaics from high-throughput quantum chemistry – the Harvard Clean Energy Project

Author: Appleton Anthony L.
Aspuru-Guzik Alan
Atahan-Evrenk Sule
Bao Zhenan
Blood-Forsythe Martin
Er Suleyman
Hachmann Johannes
Jinich Adrian
Mondal Rajib
Olivares-Amaya Roberto
Román-Salgado Carolina
Seress Laszlo Ryan
Shrestha Supriya
Sokolov Anatoliy
Trepte Kai
Publication venue: 'Royal Society of Chemistry (RSC)'
Publication date: 08/12/2014
Field of study

The virtual high-throughput screening framework of the Harvard Clean Energy Project allows for the computational assessment of candidate structures for organic electronic materials – in particular photovoltaics – at an unprecedented scale. We report the most promising compounds that have emerged after studying 2.3 million molecular motifs by means of 150 million density functional theory calculations. Our top candidates are analyzed with respect to their structural makeup in order to identify important building blocks and extract design rules for efficient materials. An online database of the results is made available to the community.Engineering and Applied Science

Harvard University - DASH

Invariant Distribution of Promoter Activities in Escherichia coli

Author: A Zaslaver
A Zaslaver
Adrian Jinich
AL Barabasi
Alon Zaslaver
Anat Bren
Avi Mayo
BP Cormack
C Furusawa
Christopher Rao
Erez Dekel
FC Neidhardt
G Churchward
H Bremer
H Bremer
H Jeong
HR Ueda
IM Keseler
JM Carlson
JM Carlson
M Ehrenberg
M Nomura
M Schaechter
MA Kohanski
MA Kohanski
NJ Guido
NO Kjeldgaard
NO Kjeldgaard
O Maaloe
O Maaloe
PP Dennis
R Schleif
R Tanaka
R Tanaka
S Kaplan
S Klumpp
S Lin-Chao
Shai Kaplan
Shalev Itzkovitz
Uri Alon
W Donachie
Publication venue: International Society for Computational Biology
Publication date: 01/01/2009
Field of study

Cells need to allocate their limited resources to express a wide range of genes. To understand how Escherichia coli partitions its transcriptional resources between its different promoters, we employ a robotic assay using a comprehensive reporter strain library for E. coli to measure promoter activity on a genomic scale at high-temporal resolution and accuracy. This allows continuous tracking of promoter activity as cells change their growth rate from exponential to stationary phase in different media. We find a heavy-tailed distribution of promoter activities, with promoter activities spanning several orders of magnitude. While the shape of the distribution is almost completely independent of the growth conditions, the identity of the promoters expressed at different levels does depend on them. Translation machinery genes, however, keep the same relative expression levels in the distribution across conditions, and their fractional promoter activity tracks growth rate tightly. We present a simple optimization model for resource allocation which suggests that the observed invariant distributions might maximize growth rate. These invariant features of the distribution of promoter activities may suggest design constraints that shape the allocation of transcriptional resources

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Caltech Authors

Lead candidates for high-performance organic photovoltaics from high-throughput quantum chemistry – the Harvard Clean Energy Project

Author: Adrian Jinich
Alán Aspuru-Guzik
Amador-Bedolla
Ameri
Anatoliy Sokolov
Ando
Anthony L. Appleton
Arora
Blouin
Bérubé
Carolina Román-Salgado
Chen
Chen
Clery
Corey
Curtarolo
Curtarolo
Dresselhaus
Forrest
Giro
Hachmann
Hamel
Hart
Heeger
Inganäs
Jain
Johannes Hachmann
Kai Trepte
Kanal
Landis
Luo
László R. Seress
Maier
Martin A. Blood-Forsythe
Mierloo
Misra
Mühlbacher
Nørskov
O'Boyle
Olivares-Amaya
Paszkowicz
Rajib Mondal
Rice
Ro
Roberto Olivares-Amaya
Scharber
Shockley
Solomon
Sule Atahan-Evrenk
Sun
Supriya Shrestha
Svensson
Süleyman Er
Wang
Yong
Yu
Yuan
Zhang
Zhang
Zhenan Bao
Publication venue: 'Royal Society of Chemistry (RSC)'
Publication date: 01/01/2014
Field of study

Crossref

Cell Lineage Analysis of the Mammalian Female Germline

Fundamental aspects of embryonic and post-natal development, including maintenance of the mammalian female germline, are largely unknown. Here we employ a retrospective, phylogenetic-based method for reconstructing cell lineage trees utilizing somatic mutations accumulated in microsatellites, to study female germline dynamics in mice. Reconstructed cell lineage trees can be used to estimate lineage relationships between different cell types, as well as cell depth (number of cell divisions since the zygote). We show that, in the reconstructed mouse cell lineage trees, oocytes form clusters that are separate from hematopoietic and mesenchymal stem cells, both in young and old mice, indicating that these populations belong to distinct lineages. Furthermore, while cumulus cells sampled from different ovarian follicles are distinctly clustered on the reconstructed trees, oocytes from the left and right ovaries are not, suggesting a mixing of their progenitor pools. We also observed an increase in oocyte depth with mouse age, which can be explained either by depth-guided selection of oocytes for ovulation or by post-natal renewal. Overall, our study sheds light on substantial novel aspects of female germline preservation and development

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Recommended from our members

Enzyme Substrate Prediction from Three-Dimensional Feature Representations Using Space-Filling Curves

Author: Jinich Adrian
Rappoport Dmitrij
Publication venue: eScholarship, University of California
Publication date: 13/03/2023
Field of study

Compact and interpretable structural feature representations are required for accurately predicting properties and function of proteins. In this work, we construct and evaluate three-dimensional feature representations of protein structures based on space-filling curves (SFCs). We focus on the problem of enzyme substrate prediction, using two ubiquitous enzyme families as case studies: the short-chain dehydrogenase/reductases (SDRs) and the S-adenosylmethionine-dependent methyltransferases (SAM-MTases). Space-filling curves such as the Hilbert curve and the Morton curve generate a reversible mapping from discretized three-dimensional to one-dimensional representations and thus help to encode three-dimensional molecular structures in a system-independent way and with only a few adjustable parameters. Using three-dimensional structures of SDRs and SAM-MTases generated using AlphaFold2, we assess the performance of the SFC-based feature representations in predictions on a new benchmark database of enzyme classification tasks including their cofactor and substrate selectivity. Gradient-boosted tree classifiers yield binary prediction accuracy of 0.77-0.91 and area under curve (AUC) characteristics of 0.83-0.92 for the classification tasks. We investigate the effects of amino acid encoding, spatial orientation, and (the few) parameters of SFC-based encodings on the accuracy of the predictions. Our results suggest that geometry-based approaches such as SFCs are promising for generating protein structural representations and are complementary to the existing protein feature representations such as evolutionary scale modeling (ESM) sequence embeddings

eScholarship - University of California