7 research outputs found
Exploring non-linear distance metrics in the structureâactivity space: QSAR models for human estrogen receptor
Abstract Background Quantitative structure-activity relationship (QSAR) models are important tools used in discovering new drug candidates and identifying potentially harmful environmental chemicals. These models often face two fundamental challenges: limited amount of available biological activity data and noise or uncertainty in the activity data themselves. To address these challenges, we introduce and explore a QSAR model based on custom distance metrics in the structure-activity space. Methods The model is built on top of the k-nearest neighbor model, incorporating non-linearity not only in the chemical structure space, but also in the biological activity space. The model is tuned and evaluated using activity data for human estrogen receptor from the US EPA ToxCast and Tox21 databases. Results The model closely trails the CERAPP consensus model (built on top of 48 individual human estrogen receptor activity models) in agonist activity predictions and consistently outperforms the CERAPP consensus model in antagonist activity predictions. Discussion We suggest that incorporating non-linear distance metrics may significantly improve QSAR model performance when the available biological activity data are limited
Steering Electrons on Moving Pathways
Electron transfer (ET) reactions provide a nexus among chemistry, biochemistry, and physics. These reactions underpin the âpower plantsâ and âpower gridsâ of bioenergetics, and they challenge us to understand how evolution manipulates structure to control ET kinetics. Ball-and-stick models for the machinery of electron transfer, however, fail to capture the rich electronic and nuclear dynamics of ET molecules: these static representations disguise, for example, the range of thermally accessible molecular conformations. The influence of structural fluctuations on electron-transfer kinetics is amplified by the exponential decay of electron tunneling probabilities with distance, as well as the delicate interference among coupling pathways. Fluctuations in the surrounding medium can also switch transport between coherent and incoherent ET mechanisms â and may gate ET so that its kinetics is limited by conformational interconversion times, rather than by the intrinsic ET time scale. Moreover, preparation of a charge-polarized donor state or of a donor state with linear or angular momentum can have profound dynamical and kinetic consequences. In this Account, we establish a vocabulary to describe how the conformational ensemble and the prepared donor state influence ET kinetics in macromolecules. This framework is helping to unravel the richness of functional biological ET pathways, which have evolved within fluctuating macromolecular structures.
The conceptual framework for describing nonadiabatic ET seems disarmingly simple: compute the ensemble-averaged (mean-squared) donorâacceptor (DA) tunneling interaction, âšHDA2â©, and the FranckâCondon weighted density of states, ÏFC, to describe the rate, (2Ï/â)âšHDA2â©ÏFC. Modern descriptions of the thermally averaged electronic coupling and of the FranckâCondon factor establish a useful predictive framework in biology, chemistry, and nanoscience. Describing the influence of geometric and energetic fluctuations on ET allows us to address a rich array of mechanistic and kinetic puzzles. How strongly is a proteinâs fold imprinted on the ET kinetics, and might thermal fluctuations âwash outâ signatures of structure? What is the influence of thermal fluctuations on ET kinetics beyond averaging of the tunneling barrier structure? Do electronic coupling mechanisms change as donor and acceptor reposition in a protein, and what are the consequences for the ET kinetics? Do fluctuations access minority species that dominate tunneling? Can energy exchanges between the electron and bridge vibrations generate vibronic signatures that label some of the D-to-A pathways traversed by the electron, thus eliminating unmarked pathways that would otherwise contribute to the DA coupling (as in other âwhich wayâ or double-slit experiments)? Might medium fluctuations drive tunnelingâhopping mechanistic transitions? How does the donor-state preparation, in particular, its polarization toward the acceptor and its momentum characteristics (which may introduce complex rather than pure real relationships among donor orbital amplitudes), influence the electronic dynamics?
In this Account, we describe our recent studies that address puzzling questions of how conformational distributions, excited-state polarization, and electronic and nuclear dynamical effects influence ET in macromolecules. Indeed, conformational and dynamical effects arise in all transport regimes, including the tunneling, resonant transport, and hopping regimes. Importantly, these effects can induce switching among ET mechanisms
CERAPP : Collaborative Estrogen Receptor Activity Prediction Project
BACKGROUND: Humans are exposed to thousands of man-made chemicals in the environment. Some chemicals mimic natural endocrine hormones and, thus, have the potential to be endocrine disruptors. Most of these chemicals have never been tested for their ability to interact with the estrogen receptor (ER). Risk assessors need tools to prioritize chemicals for evaluation in costly in vivo tests, for instance, within the U.S. EPA Endocrine Disruptor Screening Program. OBJECTIVES: We describe a large-scale modeling project called CERAPP (Collaborative Estrogen Receptor Activity Prediction Project) and demonstrate the efficacy of using predictive computational models trained on high-throughput screening data to evaluate thousands of chemicals for ER-related activity and prioritize them for further testing. METHODS: CERAPP combined multiple models developed in collaboration with 17 groups in the United States and Europe to predict ER activity of a common set of 32,464 chemical structures. Quantitative structure-activity relationship models and docking approaches were employed, mostly using a common training set of 1,677 chemical structures provided by the U.S. EPA, to build a total of 40 categorical and 8 continuous models for binding, agonist, and antagonist ER activity. All predictions were evaluated on a set of 7,522 chemicals curated from the literature. To overcome the limitations of single models, a consensus was built by weighting models on scores based on their evaluated accuracies. RESULTS: Individual model scores ranged from 0.69 to 0.85, showing high prediction reliabilities. Out of the 32,464 chemicals, the consensus model predicted 4,001 chemicals (12.3%) as high priority actives and 6,742 potential actives (20.8%) to be considered for further testing.CONCLUSION: This project demonstrated the possibility to screen large libraries of chemicals using a consensus of different in silico approaches. This concept will be applied in future projects related to other end points