29,972 research outputs found
Method for Computing Protein Binding Affinity
A Monte Carlo method is given to compute the binding affinity of a ligand to
a protein. The method involves extending configuration space by a discrete
variable indicating whether the ligand is bound to the protein and a special
Monte Carlo move which allows transitions between the unbound and bound states.
Provided that an accurate protein structure is given, that the protein-ligand
binding site is known, and that an accurate chemical force field together with
a continuum solvation model is used, this method provides a quantitative
estimate of the free energy of binding.Comment: RevTex, 10 pages with 5 figures. Explanatory figure adde
Analyzing Machupo virus-receptor binding by molecular dynamics simulations
In many biological applications, we would like to be able to computationally
predict mutational effects on affinity in protein-protein interactions.
However, many commonly used methods to predict these effects perform poorly in
important test cases. In particular, the effects of multiple mutations,
non-alanine substitutions, and flexible loops are difficult to predict with
available tools and protocols. We present here an existing method applied in a
novel way to a new test case; we interrogate affinity differences resulting
from mutations in a host-virus protein-protein interface. We use steered
molecular dynamics (SMD) to computationally pull the machupo virus (MACV) spike
glycoprotein (GP1) away from the human transferrin receptor (hTfR1). We then
approximate affinity using the maximum applied force of separation and the area
under the force-versus-distance curve. We find, even without the rigor and
planning required for free energy calculations, that these quantities can
provide novel biophysical insight into the GP1/hTfR1 interaction. First, with
no prior knowledge of the system we can differentiate among wild type and
mutant complexes. Moreover, we show that this simple SMD scheme correlates well
with relative free energy differences computed via free energy perturbation.
Second, although the static co-crystal structure shows two large
hydrogen-bonding networks in the GP1/hTfR1 interface, our simulations indicate
that one of them may not be important for tight binding. Third, one viral site
known to be critical for infection may mark an important evolutionary
suppressor site for infection-resistant hTfR1 mutants. Finally, our approach
provides a framework to compare the effects of multiple mutations, individually
and jointly, on protein-protein interactions.Comment: 33 pages, 8 figures, 5 table
TopologyNet: Topology based deep convolutional neural networks for biomolecular property predictions
Although deep learning approaches have had tremendous success in image, video
and audio processing, computer vision, and speech recognition, their
applications to three-dimensional (3D) biomolecular structural data sets have
been hindered by the entangled geometric complexity and biological complexity.
We introduce topology, i.e., element specific persistent homology (ESPH), to
untangle geometric complexity and biological complexity. ESPH represents 3D
complex geometry by one-dimensional (1D) topological invariants and retains
crucial biological information via a multichannel image representation. It is
able to reveal hidden structure-function relationships in biomolecules. We
further integrate ESPH and convolutional neural networks to construct a
multichannel topological neural network (TopologyNet) for the predictions of
protein-ligand binding affinities and protein stability changes upon mutation.
To overcome the limitations to deep learning arising from small and noisy
training sets, we present a multitask topological convolutional neural network
(MT-TCNN). We demonstrate that the present TopologyNet architectures outperform
other state-of-the-art methods in the predictions of protein-ligand binding
affinities, globular protein mutation impacts, and membrane protein mutation
impacts.Comment: 20 pages, 8 figures, 5 table
Computational structure‐based drug design: Predicting target flexibility
The role of molecular modeling in drug design has experienced a significant revamp in the last decade. The increase in computational resources and molecular models, along with software developments, is finally introducing a competitive advantage in early phases of drug discovery. Medium and small companies with strong focus on computational chemistry are being created, some of them having introduced important leads in drug design pipelines. An important source for this success is the extraordinary development of faster and more efficient techniques for describing flexibility in three‐dimensional structural molecular modeling. At different levels, from docking techniques to atomistic molecular dynamics, conformational sampling between receptor and drug results in improved predictions, such as screening enrichment, discovery of transient cavities, etc. In this review article we perform an extensive analysis of these modeling techniques, dividing them into high and low throughput, and emphasizing in their application to drug design studies. We finalize the review with a section describing our Monte Carlo method, PELE, recently highlighted as an outstanding advance in an international blind competition and industrial benchmarks.We acknowledge the BSC-CRG-IRB Joint Research Program in Computational Biology. This work was supported by a grant
from the Spanish Government CTQ2016-79138-R.J.I. acknowledges support from SVP-2014-068797, awarded by the Spanish Government.Peer ReviewedPostprint (author's final draft
Representability of algebraic topology for biomolecules in machine learning based scoring and virtual screening
This work introduces a number of algebraic topology approaches, such as
multicomponent persistent homology, multi-level persistent homology and
electrostatic persistence for the representation, characterization, and
description of small molecules and biomolecular complexes. Multicomponent
persistent homology retains critical chemical and biological information during
the topological simplification of biomolecular geometric complexity.
Multi-level persistent homology enables a tailored topological description of
inter- and/or intra-molecular interactions of interest. Electrostatic
persistence incorporates partial charge information into topological
invariants. These topological methods are paired with Wasserstein distance to
characterize similarities between molecules and are further integrated with a
variety of machine learning algorithms, including k-nearest neighbors, ensemble
of trees, and deep convolutional neural networks, to manifest their descriptive
and predictive powers for chemical and biological problems. Extensive numerical
experiments involving more than 4,000 protein-ligand complexes from the PDBBind
database and near 100,000 ligands and decoys in the DUD database are performed
to test respectively the scoring power and the virtual screening power of the
proposed topological approaches. It is demonstrated that the present approaches
outperform the modern machine learning based methods in protein-ligand binding
affinity predictions and ligand-decoy discrimination
High-throughput Binding Affinity Calculations at Extreme Scales
Resistance to chemotherapy and molecularly targeted therapies is a major
factor in limiting the effectiveness of cancer treatment. In many cases,
resistance can be linked to genetic changes in target proteins, either
pre-existing or evolutionarily selected during treatment. Key to overcoming
this challenge is an understanding of the molecular determinants of drug
binding. Using multi-stage pipelines of molecular simulations we can gain
insights into the binding free energy and the residence time of a ligand, which
can inform both stratified and personal treatment regimes and drug development.
To support the scalable, adaptive and automated calculation of the binding free
energy on high-performance computing resources, we introduce the High-
throughput Binding Affinity Calculator (HTBAC). HTBAC uses a building block
approach in order to attain both workflow flexibility and performance. We
demonstrate close to perfect weak scaling to hundreds of concurrent multi-stage
binding affinity calculation pipelines. This permits a rapid time-to-solution
that is essentially invariant of the calculation protocol, size of candidate
ligands and number of ensemble simulations. As such, HTBAC advances the state
of the art of binding affinity calculations and protocols
Absolute FKBP binding affinities obtained via non-equilibrium unbinding simulations
We compute absolute binding affinities for two ligands bound to the FKBP
protein using non-equilibrium unbinding simulations. The methodology is
straight-forward, requiring little or no modification to many modern molecular
simulation packages. The approach makes use of a physical pathway, eliminating
the need for complicated alchemical decoupling schemes. Results of this study
are promising. For the ligands studied here the binding affinities are
typically estimated within less than 4.0 kJ/mol of the target values; and the
target values are within less than 1.0 kJ/mol of experiment. These results
suggest that non-equilibrium simulation could provide a simple and robust means
to estimate protein-ligand binding affinities.Comment: 9 pages, 3 figures (no necessary color). Changes made to methodology
and results between revision
- …