17 research outputs found
GalaxyDock: Protein–Ligand Docking with Flexible Protein Side-chains
An important issue in developing protein–ligand
docking methods is how to incorporate receptor flexibility. Consideration
of receptor flexibility using an ensemble of precompiled receptor
conformations or by employing an effectively enlarged binding pocket
has been reported to be useful. However, direct consideration of receptor
flexibility during energy optimization of the docked conformation
has been less popular because of the large increase in computational
complexity. In this paper, we present a new docking program called
GalaxyDock that accounts for the flexibility of preselected receptor
side-chains by global optimization of an AutoDock-based energy function
trained for flexible side-chain docking. This method was tested on
3 sets of protein–ligand complexes (HIV-PR, LXRβ, cAPK)
and a diverse set of 16 proteins that involve side-chain conformational
changes upon ligand binding. The cross-docking tests show that the
performance of GalaxyDock is higher or comparable to previous flexible
docking methods tested on the same sets, increasing the binding conformation
prediction accuracy by 10%–60% compared to rigid-receptor docking.
This encouraging result suggests that this powerful global energy
optimization method may be further extended to incorporate larger
magnitudes of receptor flexibility in the future. The program is available
at http://galaxy.seoklab.org/softwares/galaxydock.html
PL-PatchSurfer2: Improved Local Surface Matching-Based Virtual Screening Method That Is Tolerant to Target and Ligand Structure Variation
Virtual
screening has become an indispensable procedure in drug
discovery. Virtual screening methods can be classified into two categories:
ligand-based and structure-based. While the former have advantages,
including being quick to compute, in general they are relatively weak
at discovering novel active compounds because they use known actives
as references. On the other hand, structure-based methods have higher
potential to find novel compounds because they directly predict the
binding affinity of a ligand in a target binding pocket, albeit with
substantially lower speed than ligand-based methods. Here we report
a novel structure-based virtual screening method, PL-PatchSurfer2.
In PL-PatchSurfer2, protein and ligand surfaces are represented by
a set of overlapping local patches, each of which is represented by
three-dimensional Zernike descriptors (3DZDs). By means of 3DZDs,
the shapes and physicochemical complementarities of local surface
regions of a pocket surface and a ligand molecule can be concisely
and effectively computed. Compared with the previous version of the
program, the performance of PL-PatchSurfer2 is substantially improved
by the addition of two more features, atom-based hydrophobicity and
hydrogen-bond acceptors and donors. Benchmark studies showed that
PL-PatchSurfer2 performed better than or comparable to popular existing
methods. Particularly, PL-PatchSurfer2 significantly outperformed
existing methods when apo-form or template-based protein models were
used for queries. The computational time of PL-PatchSurfer2 is about
20 times shorter than those of conventional structure-based methods.
The PL-PatchSurfer2 program is available at http://www.kiharalab.org/plps2/
Discovery of Nicotinamide Adenine Dinucleotide Binding Proteins in the Escherichia coli Proteome Using a Combined Energetic- and Structural-Bioinformatics-Based Approach
Protein–ligand
interaction plays a critical role in regulating
the biochemical functions of proteins. Discovering protein targets
for ligands is vital to new drug development. Here, we present a strategy
that combines experimental and computational approaches to identify
ligand-binding proteins in a proteomic scale. For the experimental
part, we coupled pulse proteolysis with filter-assisted sample preparation
(FASP) and quantitative mass spectrometry. Under denaturing conditions,
ligand binding affected protein stability, which resulted in altered
protein abundance after pulse proteolysis. For the computational part,
we used the software Patch-Surfer2.0. We applied the integrated approach
to identify nicotinamide adenine dinucleotide (NAD)-binding proteins
in the Escherichia coli proteome, which
has over 4200 proteins. Pulse proteolysis and Patch-Surfer2.0 identified
78 and 36 potential NAD-binding proteins, respectively, including
12 proteins that were consistently detected by the two approaches.
Interestingly, the 12 proteins included 8 that are not previously
known as NAD binders. Further validation of these eight proteins showed
that their binding affinities to NAD computed by AutoDock Vina are
higher than their cognate ligands and also that their protein ratios
in the pulse proteolysis are consistent with known NAD-binding proteins.
These results strongly suggest that these eight proteins are indeed
newly identified NAD binders
Discovery of Nicotinamide Adenine Dinucleotide Binding Proteins in the Escherichia coli Proteome Using a Combined Energetic- and Structural-Bioinformatics-Based Approach
Protein–ligand
interaction plays a critical role in regulating
the biochemical functions of proteins. Discovering protein targets
for ligands is vital to new drug development. Here, we present a strategy
that combines experimental and computational approaches to identify
ligand-binding proteins in a proteomic scale. For the experimental
part, we coupled pulse proteolysis with filter-assisted sample preparation
(FASP) and quantitative mass spectrometry. Under denaturing conditions,
ligand binding affected protein stability, which resulted in altered
protein abundance after pulse proteolysis. For the computational part,
we used the software Patch-Surfer2.0. We applied the integrated approach
to identify nicotinamide adenine dinucleotide (NAD)-binding proteins
in the Escherichia coli proteome, which
has over 4200 proteins. Pulse proteolysis and Patch-Surfer2.0 identified
78 and 36 potential NAD-binding proteins, respectively, including
12 proteins that were consistently detected by the two approaches.
Interestingly, the 12 proteins included 8 that are not previously
known as NAD binders. Further validation of these eight proteins showed
that their binding affinities to NAD computed by AutoDock Vina are
higher than their cognate ligands and also that their protein ratios
in the pulse proteolysis are consistent with known NAD-binding proteins.
These results strongly suggest that these eight proteins are indeed
newly identified NAD binders
Modeling the assembly order of multimeric heteroprotein complexes
<div><p>Protein-protein interactions are the cornerstone of numerous biological processes. Although an increasing number of protein complex structures have been determined using experimental methods, relatively fewer studies have been performed to determine the assembly order of complexes. In addition to the insights into the molecular mechanisms of biological function provided by the structure of a complex, knowing the assembly order is important for understanding the process of complex formation. Assembly order is also practically useful for constructing subcomplexes as a step toward solving the entire complex experimentally, designing artificial protein complexes, and developing drugs that interrupt a critical step in the complex assembly. There are several experimental methods for determining the assembly order of complexes; however, these techniques are resource-intensive. Here, we present a computational method that predicts the assembly order of protein complexes by building the complex structure. The method, named Path-LzerD, uses a multimeric protein docking algorithm that assembles a protein complex structure from individual subunit structures and predicts assembly order by observing the simulated assembly process of the complex. Benchmarked on a dataset of complexes with experimental evidence of assembly order, Path-LZerD was successful in predicting the assembly pathway for the majority of the cases. Moreover, when compared with a simple approach that infers the assembly path from the buried surface area of subunits in the native complex, Path-LZerD has the strong advantage that it can be used for cases where the complex structure is not known. The path prediction accuracy decreased when starting from unbound monomers, particularly for larger complexes of five or more subunits, for which only a part of the assembly path was correctly identified. As the first method of its kind, Path-LZerD opens a new area of computational protein structure modeling and will be an indispensable approach for studying protein complexes.</p></div
Assembly pathway of 1ikn.
<p>Subunit A is p65(RelA), subunit C is p50, and subunit D is I-<i>Îş</i>-B. The assembly pathway is AC> ACD.</p
Assembly pathway of 3vyt.
<p>Subunits marked C and C′ (green and yellow) are HypC, subunits marked D and D′ (cyan and salmon) are HypD, and subunits marked E and E′ (magenta and white) are HypE. The assembly pathway is CD+C′D′+EE′> CD+C′D′EE′> CC′DD′EE′.</p
Examples of Multi-LZerD predictions with correct or almost correct topology.
<p>Dark colors: native structures. Light colors: lowest RMSD output of Multi-LZerD. Top: 1ikn, 14.51 Ă…. Bottom: 1hez, 11.73 Ă…. The diagram to the right of each complex represents the interactions between subunits. Nodes in the diagrams are colored in the same way as the complex structure models. Black lines, interactions in the native structure; gray, the complex model. A solid line indicates that there are more than 20 interacting residue pairs between the subunits and a dotted line is an interaction with fewer than 20 interacting residue pairs. A cutoff distance of 5.0 Ă… was used to define inter-residue contacts.</p
An example of Multi-LZerD prediction that is partially correct.
<p>Dark colors: Native structure of 2e9x. Light colors: Multi-LZerD model with 9.5 Ă… RMSD. Chains A, B, and D (green, cyan, and yellow, respectively) have an RMSD of 1.6 Ă…. The majority of the RMSD error is due to the position of chain C (magenta).</p
Assembly pathway of 1ikn.
<p>Subunit A is p65(RelA), subunit C is p50, and subunit D is I-<i>Îş</i>-B. The assembly pathway is AC> ACD.</p