17 research outputs found

    GalaxyDock: Protein–Ligand Docking with Flexible Protein Side-chains

    No full text
    An important issue in developing protein–ligand docking methods is how to incorporate receptor flexibility. Consideration of receptor flexibility using an ensemble of precompiled receptor conformations or by employing an effectively enlarged binding pocket has been reported to be useful. However, direct consideration of receptor flexibility during energy optimization of the docked conformation has been less popular because of the large increase in computational complexity. In this paper, we present a new docking program called GalaxyDock that accounts for the flexibility of preselected receptor side-chains by global optimization of an AutoDock-based energy function trained for flexible side-chain docking. This method was tested on 3 sets of protein–ligand complexes (HIV-PR, LXRβ, cAPK) and a diverse set of 16 proteins that involve side-chain conformational changes upon ligand binding. The cross-docking tests show that the performance of GalaxyDock is higher or comparable to previous flexible docking methods tested on the same sets, increasing the binding conformation prediction accuracy by 10%–60% compared to rigid-receptor docking. This encouraging result suggests that this powerful global energy optimization method may be further extended to incorporate larger magnitudes of receptor flexibility in the future. The program is available at http://galaxy.seoklab.org/softwares/galaxydock.html

    PL-PatchSurfer2: Improved Local Surface Matching-Based Virtual Screening Method That Is Tolerant to Target and Ligand Structure Variation

    No full text
    Virtual screening has become an indispensable procedure in drug discovery. Virtual screening methods can be classified into two categories: ligand-based and structure-based. While the former have advantages, including being quick to compute, in general they are relatively weak at discovering novel active compounds because they use known actives as references. On the other hand, structure-based methods have higher potential to find novel compounds because they directly predict the binding affinity of a ligand in a target binding pocket, albeit with substantially lower speed than ligand-based methods. Here we report a novel structure-based virtual screening method, PL-PatchSurfer2. In PL-PatchSurfer2, protein and ligand surfaces are represented by a set of overlapping local patches, each of which is represented by three-dimensional Zernike descriptors (3DZDs). By means of 3DZDs, the shapes and physicochemical complementarities of local surface regions of a pocket surface and a ligand molecule can be concisely and effectively computed. Compared with the previous version of the program, the performance of PL-PatchSurfer2 is substantially improved by the addition of two more features, atom-based hydrophobicity and hydrogen-bond acceptors and donors. Benchmark studies showed that PL-PatchSurfer2 performed better than or comparable to popular existing methods. Particularly, PL-PatchSurfer2 significantly outperformed existing methods when apo-form or template-based protein models were used for queries. The computational time of PL-PatchSurfer2 is about 20 times shorter than those of conventional structure-based methods. The PL-PatchSurfer2 program is available at http://www.kiharalab.org/plps2/

    Discovery of Nicotinamide Adenine Dinucleotide Binding Proteins in the Escherichia coli Proteome Using a Combined Energetic- and Structural-Bioinformatics-Based Approach

    No full text
    Protein–ligand interaction plays a critical role in regulating the biochemical functions of proteins. Discovering protein targets for ligands is vital to new drug development. Here, we present a strategy that combines experimental and computational approaches to identify ligand-binding proteins in a proteomic scale. For the experimental part, we coupled pulse proteolysis with filter-assisted sample preparation (FASP) and quantitative mass spectrometry. Under denaturing conditions, ligand binding affected protein stability, which resulted in altered protein abundance after pulse proteolysis. For the computational part, we used the software Patch-Surfer2.0. We applied the integrated approach to identify nicotinamide adenine dinucleotide (NAD)-binding proteins in the Escherichia coli proteome, which has over 4200 proteins. Pulse proteolysis and Patch-Surfer2.0 identified 78 and 36 potential NAD-binding proteins, respectively, including 12 proteins that were consistently detected by the two approaches. Interestingly, the 12 proteins included 8 that are not previously known as NAD binders. Further validation of these eight proteins showed that their binding affinities to NAD computed by AutoDock Vina are higher than their cognate ligands and also that their protein ratios in the pulse proteolysis are consistent with known NAD-binding proteins. These results strongly suggest that these eight proteins are indeed newly identified NAD binders

    Discovery of Nicotinamide Adenine Dinucleotide Binding Proteins in the Escherichia coli Proteome Using a Combined Energetic- and Structural-Bioinformatics-Based Approach

    No full text
    Protein–ligand interaction plays a critical role in regulating the biochemical functions of proteins. Discovering protein targets for ligands is vital to new drug development. Here, we present a strategy that combines experimental and computational approaches to identify ligand-binding proteins in a proteomic scale. For the experimental part, we coupled pulse proteolysis with filter-assisted sample preparation (FASP) and quantitative mass spectrometry. Under denaturing conditions, ligand binding affected protein stability, which resulted in altered protein abundance after pulse proteolysis. For the computational part, we used the software Patch-Surfer2.0. We applied the integrated approach to identify nicotinamide adenine dinucleotide (NAD)-binding proteins in the Escherichia coli proteome, which has over 4200 proteins. Pulse proteolysis and Patch-Surfer2.0 identified 78 and 36 potential NAD-binding proteins, respectively, including 12 proteins that were consistently detected by the two approaches. Interestingly, the 12 proteins included 8 that are not previously known as NAD binders. Further validation of these eight proteins showed that their binding affinities to NAD computed by AutoDock Vina are higher than their cognate ligands and also that their protein ratios in the pulse proteolysis are consistent with known NAD-binding proteins. These results strongly suggest that these eight proteins are indeed newly identified NAD binders

    Modeling the assembly order of multimeric heteroprotein complexes

    No full text
    <div><p>Protein-protein interactions are the cornerstone of numerous biological processes. Although an increasing number of protein complex structures have been determined using experimental methods, relatively fewer studies have been performed to determine the assembly order of complexes. In addition to the insights into the molecular mechanisms of biological function provided by the structure of a complex, knowing the assembly order is important for understanding the process of complex formation. Assembly order is also practically useful for constructing subcomplexes as a step toward solving the entire complex experimentally, designing artificial protein complexes, and developing drugs that interrupt a critical step in the complex assembly. There are several experimental methods for determining the assembly order of complexes; however, these techniques are resource-intensive. Here, we present a computational method that predicts the assembly order of protein complexes by building the complex structure. The method, named Path-LzerD, uses a multimeric protein docking algorithm that assembles a protein complex structure from individual subunit structures and predicts assembly order by observing the simulated assembly process of the complex. Benchmarked on a dataset of complexes with experimental evidence of assembly order, Path-LZerD was successful in predicting the assembly pathway for the majority of the cases. Moreover, when compared with a simple approach that infers the assembly path from the buried surface area of subunits in the native complex, Path-LZerD has the strong advantage that it can be used for cases where the complex structure is not known. The path prediction accuracy decreased when starting from unbound monomers, particularly for larger complexes of five or more subunits, for which only a part of the assembly path was correctly identified. As the first method of its kind, Path-LZerD opens a new area of computational protein structure modeling and will be an indispensable approach for studying protein complexes.</p></div

    Assembly pathway of 3vyt.

    No full text
    <p>Subunits marked C and C′ (green and yellow) are HypC, subunits marked D and D′ (cyan and salmon) are HypD, and subunits marked E and E′ (magenta and white) are HypE. The assembly pathway is CD+C′D′+EE′> CD+C′D′EE′> CC′DD′EE′.</p

    Examples of Multi-LZerD predictions with correct or almost correct topology.

    No full text
    <p>Dark colors: native structures. Light colors: lowest RMSD output of Multi-LZerD. Top: 1ikn, 14.51 Ă…. Bottom: 1hez, 11.73 Ă…. The diagram to the right of each complex represents the interactions between subunits. Nodes in the diagrams are colored in the same way as the complex structure models. Black lines, interactions in the native structure; gray, the complex model. A solid line indicates that there are more than 20 interacting residue pairs between the subunits and a dotted line is an interaction with fewer than 20 interacting residue pairs. A cutoff distance of 5.0 Ă… was used to define inter-residue contacts.</p

    An example of Multi-LZerD prediction that is partially correct.

    No full text
    <p>Dark colors: Native structure of 2e9x. Light colors: Multi-LZerD model with 9.5 Ă… RMSD. Chains A, B, and D (green, cyan, and yellow, respectively) have an RMSD of 1.6 Ă…. The majority of the RMSD error is due to the position of chain C (magenta).</p
    corecore