323 research outputs found

    Optimization methods for side-chain positioning and macromolecular docking

    Full text link
    This dissertation proposes new optimization algorithms targeting protein-protein docking which is an important class of problems in computational structural biology. The ultimate goal of docking methods is to predict the 3-dimensional structure of a stable protein-protein complex. We study two specific problems encountered in predictive docking of proteins. The first problem is Side-Chain Positioning (SCP), a central component of homology modeling and computational protein docking methods. We formulate SCP as a Maximum Weighted Independent Set (MWIS) problem on an appropriately constructed graph. Our formulation also considers the significant special structure of proteins that SCP exhibits for docking. We develop an approximate algorithm that solves a relaxation of MWIS and employ randomized estimation heuristics to obtain high-quality feasible solutions to the problem. The algorithm is fully distributed and can be implemented on multi-processor architectures. Our computational results on a benchmark set of protein complexes show that the accuracy of our approximate MWIS-based algorithm predictions is comparable with the results achieved by a state-of-the-art method that finds an exact solution to SCP. The second problem we target in this work is protein docking refinement. We propose two different methods to solve the refinement problem. The first approach is based on a Monte Carlo-Minimization (MCM) search to optimize rigid-body and side-chain conformations for binding. In particular, we study the impact of optimally positioning the side-chains in the interface region between two proteins in the process of binding. We report computational results showing that incorporating side-chain flexibility in docking provides substantial improvement in the quality of docked predictions compared to the rigid-body approaches. Further, we demonstrate that the inclusion of unbound side-chain conformers in the side-chain search introduces significant improvement in the performance of the docking refinement protocols. In the second approach, we propose a novel stochastic optimization algorithm based on Subspace Semi-Definite programming-based Underestimation (SSDU), which aims to solve protein docking and protein structure prediction. SSDU is based on underestimating the binding energy function in a permissive subspace of the space of rigid-body motions. We apply Principal Component Analysis (PCA) to determine the permissive subspace and reduce the dimensionality of the conformational search space. We consider the general class of convex polynomial underestimators, and formulate the problem of finding such underestimators as a Semi-Definite Programming (SDP) problem. Using these underestimators, we perform a biased sampling in the vicinity of the conformational regions where the energy function is at its global minimum. Moreover, we develop an exploration procedure based on density-based clustering to detect the near-native regions even when there are many local minima residing far from each other. We also incorporate a Model Selection procedure into SSDU to pick a predictive conformation. Testing our algorithm over a benchmark of protein complexes indicates that SSDU substantially improves the quality of docking refinement compared with existing methods

    Advances in structure and small molecule docking predictions for crystallized G-Protein coupled receptors

    Get PDF
    This dissertation discusses two main aspects of protein-ligand interaction for G-Protein coupled receptors: structure predictions of the flexible loop domains and docking into these receptors. The prediction of loop structure has been long worked on in the context of native, globular proteins. In this work it is extended to transmembrane proteins, which requires an explicit integration of the lipid bilayer into the loop prediction calculation. In the initial work, this new approach to loop prediction yields highly accurate 3-dimensional structures of the intra and intercellular loops of four G-protein coupled receptors--the A2A adenosine, bovine rhodopsin, β1 and β2 adronergic receptors. For these cases, the loops were predicted in the context of a completely native crystal structure. In subsequent work the approach was extended to work on perturbed cases, where all loops and tails were removed, and side chains near the loop being predicted were in nonnative conformations. Lastly, a full homology model of the β2 adronergic receptor was successfully built from the β1 adronegric receptor as its template. Work on docking into these receptors focuses on the kappa opioid receptor. Known antagonist binders are discriminated from a set of decoy nonbinders via docking calculations. Two new terms were added to the scoring function, WScore to achieve this, based on a detailed molecular understanding of how the receptor works

    DEVELOPMENT AND APPLICATIONS OF THE HINT FORCEFIELD IN PREDICTION OF ANTIBIOTIC EFFLUX AND VIRTUAL SCREENING FOR ANTIVIRALS

    Get PDF
    This work was aimed at developing novel tools that utilize HINT, an empirical forcefield capable of quantitating both hydrophobic and hydrophilic (hydropathic) interactions, for implementation in theoretical biology and drug discovery/design. The role of hydrophobicity in determination of macromolecular structure and formation of complexes in biological molecules is undeniable and has been the subject of research across several decades. Hydrophobicity is introduced, with a review of its history and contemporary theories. This is followed by a description of various methods that quantify this all-pervading phenomenon and their use in protein folding and contemporary drug design projects – including a detailed overview of the HINT forcefield. The specific aim of this dissertation is to introduce our attempts at developing new methods for use in the study of antibacterial drug resistance and antiviral drug discovery. Multidrug efflux is commonly regarded as a fast growing problem in the field of medicine. Several species of microbes are known to have developed resistance against almost all classes of antibiotics by various modes-of-action, which include multidrug transporters (a.k.a. efflux pumps). These proteins are present in both gram-positive and gram-negative bacteria and extrude molecules of various classes. They protect the efflux pump-expressing bacterium from harmful effects of exogenous agents by simply evacuating the latter. Perhaps the best characterized mechanism amongst these is that of the AcrA-AcrB-TolC efflux pump. Data is available in literature and perhaps also in proprietary databases available with pharmaceutical companies, characterizing this pump in terms of the minimum inhibitory concentration ratios (MIC ratios) for various antibiotics. We procured a curated dataset of 32 β-lactam and 12 antibiotics of other classes from this literature. Initial attempts at studying the MIC ratios of β-lactam antibiotics as a function of their three dimensional topology via 3D-quantitative structure activity relationship (3D-QSAR) technology yielded seemingly good models. However, this methodology is essentially designed to address single receptor-ligand interactions. Molecules being transported by the efflux pump must undoubtedly be involved in multiple interactions with the same. Notably, such methods require a pharmacophoric overlap of ligands prior to the generation of models, thereby limiting their applicability to a set of structurally-related compounds. Thus, we designed a novel method that takes various interactions between antibiotic agents and the AcrA-AcrB-TolC pump into account in conjunction with certain properties of the drugs. This method yielded mathematical models that are capable of predicting high/low efflux with significant efficiency (\u3e93% correct). The development of this method, along with the results from its validation, is presented herein. A parallel aim being pursued by us is to discover inhibitors for hemagglutinin-neuraminidase (HN) of human parainfluenza virus type 3 (HPIV3) by in silico screening. The basis for targeting HN is explored, along with commentary on the methodology adopted during this effort. This project yielded a moderate success rate of 34%, perhaps due to problems in the computational methodology utilized. We highlight one particular problem – that of emulating target flexibility – and explore new avenues for overcoming this obstacle in the long run. As a starting point towards enhancing the tools available to us for virtual screening in general (and for discovering antiviral compounds in specific), we explored the compatibility between sidechain rotamer libraries and the HINT scoring function. A new algorithm was designed to optimize amino acid residue sidechains, if provided with the backbone coordinates, by generating sidechain positions using the Dunbrack and Cohen backbone-dependent rotamer library and scoring them with the HINT scoring function. This rotamer library was previously used by its developers previously to design a very successful sidechain optimization algorithm called SCWRL. Output structures from our algorithm were compared with those from SCWRL and showed extraordinary similarities as well as significant differences, which are discussed herein. This successful implementation of HINT in our sidechain optimization algorithm establishes the compatibility between this forcefield and sidechain rotamer libraries. Future aims in this project include enhancement of our current algorithm and the design of a new algorithm to explore partial induced-fit in targets aimed at improving current docking methodology. This work shows significant progress towards the implementation of our hydropathic force field in theoretical modeling of biological systems in order to enhance our ability to understand atomistic details of inter- and intramolecular interactions which must form the basis for a wide variety of biological phenomena. Such efforts are key to not only to understanding the said phenomena, but also towards a solid basis for efficient drug design in the future

    A Generic Program for Multistate Protein Design

    Get PDF
    Some protein design tasks cannot be modeled by the traditional single state design strategy of finding a sequence that is optimal for a single fixed backbone. Such cases require multistate design, where a single sequence is threaded onto multiple backbones (states) and evaluated for its strengths and weaknesses on each backbone. For example, to design a protein that can switch between two specific conformations, it is necessary to to find a sequence that is compatible with both backbone conformations. We present in this paper a generic implementation of multistate design that is suited for a wide range of protein design tasks and demonstrate in silico its capabilities at two design tasks: one of redesigning an obligate homodimer into an obligate heterodimer such that the new monomers would not homodimerize, and one of redesigning a promiscuous interface to bind to only a single partner and to no longer bind the rest of its partners. Both tasks contained negative design in that multistate design was asked to find sequences that would produce high energies for several of the states being modeled. Success at negative design was assessed by computationally redocking the undesired protein-pair interactions; we found that multistate design's accuracy improved as the diversity of conformations for the undesired protein-pair interactions increased. The paper concludes with a discussion of the pitfalls of negative design, which has proven considerably more challenging than positive design

    Step-by-step design of proteins for small molecule interaction: a review on recent milestones

    Get PDF
    Protein design is the field of synthetic biology that aims at developing de-novo custom made proteins and peptides for specific applications. Despite exploring an ambitious goal, recent computational advances in both hardware and software technologies have paved the way to high-throughput screening and detailed design of novel folds and improved functionalities. Modern advances in the field of protein design for small molecule targeting are described in this review, organized in a step-by-step fashion: from the conception of a new or upgraded active binding site, to scaffold design, sequence optimization and experimental expression of the custom protein. In each step, contemporary examples are described, and state-of-the art software is briefly explored.publishe
    • …
    corecore