3,510 research outputs found

    STRUCTURAL MODELING OF PROTEIN-PROTEIN INTERACTIONS USING MULTIPLE-CHAIN THREADING AND FRAGMENT ASSEMBLY

    Get PDF
    Since its birth, the study of protein structures has made progress with leaps and bounds. However, owing to the expenses and difficulties involved, the number of protein structures has not been able to catch up with the number of protein sequences and in fact has steadily lost ground. This necessitated the development of high-throughput but accurate computational algorithms capable of predicting the three dimensional structure of proteins from its amino acid sequence. While progress has been made in the realm of protein tertiary structure prediction, the advancement in protein quaternary structure prediction has been limited by the fact that the degree of freedom for protein complexes is even larger and even fewer number of protein complex structures are present in the PDB library. In fact, protein complex structure prediction till date has largely remained a docking problem where automated algorithms aim to predict the protein complex structure starting from the unbound crystal structure of its component subunits and thus has remained largely limited in terms of scope. Secondly, since docking essentially treats the unbound subunits as "rigid-bodies" it has limited accuracy when conformational change accompanies protein-protein interaction. In one of the first of its kind effort, this study aims for the development of protein complex structure algorithms which require only the amino acid sequence of the interacting subunits as input. The study aimed to adapt the best features of protein tertiary structure prediction including template detection and ab initio loop modeling and extend it for protein-protein complexes thus requiring simultaneous modeling of the three dimensional structure of the component subunits as well as ensuring the correct orientation of the chains at the protein-protein interface. Essentially, the algorithms are dependent on knowledge-based statistical potentials for both fold recognition and structure modeling. First, as a way to compare known structure of protein-protein complexes, a complex structure alignment program MM-align was developed. MM-align joins the chains of the complex structures to be aligned to form artificial monomers in every possible order. It then aligns them using a heuristic dynamic programming based approach using TM-score as the objective function. However, the traditional NW dynamic programming was redesigned to prevent the cross alignment of chains during the structure alignment process. Driven by the knowledge obtained from MM-align that protein complex structures share evolutionary relationships and the current protein complex structure library already contains homologous/structurally analogous protein quaternary structure families, a dimeric threading approach, COTH was designed. The new threading-recombination approach boosts the protein complex structure library by combining tertiary structure templates with complex alignments. The query sequences are first aligned to complex templates using the modified dynamic programming algorithm, guided by a number of predicted structural features including ab initio binding-site predictions. Finally, a template-based complex structure prediction approach, TACOS, was designed to build full-length protein complex structures starting from the initial templates identified by COTH. TACOS, fragments the templates aligned regions of templates and reassembles them while building the structure of the threading unaligned region ab inito using a replica-exchange monte-carlo simulation procedure. Simultaneously, TACOS also searches for the best orientation match of the component structures driven by a number of knowledge-based potential terms. Overall, TACOS presents the one of the first approach capable of predicting full length protein complex structures from sequence alone and introduces a new paradigm in the field of protein complex structure modeling

    The benefits of in silico modeling to identify possible small-molecule drugs and their off-target interactions

    Get PDF
    Accepted for publication in a future issue of Future Medicinal Chemistry.The research into the use of small molecules as drugs continues to be a key driver in the development of molecular databases, computer-aided drug design software and collaborative platforms. The evolution of computational approaches is driven by the essential criteria that a drug molecule has to fulfill, from the affinity to targets to minimal side effects while having adequate absorption, distribution, metabolism, and excretion (ADME) properties. A combination of ligand- and structure-based drug development approaches is already used to obtain consensus predictions of small molecule activities and their off-target interactions. Further integration of these methods into easy-to-use workflows informed by systems biology could realize the full potential of available data in the drug discovery and reduce the attrition of drug candidates.Peer reviewe

    Prediction of Ligand Binding Using an Approach Designed to Accommodate Diversity in Protein-Ligand Interactions

    Get PDF
    Computational determination of protein-ligand interaction potential is important for many biological applications including virtual screening for therapeutic drugs. The novel internal consensus scoring strategy is an empirical approach with an extended set of 9 binding terms combined with a neural network capable of analysis of diverse complexes. Like conventional consensus methods, internal consensus is capable of maintaining multiple distinct representations of protein-ligand interactions. In a typical use the method was trained using ligand classification data (binding/no binding) for a single receptor. The internal consensus analyses successfully distinguished protein-ligand complexes from decoys (r2, 0.895 for a series of typical proteins). Results are superior to other tested empirical methods. In virtual screening experiments, internal consensus analyses provide consistent enrichment as determined by ROC-AUC and pROC metrics

    Computational structure‐based drug design: Predicting target flexibility

    Get PDF
    The role of molecular modeling in drug design has experienced a significant revamp in the last decade. The increase in computational resources and molecular models, along with software developments, is finally introducing a competitive advantage in early phases of drug discovery. Medium and small companies with strong focus on computational chemistry are being created, some of them having introduced important leads in drug design pipelines. An important source for this success is the extraordinary development of faster and more efficient techniques for describing flexibility in three‐dimensional structural molecular modeling. At different levels, from docking techniques to atomistic molecular dynamics, conformational sampling between receptor and drug results in improved predictions, such as screening enrichment, discovery of transient cavities, etc. In this review article we perform an extensive analysis of these modeling techniques, dividing them into high and low throughput, and emphasizing in their application to drug design studies. We finalize the review with a section describing our Monte Carlo method, PELE, recently highlighted as an outstanding advance in an international blind competition and industrial benchmarks.We acknowledge the BSC-CRG-IRB Joint Research Program in Computational Biology. This work was supported by a grant from the Spanish Government CTQ2016-79138-R.J.I. acknowledges support from SVP-2014-068797, awarded by the Spanish Government.Peer ReviewedPostprint (author's final draft

    Protein binding site prediction using an empirical scoring function

    Get PDF
    Most biological processes are mediated by interactions between proteins and their interacting partners including proteins, nucleic acids and small molecules. This work establishes a method called PINUP for binding site prediction of monomeric proteins. With only two weight parameters to optimize, PINUP produces not only 42.2% coverage of actual interfaces (percentage of correctly predicted interface residues in actual interface residues) but also 44.5% accuracy in predicted interfaces (percentage of correctly predicted interface residues in the predicted interface residues) in a cross validation using a 57-protein dataset. By comparison, the expected accuracy via random prediction (percentage of actual interface residues in surface residues) is only 15%. The binding sites of the 57-protein set are found to be easier to predict than that of an independent test set of 68 proteins. The average coverage and accuracy for this independent test set are 30.5 and 29.4%, respectively. The significant gain of PINUP over expected random prediction is attributed to (i) effective residue-energy score and accessible-surface-area-dependent interface-propensity, (ii) isolation of functional constraints contained in the conservation score from the structural constraints through the combination of residue-energy score (for structural constraints) and conservation score and (iii) a consensus region built on top-ranked initial patches

    Composite structural motifs of binding sites for delineating biological functions of proteins

    Get PDF
    Most biological processes are described as a series of interactions between proteins and other molecules, and interactions are in turn described in terms of atomic structures. To annotate protein functions as sets of interaction states at atomic resolution, and thereby to better understand the relation between protein interactions and biological functions, we conducted exhaustive all-against-all atomic structure comparisons of all known binding sites for ligands including small molecules, proteins and nucleic acids, and identified recurring elementary motifs. By integrating the elementary motifs associated with each subunit, we defined composite motifs which represent context-dependent combinations of elementary motifs. It is demonstrated that function similarity can be better inferred from composite motif similarity compared to the similarity of protein sequences or of individual binding sites. By integrating the composite motifs associated with each protein function, we define meta-composite motifs each of which is regarded as a time-independent diagrammatic representation of a biological process. It is shown that meta-composite motifs provide richer annotations of biological processes than sequence clusters. The present results serve as a basis for bridging atomic structures to higher-order biological phenomena by classification and integration of binding site structures.Comment: 34 pages, 7 figure

    Comparing Kernels For Predicting Protein Binding Sites From Amino Acid Sequence

    Get PDF
    The ability to identify protein binding sites and to detect specific amino acid residues that contribute to the specificity and affinity of protein interactions has important implications for problems ranging from rational drug design to analysis of metabolic and signal transduction networks. Support vector machines (SVM) and related kernel methods offer an attractive approach to predicting protein binding sites. An appropriate choice of the kernel function is critical to the performance of SVM. Kernel functions offer a way to incorporate domain-specific knowledge into the classifier. We compare the performance of 3 types of kernels functions: identity kernel, sequence-alignment kernel, and amino acid substitution matrix kernel for predicting protein-protein, protein-DNA and protein-RNA binding sites. The results show that the identity kernel is quite effective in on all three tasks, with the substitution kernel based on amino acid substitution matrices that take into account structural or evolutionary conservation or physicochemical properties of amino acids yields modest improvement in the performance of the resulting SVM classifiers for predicting protein-protein, protein-DNA and protein-RNA binding sites

    Molecular Considerations In The Design Of Novel Alpha/Beta Hydrolase Inhibitors

    Get PDF
    Alpha/beta hydrolases (ABHs) are a superfamily of hydrolytic enzymes that process a wide variety of substrates. A subfamily of ABHs called carboxylesterases (CEs) are important enzymes that catalyze biological detoxification, hydrolysis of certain pesticides, and metabolism of many esterified drugs. The chemotherapy drug irinotecan used for treatment of colorectal cancer is metabolized to SN-38, the active drug metabolite, by two CE isozymes CES1 (localized in the liver) and CES2 (localized in the small intestines). CES2\u27s ability to activate irinotecan at a faster rate than CES1 creates a localization of activated SN-38 in the gut epithelium, resulting in the dose limiting side effect of delayed diarrhea. Development of inhibitors for the CE subfamily of ABHs could assist in ameliorating the toxic side effects associated with some esterified prodrugs such as irinotecan, and enhance the distribution of prodrugs in vivo. Hence, our research targets CES2 for inhibitor design with the goal of amelioration of intestinal cytotoxicity associated with irinotecan chemotherapy. In this work we (i) utilized QSAR technology to design and optimize novel sulfonamide CES2 inhibitors; (ii) combined QSAR with in silico design to generate new CE inhibitor scaffolds that maintained the potency of previous CE inhibitor generations, yet had improved water solubility; and ( iii) investigated the contribution of the loop 7 in CEs to sensitizing the enzyme to inhibition by sulfonamides through docking analysis. Our QSAR model, developed using 57 sulfonamide analogs, identified several features of this class of CE inhibitor that confer their potency. Using a QSAR model, constructed using 4 classes of CE inhibitors (benzils, benzoins, isatins, and sulfonamides), as a pocket site to perform in silico design we generated several new scaffolds predicted to have good solubility and potency. This work suggests that the inner loop 7 on CE plays a role in inhibitor selectivity, and interactions with this loop should be considered in the development of selective CE inhibitors. The contributions from this work will be applicable to the design of novel ABH inhibitors, help to increase the likelihood of these drugs entering in clinical use, and ameliorate the dose-limiting side effect associated with irinotecan

    Novel pharmacophore clustering methods for protein binding site comparison

    Full text link
    Proteins perform diverse functions within cells. Some of the functions depend on the protein being involved in a protein complex, interacting with other proteins or with other entities (ligands) through specific binding sites on their surface. Comparison of protein binding sites has potential benefits in many research fields, including drug promiscuity studies, polypharmacology and immunology. While multiple methods have been proposed for comparing binding sites, they tend to focus on comparing very similar proteins and have only been developed for small specific datasets or very targeted applications. None of these methods make use of the powerful representation afforded by 3D complex-based pharmacophores. A pharmacophore model provides a description of a binding site, consisting of a group of chemical features arranged in three-dimensional space, that can be used to represent biological activities. Two different pharmacophore comparison and clustering methods based on the Iterative Closest Point (ICP) algorithm are proposed: a 3-dimensional ICP pharmacophore clustering method, and an N-dimensional ICP pharmacophore clustering method. These methods are complemented by a series of data pre-processing methods for input data preparation. The implementation of the methods takes computational representations (pharmacophores) of single molecule or protein complexes as input and produces distance matrices that can be visualised as dendrograms. The methods integrate both alignment-dependent and alignment-independent concepts. Both clustering methods were successfully evaluated using a 31 globulin-binding steroid dataset and a 41 antibody-antigen dataset, and were able to handle a larger dataset of 159 protein homodimers. For the steroid dataset, the resulting classification of ligands shows good correspondence with a classification based on binding affinity. For the antibody-antigen dataset, the classification of antigens reflected both antigen type and binding antibody. The applications to homodimers demonstrated the ability of both clustering methods to handle a larger dataset, and the possibility to visualise N-D pairwise comparisons using structural superposition of binding sites
    corecore