835 research outputs found

    Partitioned Least Squares

    Get PDF
    In this paper we propose a variant of the linear least squares model allowing practitioners to partition the input features into groups of variables that they require to contribute similarly to the final result. The output allows practitioners to assess the importance of each group and of each variable in the group. We formally show that the new formulation is not convex and provide two alternative methods to deal with the problem: one non-exact method based on an alternating least squares approach; and one exact method based on a reformulation of the problem using an exponential number of sub-problems whose minimum is guaranteed to be the optimal solution. We formally show the correctness of the exact method and also compare the two solutions showing that the exact solution provides better results in a fraction of the time required by the alternating least squares solution (assuming that the number of partitions is small). For the sake of completeness, we also provide an alternative branch and bound algorithm that can be used in place of the exact method when the number of partitions is too large, and a proof of NP-completeness of the optimization problem introduced in this paper

    Structure of Tagatose-1,6-bisphosphate Aldolase. Insight into chiral discrimination, mechanism, and specificity of class II aldolases

    Get PDF
    Tagatose-1,6-bisphosphate aldolase (TBPA) is a tetrameric class II aldolase that catalyzes the reversible condensation of dihydroxyacetone phosphate with glyceraldehyde 3-phosphate to produce tagatose 1,6-bisphosphate. The high resolution (1.45 Å) crystal structure of the Escherichia coli enzyme, encoded by the agaY gene, complexed with phosphoglycolohydroxamate (PGH) has been determined. Two subunits comprise the asymmetric unit, and a crystallographic 2-fold axis generates the functional tetramer. A complex network of hydrogen bonds position side chains in the active site that is occupied by two cations. An unusual Na(+) binding site is created using a interaction with Tyr(183) in addition to five oxygen ligands. The catalytic Zn(2+) is five-coordinate using three histidine nitrogens and two PGH oxygens. Comparisons of TBPA with the related fructose-1,6-bisphosphate aldolase (FBPA) identifies common features with implications for the mechanism. Because the major product of the condensation catalyzed by the enzymes differs in the chirality at a single position, models of FBPA and TBPA with their cognate bisphosphate products provide insight into chiral discrimination by these aldolases. The TBPA active site is more open on one side than FBPA, and this contributes to a less specific enzyme. The availability of more space and a wider range of aldehyde partners used by TBPA together with the highly specific nature of FBPA suggest that TBPA might be a preferred enzyme to modify for use in biotransformation chemistry

    Ligand binding site superposition and comparison based on Atomic Property Fields: identification of distant homologues, convergent evolution and PDB-wide clustering of binding sites

    Get PDF
    A new binding site comparison algorithm using optimal superposition of the continuous pharmacophoric property distributions is reported. The method demonstrates high sensitivity in discovering both, distantly homologous and convergent binding sites. Good quality of superposition is also observed on multiple examples. Using the new approach, a measure of site similarity is derived and applied to clustering of ligand binding pockets in PDB

    Current perspectives in fragment based lead discovery (FBLD)

    Get PDF
    It is over 20 years since the first fragment-based discovery projects were disclosed. The methods are now mature for most ‘conventional’ targets in drug discovery such as enzymes (kinases and proteases) but there has also been growing success on more challenging targets, such as disruption of protein–protein interactions. The main application is to identify tractable chemical startpoints that non-covalently modulate the activity of a biological molecule. In this essay, we overview current practice in the methods and discuss how they have had an impact in lead discovery – generating a large number of fragment-derived compounds that are in clinical trials and two medicines treating patients. In addition, we discuss some of the more recent applications of the methods in chemical biology – providing chemical tools to investigate biological molecules, mechanisms and systems

    Intuitive, But Not Simple: Including Explicit Water Molecules in Protein-Protein Docking Simulations Improves Model Quality

    Get PDF
    Characterizing the nature of interaction between proteins that have not been experimentally co-crystallized requires a computational docking approach that can successfully predict the spatial conformation adopted in the complex. In this work, the Hydropathic INTeractions (HINT) force field model was used for scoring docked models in a data set of 30 high-resolution crystallographically characterized “dry” protein-protein complexes, and was shown to reliably identify native-like models. However, most current protein-protein docking algorithms fail to explicitly account for water molecules involved in bridging interactions that mediate and stabilize the association of the protein partners, so we used HINT to illuminate the physical and chemical properties of bridging waters and account for their energetic stabilizing contributions. The HINT water Relevance metric identified the ‘truly’ bridging waters at the 30 protein-protein interfaces and we utilized them in “solvated” docking by manually inserting them into the input files for the rigid body ZDOCK program. By accounting for these interfacial waters, a statistically significant improvement of ~24% in the average hit-count within the top-10 predictions the protein-protein dataset was seen, compared to standard “dry” docking. The results also show scoring improvement, with medium and high accuracy models ranking much better than incorrect ones. These improvements can be attributed to the physical presence of water molecules that alter surface properties and better represent native shape and hydropathic complementarity between interacting partners, with concomitantly more accurate native-like structure predictions

    The Cysteine-Rich Protein Thimet Oligopeptidase as a Model of the Structural Requirements for S-glutathiolation and Oxidative Oligomerization

    Get PDF
    Thimet oligopeptidase (EP24.15) is a cysteine-rich metallopeptidase containing fifteen Cys residues and no intra-protein disulfide bonds. Previous work on this enzyme revealed that the oxidative oligomerization of EP24.15 is triggered by S-glutathiolation at physiological GSSG levels (10–50 µM) via a mechanism based on thiol-disulfide exchange. In the present work, our aim was to identify EP24.15 Cys residues that are prone to S-glutathiolation and to determine which structural features in the cysteinyl bulk are responsible for the formation of mixed disulfides through the reaction with GSSG and, in this particular case, the Cys residues within EP24.15 that favor either S-glutathiolation or inter-protein thiol-disulfide exchange. These studies were conducted by in silico structural analyses and simulations as well as site-specific mutation. S-glutathiolation was determined by mass spectrometric analyses and western blotting with anti-glutathione antibody. The results indicated that the stabilization of a thiolate sulfhydryl and the solvent accessibility of the cysteines are necessary for S-thiolation. The Solvent Access Surface analysis of the Cys residues prone to glutathione modification showed that the S-glutathiolated Cys residues are located inside pockets where the sulfur atom comes into contact with the solvent and that the positively charged amino acids are directed toward these Cys residues. The simulation of a covalent glutathione docking onto the same Cys residues allowed for perfect glutathione posing. A mutation of the Arg residue 263 that forms a saline bridge to the Cys residue 175 significantly decreased the overall S-glutathiolation and oligomerization of EP24.15. The present results show for the first time the structural requirements for protein S-glutathiolation by GSSG and are consistent with our previous hypothesis that EP24.15 oligomerization is dependent on the electron transfer from specific protonated Cys residues of one molecule to previously S-glutathionylated Cys residues of another one

    Novel Druggable Hot Spots in Avian Influenza Neuraminidase H5N1 Revealed by Computational Solvent Mapping of a Reduced and Representative Receptor Ensemble

    Get PDF
    The influenza virus subtype H5N1 has raised concerns of a possible human pandemic threat because of its high virulence and mutation rate. Although several approved anti-influenza drugs effectively target the neuraminidase, some strains have already acquired resistance to the currently available anti-influenza drugs. In this study, we present the synergistic application of extended explicit solvent molecular dynamics (MD) and computational solvent mapping (CS-Map) to identify putative ‘hot spots’ within flexible binding regions of N1 neuraminidase. Using representative conformations of the N1 binding region extracted from a clustering analysis of four concatenated 40-ns MD simulations, CS-Map was utilized to assess the ability of small, solvent-sized molecules to bind within close proximity to the sialic acid binding region. Mapping analyses of the dominant MD conformations reveal the presence of additional hot spot regions in the 150- and 430-loop regions. Our hot spot analysis provides further support for the feasibility of developing high-affinity inhibitors capable of binding these regions, which appear to be unique to the N1 strain

    Optimal assignment methods for ligand-based virtual screening

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Ligand-based virtual screening experiments are an important task in the early drug discovery stage. An ambitious aim in each experiment is to disclose active structures based on new scaffolds. To perform these "scaffold-hoppings" for individual problems and targets, a plethora of different similarity methods based on diverse techniques were published in the last years. The optimal assignment approach on molecular graphs, a successful method in the field of quantitative structure-activity relationships, has not been tested as a ligand-based virtual screening method so far.</p> <p>Results</p> <p>We evaluated two already published and two new optimal assignment methods on various data sets. To emphasize the "scaffold-hopping" ability, we used the information of chemotype clustering analyses in our evaluation metrics. Comparisons with literature results show an improved early recognition performance and comparable results over the complete data set. A new method based on two different assignment steps shows an increased "scaffold-hopping" behavior together with a good early recognition performance.</p> <p>Conclusion</p> <p>The presented methods show a good combination of chemotype discovery and enrichment of active structures. Additionally, the optimal assignment on molecular graphs has the advantage to investigate and interpret the mappings, allowing precise modifications of internal parameters of the similarity measure for specific targets. All methods have low computation times which make them applicable to screen large data sets.</p
    corecore