108 research outputs found

    Characterizing and explaining the impact of disease-associated mutations in proteins without known structures or structural homologs

    Get PDF
    Mutations in human proteins lead to diseases. The structure of these proteins can help understand the mechanism of such diseases and develop therapeutics against them. With improved deep learning techniques, such as RoseTTAFold and AlphaFold, we can predict the structure of proteins even in the absence of structural homologs. We modeled and extracted the domains from 553 disease-associated human proteins without known protein structures or close homologs in the Protein Databank. We noticed that the model quality was higher and the Root mean square deviation (RMSD) lower between AlphaFold and RoseTTAFold models for domains that could be assigned to CATH families as compared to those which could only be assigned to Pfam families of unknown structure or could not be assigned to either. We predicted ligand-binding sites, protein–protein interfaces and conserved residues in these predicted structures. We then explored whether the disease-associated missense mutations were in the proximity of these predicted functional sites, whether they destabilized the protein structure based on ddG calculations or whether they were predicted to be pathogenic. We could explain 80% of these disease-associated mutations based on proximity to functional sites, structural destabilization or pathogenicity. When compared to polymorphisms, a larger percentage of disease-associated missense mutations were buried, closer to predicted functional sites, predicted as destabilizing and pathogenic. Usage of models from the two state-of-the-art techniques provide better confidence in our predictions, and we explain 93 additional mutations based on RoseTTAFold models which could not be explained based solely on AlphaFold models

    PDBImages: A Command Line Tool for Automated Macromolecular Structure Visualization

    Full text link
    Summary: PDBImages is an innovative, open-source Node.js package that harnesses the power of the popular macromolecule structure visualization software Mol*. Designed for use by the scientific community, PDBImages provides a means to generate high-quality images for PDB and AlphaFold DB models. Its unique ability to render and save images directly to files in a browserless mode sets it apart, offering users a streamlined, automated process for macromolecular structure visualization. Here, we detail the implementation of PDBImages, enumerating its diverse image types and elaborating on its user-friendly setup. This powerful tool opens a new gateway for researchers to visualize, analyse, and share their work, fostering a deeper understanding of bioinformatics. Availability and Implementation: PDBImages is available as an npm package from https://www.npmjs.com/package/pdb-images. The source code is available from https://github.com/PDBeurope/pdb-images. Contact: [email protected], [email protected]: 7 pages, 1 figure, to be submitted to Bioinformatic

    Polymyxins and quinazolines are LSD1/KDM1A inhibitors with unusual structural features

    Get PDF
    Because of its involvement in the progression of several malignant tumors, the histone lysine-specific demethylase 1 (LSD1) has become a prominent drug target in modern medicinal chemistry research. We report on the discovery of two classes of noncovalent inhibitors displaying unique structural features. The antibiotics polymyxins bind at the entrance of the substrate cleft, where their highly charged cyclic moiety interacts with a cluster of positively charged amino acids. The same site is occupied by quinazoline-based compounds, which were found to inhibit the enzyme through a most peculiar mode because they form a pile of five to seven molecules that obstruct access to the active center. These data significantly indicate unpredictable strategies for the development of epigenetic inhibitors

    AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms

    Get PDF
    Deep-learning (DL) methods like DeepMind's AlphaFold2 (AF2) have led to substantial improvements in protein structure prediction. We analyse confident AF2 models from 21 model organisms using a new classification protocol (CATH-Assign) which exploits novel DL methods for structural comparison and classification. Of ~370,000 confident models, 92% can be assigned to 3253 superfamilies in our CATH domain superfamily classification. The remaining cluster into 2367 putative novel superfamilies. Detailed manual analysis on 618 of these, having at least one human relative, reveal extremely remote homologies and further unusual features. Only 25 novel superfamilies could be confirmed. Although most models map to existing superfamilies, AF2 domains expand CATH by 67% and increases the number of unique 'global' folds by 36% and will provide valuable insights on structure function relationships. CATH-Assign will harness the huge expansion in structural data provided by DeepMind to rationalise evolutionary changes driving functional divergence

    Relevance of Rheological Properties of Sodium Alginate in Solution to Calcium Alginate Gel Properties

    Get PDF
    Abstract. The purpose of this study is to determine whether sodium alginate solutions' rheological parameters are meaningful relative to sodium alginate's use in the formulation of calcium alginate gels. Calcium alginate gels were prepared from six different grades of sodium alginate (FMC Biopolymer), one of which was available in ten batches. Cylindrical gel samples were prepared from each of the gels and subjected to compression to fracture on an Instron Universal Testing Machine, equipped with a 1-kN load cell, at a cross-head speed of 120 mm/min. Among the grades with similar % G, (grades 1, 3, and 4), there is a significant correlation between deformation work (L E ) and apparent viscosity (η app ). However, the results for the partial correlation analysis for all six grades of sodium alginate show that L E is significantly correlated with % G, but not with the rheological properties of the sodium alginate solutions. Studies of the ten batches of one grade of sodium alginate show that η app of their solutions did not correlate with L E while tan Ύ was significantly, but minimally, correlated to L E . These results suggest that other factors-polydispersity and the randomness of guluronic acid sequencing-are likely to influence the mechanical properties of the resultant gels. In summary, the rheological properties of solutions for different grades of sodium alginate are not indicative of the resultant gel properties. Interbatch differences in the rheological behavior for one specific grade of sodium alginate were insufficient to predict the corresponding calcium alginate gel's mechanical properties

    PDBe: towards reusable data delivery infrastructure at protein data bank in Europe

    Get PDF
    © 2017 The Authors. Published by OUP. This is an open access article available under a Creative Commons licence. The published version can be accessed at the following link on the publisher’s website: https://doi.org/10.1093/nar/gkx1070The Protein Data Bank in Europe (PDBe, pdbe.org) is actively engaged in the deposition, annotation, remediation, enrichment and dissemination of macromolecular structure data. This paper describes new developments and improvements at PDBe addressing three challenging areas: data enrichment, data dissemination and functional reusability. New features of the PDBe Web site are discussed, including a context dependent menu providing links to raw experimental data and improved presentation of structures solved by hybrid methods. The paper also summarizes the features of the LiteMol suite, which is a set of services enabling fast and interactive 3D visualization of structures, with associated experimental maps, annotations and quality assessment information. We introduce a library of Web components which can be easily reused to port data and functionality available at PDBe to other services. We also introduce updates to the SIFTS resource which maps PDB data to other bioinformatics resources, and the PDBe REST API.Wellcome Trust [104948]; UK Biotechnology and Biological Sciences Research Council [BB/M011674/1, BB/N019172/1, BB/M020347/1]; European Union [284209]; European Molecular Biology Laboratory (EMBL). Funding for open access charge: EMBL.Published versio

    Genome3D: exploiting structure to help users understand their sequences.

    Get PDF
    Genome3D (http://www.genome3d.eu) is a collaborative resource that provides predicted domain annotations and structural models for key sequences. Since introducing Genome3D in a previous NAR paper, we have substantially extended and improved the resource. We have annotated representatives from Pfam families to improve coverage of diverse sequences and added a fast sequence search to the website to allow users to find Genome3D-annotated sequences similar to their own. We have improved and extended the Genome3D data, enlarging the source data set from three model organisms to 10, and adding VIVACE, a resource new to Genome3D. We have analysed and updated Genome3D's SCOP/CATH mapping. Finally, we have improved the superposition tools, which now give users a more powerful interface for investigating similarities and differences between structural models

    3D-Beacons: decreasing the gap between protein sequences and structures through a federated network of protein structure data resources

    Get PDF
    While scientists can often infer the biological function of proteins from their 3-dimensional quaternary structures, the gap between the number of known protein sequences and their experimentally determined structures keeps increasing. A potential solution to this problem is presented by ever more sophisticated computational protein modeling approaches. While often powerful on their own, most methods have strengths and weaknesses. Therefore, it benefits researchers to examine models from various model providers and perform comparative analysis to identify what models can best address their specific use cases. To make data from a large array of model providers more easily accessible to the broader scientific community, we established 3D-Beacons, a collaborative initiative to create a federated network with unified data access mechanisms. The 3D-Beacons Network allows researchers to collate coordinate files and metadata for experimentally determined and theoretical protein models from state-of-the-art and specialist model providers and also from the Protein Data Bank
    • 

    corecore