
105 research outputs found

Characterizing and explaining the impact of disease-associated mutations in proteins without known structures or structural homologs

Author: Anishchenko Ivan
Baker David
Bordin Nicola
Orengo Christine
Sen Neeladri
Sillitoe Ian
Velankar Sameer
Publication venue: Oxford University Press (OUP)
Publication date: 01/06/2022

Mutations in human proteins lead to diseases. The structure of these proteins can help understand the mechanism of such diseases and develop therapeutics against them. With improved deep learning techniques, such as RoseTTAFold and AlphaFold, we can predict the structure of proteins even in the absence of structural homologs. We modeled and extracted the domains from 553 disease-associated human proteins without known protein structures or close homologs in the Protein Data Bank. We noticed that the model quality was higher and the root mean square deviation (RMSD) lower between AlphaFold and RoseTTAFold models for domains that could be assigned to CATH families as compared to those which could only be assigned to Pfam families of unknown structure or could not be assigned to either. We predicted ligand-binding sites, protein–protein interfaces and conserved residues in these predicted structures. We then explored whether the disease-associated missense mutations were in the proximity of these predicted functional sites, whether they destabilized the protein structure based on ddG calculations or whether they were predicted to be pathogenic. We could explain 80% of these disease-associated mutations based on proximity to functional sites, structural destabilization or pathogenicity. When compared to polymorphisms, a larger percentage of disease-associated missense mutations were buried, closer to predicted functional sites, predicted as destabilizing and pathogenic. Using models from the two state-of-the-art techniques provides better confidence in our predictions, and we explain 93 additional mutations based on RoseTTAFold models which could not be explained based solely on AlphaFold models.

UCL Discovery

PubMed Central

PDBImages: A Command Line Tool for Automated Macromolecular Structure Visualization

Author: Anyango Stephen
Deshpande Mandar
Midlik Adam
Nair Sreenath
Sehnal David
Varadi Mihaly
Velankar Sameer
Publication date: 01/08/2023

Summary: PDBImages is an innovative, open-source Node.js package that harnesses the power of the popular macromolecule structure visualization software Mol*. Designed for use by the scientific community, PDBImages provides a means to generate high-quality images for PDB and AlphaFold DB models. Its unique ability to render and save images directly to files in a browserless mode sets it apart, offering users a streamlined, automated process for macromolecular structure visualization. Here, we detail the implementation of PDBImages, enumerating its diverse image types and elaborating on its user-friendly setup. This powerful tool opens a new gateway for researchers to visualize, analyse, and share their work, fostering a deeper understanding of bioinformatics. Availability and Implementation: PDBImages is available as an npm package from https://www.npmjs.com/package/pdb-images. The source code is available from https://github.com/PDBeurope/pdb-images. Contact: [email protected], [email protected]. 7 pages, 1 figure, to be submitted to Bioinformatics.

arXiv.org e-Print Archive

Polymyxins and quinazolines are LSD1/KDM1A inhibitors with unusual structural features

Author: Ciossani Giuseppe
Forgione Mariantonietta
Forneris Federico
Lucidi Alessia
Mai Antonello
Marrocco Biagina
Mattevi Andrea
Mehdipour Parinaz
Pilotto Simona
Rotili Dante
Speranzini Valentina
Velankar Sameer
Publication venue: American Association for the Advancement of Science (AAAS)
Publication date: 01/01/2016

Because of its involvement in the progression of several malignant tumors, the histone lysine-specific demethylase 1 (LSD1) has become a prominent drug target in modern medicinal chemistry research. We report on the discovery of two classes of noncovalent inhibitors displaying unique structural features. The antibiotics polymyxins bind at the entrance of the substrate cleft, where their highly charged cyclic moiety interacts with a cluster of positively charged amino acids. The same site is occupied by quinazoline-based compounds, which were found to inhibit the enzyme through a most peculiar mode because they form a pile of five to seven molecules that obstruct access to the active center. These data significantly indicate unpredictable strategies for the development of epigenetic inhibitors

Archivio Istituzionale della Ricerca - Università degli Studi di Pavia

PubMed Central

UCL Discovery

Archivio della ricerca- Università di Roma La Sapienza

AlphaFold2 reveals commonalities and novelties in protein structure space for 21 model organisms

Author: Bordin Nicola
Heinzinger Michael
Kim Stephanie
Lam Su Datt
Littmann Maria
Nallapareddy Vamsi
Orengo Christine
Rauer Clemens
Rost Burkhard
Sen Neeladri
Sillitoe Ian
Steinegger Martin
Velankar Sameer
Waman Vaishali P
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2023

Deep-learning (DL) methods like DeepMind's AlphaFold2 (AF2) have led to substantial improvements in protein structure prediction. We analyse confident AF2 models from 21 model organisms using a new classification protocol (CATH-Assign) which exploits novel DL methods for structural comparison and classification. Of ~370,000 confident models, 92% can be assigned to 3253 superfamilies in our CATH domain superfamily classification. The remaining models cluster into 2367 putative novel superfamilies. Detailed manual analysis of 618 of these, each having at least one human relative, reveals extremely remote homologies and further unusual features. Only 25 novel superfamilies could be confirmed. Although most models map to existing superfamilies, AF2 domains expand CATH by 67% and increase the number of unique 'global' folds by 36%, and will provide valuable insights on structure-function relationships. CATH-Assign will harness the huge expansion in structural data provided by DeepMind to rationalise evolutionary changes driving functional divergence.

UCL Discovery


Genome3D: integrating a collaborative data pipeline to expand the depth and breadth of consensus protein structure annotation.

Author: Andreeva Antonina
Blundell Tom L
Buchan Daniel WA
Finn Robert D
Gough Julian
Jones David
Kelley Lawrence A
Lam Su Datt
Murzin Alexey G
Orengo Christine
Pandurangan Arun Prasad
Paysan-Lafosse Typhaine
Salazar Gustavo A
Sillitoe Ian
Skwark Marcin J
Sternberg Michael JE
Velankar Sameer
Publication venue: Nucleic Acids Res
Publication date: 08/01/2020

Genome3D (https://www.genome3d.eu) is a freely available resource that provides consensus structural annotations for representative protein sequences taken from a selection of model organisms. Since the last NAR update in 2015, the method of data submission has been overhauled, with annotations now being 'pushed' to the database via an API. As a result, contributing groups are now able to manage their own structural annotations, making the resource more flexible and maintainable. The new submission protocol brings a number of additional benefits, including instant validation of data and no requirement to synchronise releases between resources. It also makes it possible to implement the submission of these structural annotations as an automated part of existing internal workflows. In turn, these improvements facilitate Genome3D being opened up to new prediction algorithms and groups. For the latest release of Genome3D (v2.1), the underlying dataset of sequences used as prediction targets has been updated using the latest reference proteomes available in UniProtKB. A number of new reference proteomes of particular interest to the wider scientific community have also been added: cow, pig, wheat and Mycobacterium tuberculosis. These additions, along with improvements to the underlying predictions from contributing resources, have ensured that the number of annotations in Genome3D has nearly doubled since the last NAR update article. The new API has also been used to facilitate the dissemination of Genome3D data into InterPro, thereby widening the visibility of both the annotation data and annotation algorithms.

Apollo (Cambridge)

Relevance of Rheological Properties of Sodium Alginate in Solution to Calcium Alginate Gel Properties

Author: Allison Radwick
Ankur Thacker
Diana M Sperger
Eric J Munson
Ira S Buckner
Lawrence H Block
Otilia Koo
Riccardo L Boni
Sachin Velankar
Sameer Late
Shao Fu
Thomas Farrell
Publication date: 24/04/2020

Abstract. The purpose of this study is to determine whether sodium alginate solutions' rheological parameters are meaningful relative to sodium alginate's use in the formulation of calcium alginate gels. Calcium alginate gels were prepared from six different grades of sodium alginate (FMC Biopolymer), one of which was available in ten batches. Cylindrical gel samples were prepared from each of the gels and subjected to compression to fracture on an Instron Universal Testing Machine, equipped with a 1-kN load cell, at a cross-head speed of 120 mm/min. Among the grades with similar %G (grades 1, 3, and 4), there is a significant correlation between deformation work (L_E) and apparent viscosity (η_app). However, the results of the partial correlation analysis for all six grades of sodium alginate show that L_E is significantly correlated with %G, but not with the rheological properties of the sodium alginate solutions. Studies of the ten batches of one grade of sodium alginate show that η_app of their solutions did not correlate with L_E, while tan δ was significantly, but minimally, correlated with L_E. These results suggest that other factors (polydispersity and the randomness of guluronic acid sequencing) are likely to influence the mechanical properties of the resultant gels. In summary, the rheological properties of solutions for different grades of sodium alginate are not indicative of the resultant gel properties. Interbatch differences in the rheological behavior for one specific grade of sodium alginate were insufficient to predict the corresponding calcium alginate gel's mechanical properties.

CiteSeerX

PDBe: towards reusable data delivery infrastructure at protein data bank in Europe

Author: Alhroub Younes
Anyango Stephen
Armstrong David R
Berrisford John M
Clark Alice R
Conroy Matthew J
Dana Jose M
Deshpande Mandar
Gupta Deepti
Gutmanas Aleksandras
Haslam Pauline
Kleywegt Gerard J
Mak Lora
Mir Saqib
Mukhopadhyay Abhik
Nadzirin Nurul
Paysan-Lafosse Typhaine
Sehnal David
Sen Sanchayita
Smart Oliver S
Varadi Mihaly
Velankar Sameer
Publication venue: Oxford University Press (OUP)
Publication date: 26/10/2017

© 2017 The Authors. Published by OUP. This is an open access article available under a Creative Commons licence. The published version can be accessed at the following link on the publisher’s website: https://doi.org/10.1093/nar/gkx1070. The Protein Data Bank in Europe (PDBe, pdbe.org) is actively engaged in the deposition, annotation, remediation, enrichment and dissemination of macromolecular structure data. This paper describes new developments and improvements at PDBe addressing three challenging areas: data enrichment, data dissemination and functional reusability. New features of the PDBe Web site are discussed, including a context dependent menu providing links to raw experimental data and improved presentation of structures solved by hybrid methods. The paper also summarizes the features of the LiteMol suite, which is a set of services enabling fast and interactive 3D visualization of structures, with associated experimental maps, annotations and quality assessment information. We introduce a library of Web components which can be easily reused to port data and functionality available at PDBe to other services. We also introduce updates to the SIFTS resource which maps PDB data to other bioinformatics resources, and the PDBe REST API. Wellcome Trust [104948]; UK Biotechnology and Biological Sciences Research Council [BB/M011674/1, BB/N019172/1, BB/M020347/1]; European Union [284209]; European Molecular Biology Laboratory (EMBL). Funding for open access charge: EMBL. Published version.

Crossref

Wolverhampton Intellectual Repository and E-theses

Genome3D: exploiting structure to help users understand their sequences.

Author: Andreeva Antonina
Blundell Tom L
Buchan Daniel WA
Chothia Cyrus
Cozzetto Domenico
Dana José M
Filippis Ioannis
Gough Julian
Jones David T
Kelley Lawrence A
Kleywegt Gerard J
Lewis Tony E
Minneci Federico
Mistry Jaina
Murzin Alexey G
Oates Matt E
Ochoa-Montaño Bernardo
Orengo Christine
Punta Marco
Rackham Owen JL
Sillitoe Ian
Stahlhacke Jonathan
Sternberg Michael JE
Velankar Sameer
Publication venue: Nucleic Acids Res
Publication date: 27/10/2014

Genome3D (http://www.genome3d.eu) is a collaborative resource that provides predicted domain annotations and structural models for key sequences. Since introducing Genome3D in a previous NAR paper, we have substantially extended and improved the resource. We have annotated representatives from Pfam families to improve coverage of diverse sequences and added a fast sequence search to the website to allow users to find Genome3D-annotated sequences similar to their own. We have improved and extended the Genome3D data, enlarging the source data set from three model organisms to 10, and adding VIVACE, a resource new to Genome3D. We have analysed and updated Genome3D's SCOP/CATH mapping. Finally, we have improved the superposition tools, which now give users a more powerful interface for investigating similarities and differences between structural models

Goldsmiths Research Online

Southampton (e-Prints Soton)

Crossref

PubMed Central

UCL Discovery

Spiral - Imperial College Digital Repository

Apollo (Cambridge)

3D-Beacons: decreasing the gap between protein sequences and structures through a federated network of protein structure data resources

Author: Anyango Stephen
Bienert Stefan
Borges Clemente
Deshpande Mandar
Green Tim
Hassabis Demis
Hatos Andras
Hegedus Tamas
Hekkelman Maarten L
Joosten Robbie
Jumper John
Laydon Agata
Molodenskiy Dmitry
Nair Sreenath
Orengo Christine
Piovesan Damiano
Salladini Edoardo
Salzberg Steven L
Schwede Torsten
Sillitoe Ian
Sommer Markus J
Steinegger Martin
Suhajda Erzsebet
Svergun Dmitri
Tauriello Gerardo
Tenorio-Ku Luiggi
Tosatto Silvio
Tunyasuvunakool Kathryn
Varadi Mihaly
Velankar Sameer
Waterhouse Andrew Mark
Žídek Augustin
Publication venue: Oxford University Press (OUP)
Publication date: 01/01/2022

While scientists can often infer the biological function of proteins from their 3-dimensional quaternary structures, the gap between the number of known protein sequences and their experimentally determined structures keeps increasing. A potential solution to this problem is presented by ever more sophisticated computational protein modeling approaches. While often powerful on their own, most methods have strengths and weaknesses. Therefore, it benefits researchers to examine models from various model providers and perform comparative analysis to identify what models can best address their specific use cases. To make data from a large array of model providers more easily accessible to the broader scientific community, we established 3D-Beacons, a collaborative initiative to create a federated network with unified data access mechanisms. The 3D-Beacons Network allows researchers to collate coordinate files and metadata for experimentally determined and theoretical protein models from state-of-the-art and specialist model providers and also from the Protein Data Bank

edoc

PubMed Central

UCL Discovery

Archivio istituzionale della ricerca - Università di Padova