Specialized dynamical properties of promiscuous residues revealed by simulated conformational ensembles

Author: Abecasis G. R.
Alessandro Pandini
Altschul S. F.
Altshuler D. M.
Amadei A.
Aranda B.
Arianna Fornili
Bahar I.
Berendsen H.
Bhardwaj N.
Bobay B. G.
Boehr D. D.
Bogan A. A.
Bordogna A.
Bouvier B.
Brookes A. J.
Camacho C.
Carbonell P.
Chandonia J.-M.
Cover T. M.
Cukuroglu E.
Cumming G.
Daily M. D.
Dasgupta B.
Davis F. P.
de Groot B. L.
De Simone A.
del Sol A.
DeLano W.
Dobbins S. E.
Dong Q.
Doruker P.
Dosztányi Z.
Dunbrack R. L.
Dyson H. J.
Echave J.
Ekman D.
Erijman A.
Essmann U.
Eyrisch S.
Fernández A.
Ferrer-Costa C.
Fong J. H.
Fornili A.
Franca Fraternali
Fraternali F.
Goldenberg O.
Haliloglu T.
Hamosh A.
Han J.-D. J.
Hess B.
Higurashi M.
Hub J. S.
Hui-Chun Lu
Humphris E. L.
Jeong H.
Jones S.
Jorgensen W.
Kabsch W.
Kar G.
Keskin O.
Kiel C.
Kim P. M.
Kim S.
Kleinjung J.
Kohn J. E.
Kortemme T.
Krissinel E.
Kuttner Y. Y.
Kuzu G.
Lange O. F.
Li X.
Liu L.
Lounnas V.
Maguid S.
Margreitter C.
Martin A. C. R.
Meireles L.
Meireles L. M. C.
Micheletti C.
Mittag T.
Münz M.
Nussinov R.
Pandini A.
Park B. H.
Patil A.
Peters J. H.
Petrov D.
Poirot O.
Qin H.
R-Development-Core-Team
Rajamani D.
Roulston M.
Rousseeuw P.
Schlitter J.
Schäfer H.
Seeliger D.
Sherry S. T.
Stein A.
Tsai C.-J.
Tuncbag N.
Tyagi M.
van der Spoel D.
Van Gunsteren W.
Vogel C.
Volkman B. F.
Wells J. A.
Winget J. M.
Wolfe R.
Yogurtcu O. N.
Zen A.
Zhang Q. C.
Zheng W.
Zhu X.
Publication venue: 'American Chemical Society (ACS)'
Publication date: 18/10/2013

The ability to interact with different partners is one of the most important features of proteins. Proteins that bind a large number of partners (hubs) have often been associated with intrinsic disorder. However, many examples exist of hubs with an ordered structure, and evidence of a general mechanism promoting promiscuity in ordered proteins is still elusive. An intriguing hypothesis is that promiscuous binding sites have specific dynamical properties, distinct from the rest of the interface and pre-existing in the protein isolated state. Here, we present the first comprehensive study of the intrinsic dynamics of promiscuous residues in a large protein data set. Different computational methods, from coarse-grained elastic models to geometry-based sampling methods and to full-atom Molecular Dynamics simulations, were used to generate conformational ensembles for the isolated proteins. The flexibility and dynamic correlations of interface residues with different degrees of binding promiscuity were calculated and compared considering side chain and backbone motions, the latter on both a local and a global scale. The study revealed that (a) promiscuous residues tend to be more flexible than nonpromiscuous ones, (b) this additional flexibility has a higher degree of organization, and (c) evolutionary conservation and binding promiscuity have opposite effects on intrinsic dynamics. Findings on simulated ensembles were also validated on ensembles of experimental structures extracted from the Protein Data Bank (PDB). Additionally, the low occurrence of single nucleotide polymorphisms observed for promiscuous residues indicated a tendency to preserve binding diversity at these positions. A case study on two ubiquitin-like proteins exemplifies how binding promiscuity in evolutionarily related proteins can be modulated by the fine-tuning of the interface dynamics. The interplay between promiscuity and flexibility highlighted here can inspire new directions in protein-protein interaction prediction and design methods. © 2013 American Chemical Society

Assessment of predicted enzymatic activity of α‐N‐acetylglucosaminidase variants of unknown significance for CAGI 2016

Author: Andreoletti G
Babbi G
Bakolitsa C
Brenner SE
Bromberg Y
Casadio R
Clark WT
Dunbrack R
Folkman L
Ford CT
Hu Z
Ivojac PRR
Jones D
Kasak L
Katsonis P
Kundu K
LeBowitz JH
Lichtarge O
Martelli PL
Mooney SD
Moult J
Nodzak C
Pal LR
Pejaver V
Savojardo C
Shi X
Uppal A
Wang M
Wei L
Xu Q
Yin Y
Yu GK
Zhou Y
Publication venue: 'Royal College of Obstetricians & Gynaecologists (RCOG)'
Publication date: 01/09/2019

The NAGLU challenge of the fourth edition of the Critical Assessment of Genome Interpretation experiment (CAGI4) in 2016 invited participants to predict the impact of variants of unknown significance (VUS) on the enzymatic activity of the lysosomal hydrolase α‐N‐acetylglucosaminidase (NAGLU). Deficiencies in NAGLU activity lead to a rare, monogenic, recessive lysosomal storage disorder, Sanfilippo syndrome type B (MPS type IIIB). This challenge attracted 17 submissions from 10 groups. We observed that top models were able to predict the impact of missense mutations on enzymatic activity with Pearson's correlation coefficients of up to 0.61. We also observed that top methods were significantly more correlated with each other than they were with observed enzymatic activity values, which we believe speaks to the importance of sequence conservation across the different methods. Improved functional predictions on the VUS will help population‐scale analysis of disease epidemiology and rare variant association analysis.
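The assessment metric named in this abstract, Pearson's correlation between predicted and observed enzymatic activity, can be illustrated with a minimal sketch. The function name `pearson_r` and the activity values below are hypothetical and are not taken from the CAGI data; the code only shows how such a coefficient is computed.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of co-deviations and sums of squared deviations from the means.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical relative activities (fraction of wild type) for four variants.
observed = [0.10, 0.50, 0.90, 0.30]
predicted = [0.20, 0.40, 0.70, 0.50]
print(f"r = {pearson_r(observed, predicted):.2f}")
```

The same function could be applied to two sets of predictions to compare methods with each other, which is the kind of comparison the abstract reports.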

A Generic Program for Multistate Protein Design

Some protein design tasks cannot be modeled by the traditional single state design strategy of finding a sequence that is optimal for a single fixed backbone. Such cases require multistate design, where a single sequence is threaded onto multiple backbones (states) and evaluated for its strengths and weaknesses on each backbone. For example, to design a protein that can switch between two specific conformations, it is necessary to find a sequence that is compatible with both backbone conformations. We present in this paper a generic implementation of multistate design that is suited for a wide range of protein design tasks and demonstrate in silico its capabilities on two design tasks: one of redesigning an obligate homodimer into an obligate heterodimer such that the new monomers would not homodimerize, and one of redesigning a promiscuous interface to bind to only a single partner and to no longer bind the rest of its partners. Both tasks contained negative design in that multistate design was asked to find sequences that would produce high energies for several of the states being modeled. Success at negative design was assessed by computationally redocking the undesired protein-pair interactions; we found that multistate design's accuracy improved as the diversity of conformations for the undesired protein-pair interactions increased. The paper concludes with a discussion of the pitfalls of negative design, which has proven considerably more challenging than positive design.
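The idea of scoring one sequence across several states, with some states to be favored and others to be disfavored, can be sketched as follows. This is not the program described in the abstract: the fitness function (sum of energies on desired states minus sum on undesired states), the state names, and the energy values are all assumptions chosen for illustration.

```python
def multistate_fitness(energies, positive_states, negative_states):
    """Lower is better: low energy on desired states, high on undesired ones.

    `energies` maps a state name to the (hypothetical) energy of one
    sequence threaded onto that state's backbone.
    """
    positive = sum(energies[state] for state in positive_states)
    negative = sum(energies[state] for state in negative_states)
    return positive - negative


def best_sequence(candidates, positive_states, negative_states):
    """Return the candidate sequence with the lowest multistate fitness."""
    return min(
        candidates,
        key=lambda seq: multistate_fitness(
            candidates[seq], positive_states, negative_states
        ),
    )


# Made-up energies for two candidate sequences on two states.
candidates = {
    "SEQ_A": {"heterodimer": -10.0, "homodimer": -9.0},
    "SEQ_B": {"heterodimer": -8.0, "homodimer": 5.0},
}
winner = best_sequence(candidates, ["heterodimer"], ["homodimer"])
print(winner)
```

In this toy setup SEQ_A binds slightly better as a heterodimer but also homodimerizes, so the sequence that destabilizes the undesired state is preferred; a real fitness function would need to weight the states more carefully.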

A Mathematical Framework for Protein Structure Comparison

Author: A Srivastava
A Zemla
AG Murzin
Anuj Srivastava
AR Ortiz
AS Konagurthu
B Kolbeck
C Berbalk
CA Orengo
DL Theobald
E Klassen
E Krissinel
F Teichert
G Mayr
H Hasegawa
HM Berman
IN Shindyalov
J Dundas
J Ebert
J Zhang
J Zhu
JF Gibrat
Jinfeng Zhang
K Illergard
L Holm
L Lo Conte
M Levitt
M Menke
M Shatsky
MJ Sippl
N Furnham
O Dror
P Koehl
PD Dobson
QS Du
R Kolodny
R Mosca
Roland L. Dunbrack
S Kurtek
SH Joshi
SR Eddy
VA Ilyin
W Mio
Wei Liu
WR Taylor
X Zhou
Y Ye
Y Zhang
YJ Huang
Publication venue: Public Library of Science
Publication date: 03/02/2011

Comparison of protein structures is important for revealing the evolutionary relationship among proteins, predicting protein functions and predicting protein structures. Many methods have been developed in the past to align two or multiple protein structures. Despite the importance of this problem, rigorous mathematical or statistical frameworks have seldom been pursued for general protein structure comparison. One notable issue in this field is that, although many different distances are used to measure the similarity between protein structures, none of them is a proper distance when protein structures of different sequences are compared. Statistical approaches based on those non-proper distances or similarity scores as random variables are thus not mathematically rigorous. In this work, we develop a mathematical framework for protein structure comparison by treating protein structures as three-dimensional curves. Using an elastic Riemannian metric on spaces of curves, the geodesic distance, a proper distance on spaces of curves, can be computed for any two protein structures. In this framework, protein structures can be treated as random variables on the shape manifold, and means and covariance can be computed for populations of protein structures. Furthermore, these moments can be used to build Gaussian-type probability distributions of protein structures for use in hypothesis testing. The covariance of a population of protein structures can reveal the population-specific variations and be helpful in improving structure classification. With curves representing protein structures, the matching is performed using elastic shape analysis of curves, which can effectively model conformational changes and insertions/deletions. We show that our method performs comparably with commonly used methods in protein structure classification on a large manually annotated data set.

PDBe-KB: collaboratively defining the biological context of structural data

Author: Al-Lazikani B.
Andreini C.
Anyango S.
Armstrong D.
Barton G. J.
Bednar D.
Berka K.
Berrisford J.
Blundell T.
Brock K. P.
Carazo J. M.
Choudhary P.
Damborsky J.
David A.
Deshpande M.
Dey S.
Dunbrack R.
Fraternali F.
Gibson T.
Helmer Citterich M.
Hoksza D.
Hopf T.
Jakubec D.
Kannan N.
Krivak R.
Kumar M.
Levy E. D.
London N.
Macias J. R.
Marks D. S.
Martens L.
McGowan S. A.
McGreig J. E.
Modi V.
Nadzirin N.
Nair S. S.
Orengo C.
Parra R. G.
Pepe G.
Piovesan D.
Pravda L.
Prilusky J.
Putignano V.
Radusky L. G.
Ramasamy P.
Rausch A. O.
Recio J. F.
Reuter N.
Rodriguez L. A.
Rollins N. J.
Rosato A.
Rubach P.
Serrano L.
Singh G.
Skoda P.
Sorzano C. O. S.
Srivatsan M. M.
Sternberg M.
Stourac J.
Sulkowska J. I.
Svobodova R.
Tanweer A.
Thornton J.
Tichshenko N.
Tosatto S. C. E.
Varadi M.
Velankar S.
Vranken W.
Wass M. N.
Xue D.
Zaidman D.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2022

The Protein Data Bank in Europe - Knowledge Base (PDBe-KB, https://pdbe-kb.org) is an open collaboration between world-leading specialist data resources contributing functional and biophysical annotations derived from or relevant to the Protein Data Bank (PDB). The goal of PDBe-KB is to place macromolecular structure data in their biological context by developing standardised data exchange formats and integrating functional annotations from the contributing partner resources into a knowledge graph that can provide valuable biological insights. Since we described PDBe-KB in 2019, there have been significant improvements in the variety of available annotation data sets and user functionality. Here, we provide an overview of the consortium, highlighting the addition of annotations such as predicted covalent binders, phosphorylation sites, effects of mutations on the protein structure and energetic local frustration. In addition, we describe a library of reusable web-based visualisation components and introduce new features such as a bulk download data service and a novel superposition service that generates clusters of superposed protein chains weekly for the whole PDB archive

Highly Sensitive Detection of Individual HEAT and ARM Repeats with HHpred and COACH

Author: A Biegert
AF Neuwald
AG Murzin
B Kobe
B Riggleman
BA Hemmings
C Petosa
CD deBakker
Dietlind L. Gerloff
EF Pettersen
EL Sonnhammer
F Kippert
F Rook
Fred Kippert
GE Crooks
H Striegl
HM Berman
HS Malik
J Söding
Jason E. Stajich
K Karplus
L Aravind
L Jaroszewski
M Peifer
MA Andrade
P März
R Yano
RC Edgar
RL Dunbrack
SF Altschul
SR Eddy
US Cho
Y Xu
Publication venue: Public Library of Science
Publication date: 01/09/2009

BACKGROUND: HEAT and ARM repeats occur in a large number of eukaryotic proteins. As these repeats are often highly diverged, the prediction of HEAT or ARM domains can be challenging. Except for the most clear-cut cases, identification at the individual repeat level is indispensable, in particular for determining domain boundaries. However, methods using single sequence queries do not have the sensitivity required to deal with more divergent repeats and, when applied to proteins with known structures, in some cases failed to detect a single repeat. METHODOLOGY AND PRINCIPAL FINDINGS: Testing algorithms which use multiple sequence alignments as queries, we found two of them, HHpred and COACH, to detect HEAT and ARM repeats with greatly enhanced sensitivity. Calibration against experimentally determined structures suggests the use of three score classes with increasing confidence in the prediction, and prediction thresholds for each method. When we applied a new protocol using both HHpred and COACH to these structures, it detected 82% of HEAT repeats and 90% of ARM repeats, with the minimum for a given protein of 57% for HEAT repeats and 60% for ARM repeats. Application to bona fide HEAT and ARM proteins or domains indicated that similar numbers can be expected for the full complement of HEAT/ARM proteins. A systematic screen of the Protein Data Bank for false positive hits revealed their number to be low, in particular for ARM repeats. Double false positive hits for a given protein were rare for HEAT and not at all observed for ARM repeats. In combination with fold prediction and consistency checking (multiple sequence alignments, secondary structure prediction, and position analysis), repeat prediction with the new HHpred/COACH protocol dramatically improves prediction in the twilight zone of fold prediction methods, as well as the delineation of HEAT/ARM domain boundaries.
SIGNIFICANCE: A protocol is presented for the identification of individual HEAT or ARM repeats which is straightforward to implement. It provides high sensitivity at a low false positive rate and will therefore greatly enhance the accuracy of predictions of HEAT and ARM domains.

A three dimensional visualisation approach to protein heavy-atom structure reconstruction

Author: A Roy
AG Murzin
Alireza Chenani
Antti J Niemi
EO Purisima
GN Murshudov
GN Ramachandran
H Schrauber
HA Scheraga
HM Berman
I Sillitoe
J Janin
JW Ponder
K Dill
K Hinsen
L Holm
LX Peterson
M Lundgren
MA DePristo
MS Shapovalov
NS Alexander
O Carugo
P Rotkiewicz
PD Adams
PL Freddolino
R Chandrasekaran
RA Engh
RA Laskowski
RL Dunbrack Jr
S Hu
S Subramaniam
SC Lovell
Shuangwei Hu
SM Islam
T Kirys
T Schwede
TA Jones
VB Chen
WG Touw
X Qu
Xubiao Peng
Y Li
Y Zhang
Yifan Zhou
Publication venue: 'Springer Science and Business Media LLC'

Candidate Variants in DNA Replication and Repair Genes in Early-Onset Renal Cell Carcinoma Patients Referred for Germline Testing

Author: Andrake Mark D.
Arora Sanjeevani
Chen David Y.T.
Daly Mary B.
Demidova Elena V.
Dunbrack Roland L.
Golemis Erica A.
Hall Michael J.
Hartman Tiffiney R.
Kelow Simon
Kent Tatiana
Pomerantz Richard T.
Rosen Gail L.
Serebriiskii Ilya G.
Virtucio James
Vlasenkova Ramilia
Publication venue: Jefferson Digital Commons
Publication date: 24/04/2023

Background: Early-onset renal cell carcinoma (eoRCC) is typically associated with pathogenic germline variants (PGVs) in RCC familial syndrome genes. However, most eoRCC patients lack PGVs in familial RCC genes and their genetic risk remains undefined. Methods: Here, we analyzed biospecimens from 22 eoRCC patients who were seen at our institution for genetic counseling and tested negative for PGVs in RCC familial syndrome genes. Results: Analysis of whole-exome sequencing (WES) data found enrichment of candidate pathogenic germline variants in DNA repair and replication genes, including multiple DNA polymerases. Induction of DNA damage in peripheral blood monocytes (PBMCs) significantly elevated numbers of γH2AX foci, a marker of double-stranded breaks, in PBMCs from eoRCC patients versus PBMCs from matched cancer-free controls. Knockdown of candidate variant genes in Caki RCC cells increased γH2AX foci. Immortalized patient-derived B cell lines bearing the candidate variants in DNA polymerase genes (POLD1, POLH, POLE, POLK) had DNA replication defects compared to control cells. Renal tumors carrying these DNA polymerase variants were microsatellite stable but had a high mutational burden. Direct biochemical analysis of the variant Pol δ and Pol η polymerases revealed defective enzymatic activities. Conclusions: Together, these results suggest that constitutional defects in DNA repair underlie a subset of eoRCC cases. Screening patient lymphocytes to identify these defects may provide insight into mechanisms of carcinogenesis in a subset of genetically undefined eoRCCs. Evaluation of DNA repair defects may also provide insight into the cancer initiation mechanisms for subsets of eoRCCs and lay the foundation for targeting DNA repair vulnerabilities in eoRCC.

Rational Mutational Analysis of a Multidrug MFS Transporter CaMdr1p of Candida albicans by Employing a Membrane Environment Based Computational Approach

Author: A Chandrasekaran
A Decottignies
A Krogh
A Sali
AC Tutulan-Cunita
Ajeeta Kaushiki
Andrew M. Lynn
B Reva
CD Livingstone
CE Shannon
CJ Law
CP Chen
GJ Barton
I Sa-Correia
IT Paulsen
J Abramson
JA Capra
JD Fischer
JL Ditty
K Koike
K Nakamura
K Wang
Khyati Kapoor
L Guan
L Li
LC Martin
M Gaur
M Schiffer
Mohd Rehan
N Puri
N Sigal
P Saini
PC Ng
PK Srivastava
Q Yang
R Egner
R Ernst
R Pasrija
R Prasad
Rajendra Prasad
RC Edgar
RE De
Ritu Pasrija
Roland Dunbrack
S Henikoff
S Shukla
SL Ginn
Smriti
SS Hannenhalli
SS Pao
T Cover
W Pirovano
WS Valdar
Y Huang
Y Yin
Publication venue: Public Library of Science
Publication date: 01/01/2009

CaMdr1p is a multidrug MFS transporter of pathogenic Candida albicans. Over-expression of the gene encoding this protein is linked to clinically encountered azole resistance. In-depth knowledge of the structure and function of CaMdr1p is necessary for an effective design of modulators or inhibitors of this efflux transporter. Towards this goal, in this study, we have employed a membrane environment based computational approach to predict the functionally critical residues of CaMdr1p. For this, information theoretic scores which are variants of Relative Entropy (Modified Relative Entropy, REM) were calculated from a Multiple Sequence Alignment (MSA) by separately considering distinct physico-chemical properties of transmembrane (TM) and inter-TM regions. The residues of CaMdr1p with high REM, which were predicted to be significantly important, were subjected to site-directed mutational analysis. Interestingly, the heterologous host Saccharomyces cerevisiae, over-expressing these mutant variants of CaMdr1p wherein these high REM residues were replaced by either alanine or leucine, demonstrated increased susceptibility to tested drugs. The hypersensitivity to drugs was supported by abrogated substrate efflux mediated by mutant variant proteins and was not attributed to their poor expression or surface localization. Additionally, by employing a distance plot from a 3D deduced model of CaMdr1p, we could also predict the role of these functionally critical residues in maintaining apparent inter-helical interactions to provide the desired fold for the proper functioning of CaMdr1p. Residues predicted to be critical for function across the family were also found to be vital from other previously published studies, implying its wider application to other membrane protein families.
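As a rough illustration of the kind of score this abstract builds on, the sketch below computes the plain relative entropy (Kullback-Leibler divergence) of one alignment column against a background amino-acid distribution. It is not the Modified Relative Entropy (REM) of the paper, which additionally treats TM and inter-TM regions separately; the function name, the uniform background, and the example columns are assumptions made for this sketch.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def column_relative_entropy(column, background, floor=1e-6):
    """Relative entropy (bits) of an MSA column versus a background.

    `column` is a string of one-letter residues; gap characters ('-')
    are ignored. `background` maps residue -> background frequency.
    """
    residues = [aa for aa in column if aa != "-"]
    if not residues:
        raise ValueError("column contains only gaps")
    counts = Counter(residues)
    total = len(residues)
    score = 0.0
    for aa, n in counts.items():
        p = n / total
        # Guard against residues missing from the background table.
        q = background.get(aa, floor)
        score += p * math.log2(p / q)
    return score


# Assumed uniform background; real analyses would use region-specific
# frequencies (for example, separate tables for TM and inter-TM positions).
uniform = {aa: 1.0 / len(AMINO_ACIDS) for aa in AMINO_ACIDS}
conserved = column_relative_entropy("LLLL", uniform)
variable = column_relative_entropy("LA-V", uniform)
print(conserved > variable)
```

Columns that deviate strongly from the background, such as fully conserved positions, receive higher scores, which is why high-scoring residues are natural candidates for mutational analysis.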

Computational Design of a PDZ Domain Peptide Inhibitor that Rescues CFTR Activity

Author: A Leaver-Fay
A Piserchio
A Taddei
AJW te Velthuis
AR Leach
B Brannetti
B Kuhlman
BD Allen
BI Dahiyat
BR Brooks
BR Donald
Bruce R. Donald
C Chen
C Lee
C Yanover
CA Smith
CL Kingsford
D Saro
DA Case
DB Gordon
Dean R. Madden
DM Cholon
DN Sheppard
DT Jones
E Althaus
E Bruscia
E Hong
E Kim
FV Goor
Giorgio Colombo
GK Hom
H Kamisetty
HM Sampson
I Georgiev
IN Berezovsky
J Cheng
J Desmet
J Janin
J Reina
J Thomas
J Zhang
JM Word
JR Desjarlais
JW Ponder
KA Reynolds
KM Frey
Kyle E. Roberts
L Vouilleme
LA Joachimiak
M Dayhoff
M Fromer
M Gilson
M Wolde
MD Altman
MJ Gorczynski
N Pedemonte
P Gainza
P Humbert
P Koehl
Patrick R. Cushing
PR Cushing
Prisca Boisguerin
R Goldstein
RL Dunbrack
SC Lovell
SJ Weiner
SM Lippow
SM Rowe
T Lazaridis
T Ma
U Wiedemann
WB Guggino
X Jiang
Y Li
Publication venue: Public Library of Science
Publication date: 01/01/2012

The cystic fibrosis transmembrane conductance regulator (CFTR) is an epithelial chloride channel mutated in patients with cystic fibrosis (CF). The most prevalent CFTR mutation, ΔF508, blocks folding in the endoplasmic reticulum. Recent work has shown that some ΔF508-CFTR channel activity can be recovered by pharmaceutical modulators (“potentiators” and “correctors”), but ΔF508-CFTR can still be rapidly degraded via a lysosomal pathway involving the CFTR-associated ligand (CAL), which binds CFTR via a PDZ interaction domain. We present a study that goes from theory, to new structure-based computational design algorithms, to computational predictions, to biochemical testing and ultimately to epithelial-cell validation of novel, effective CAL PDZ inhibitors (called “stabilizers”) that rescue ΔF508-CFTR activity. To design the “stabilizers”, we extended our structural ensemble-based computational protein redesign algorithm to encompass protein-protein and protein-peptide interactions. The computational predictions achieved high accuracy: all of the top-predicted peptide inhibitors bound well to CAL. Furthermore, when compared to state-of-the-art CAL inhibitors, our design methodology achieved higher affinity and increased binding efficiency. The designed inhibitor with the highest affinity for CAL (kCAL01) binds six-fold more tightly than the previous best hexamer (iCAL35), and 170-fold more tightly than the CFTR C-terminus. We show that kCAL01 has physiological activity and can rescue chloride efflux in CF patient-derived airway epithelial cells. Since stabilizers address a different cellular CF defect from potentiators and correctors, our inhibitors provide an additional therapeutic pathway that can be used in conjunction with current methods
