Search CORE

83 research outputs found

Exploring Protein-Protein Interactions as Drug Targets for Anti-cancer Therapy with In Silico Workflows

Author: A Goncearenco
A Goncearenco
A Marchler-Bauer
A Truszkowski
AA Bogan
B Graves
B Ma
BA Shoemaker
BA Shoemaker
BA Shoemaker
BJ Smith
CA Goble
CM Yates
D Petrey
E Cukuroglu
FP Davis
H Perez-Sanchez
HS Haase
J Bhagat
J Cinatl
JA Wells
K Wolstencroft
M Guharoy
M Li
M Li
M Li
M Petukh
M Tyagi
MK Gilson
MP Mazanetz
N Estrada-Ortiz
P Aloy
P Aloy
P Filippakopoulos
R Mosca
RR Thangudu
S Beisken
S Kim
S Shangary
S Teng
T Rolland
W Yang
WS Valdar
Y Wang
Y Zhao
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

We describe a computational protocol to aid the design of small molecule and peptide drugs that target protein-protein interactions, particularly for anti-cancer therapy. To achieve this goal, we explore multiple strategies, including finding binding hot spots, incorporating chemical similarity and bioactivity data, and sampling similar binding sites from homologous protein complexes. We demonstrate how to combine existing interdisciplinary resources with examples of semi-automated workflows. Finally, we discuss several major problems, including the occurrence of drug-resistant mutations, drug promiscuity, and the design of dual-effect inhibitors.Fil: Goncearenco, Alexander. National Institutes of Health; Estados UnidosFil: Li, Minghui. Soochow University; China. National Institutes of Health; Estados UnidosFil: Simonetti, Franco Lucio. Consejo Nacional de Investigaciones Científicas y Técnicas. Oficina de Coordinación Administrativa Parque Centenario. Instituto de Investigaciones Bioquímicas de Buenos Aires. Fundación Instituto Leloir. Instituto de Investigaciones Bioquímicas de Buenos Aires; ArgentinaFil: Shoemaker, Benjamin A. National Institutes of Health; Estados UnidosFil: Panchenko, Anna R. National Institutes of Health; Estados Unido

Crossref

CONICET Digital

Functional Diversity and Structural Disorder in the Human Ubiquitination Pathway

Author: A Arrigoni
A Hershko
A Mohan
AE Deffenbaugh
B Bothner
B Hao
B Hess
C Guda
C Haynes
CJ Cox
CM Pickart
D Ekman
D Szklarczyk
D Van Der Spoel
D Vuzman
DM Duda
DM Duda
DW Leung
ES Zimmerman
ET Powers
F Bernassola
F Tama
F. Gisou van der Goot
FU Hartl
G Bussi
G Moncalian
G Swaminathan
G Wu
H Daub
H Dinkel
H Dou
H Shimizu
H Zhu
HC van Leeuwen
HE Niessen
HJ Dyson
HM Berman
I Szymkiewicz
I von Ossowski
J Liu
J Ma
J Prilusky
JC Rosenbaum
JH Fong
JM Huibregtse
K Peng
K Suhre
L Aravind
L Yang
LM Iakoucheva
M Christen
M Cormont
M Fuxreiter
M Fuxreiter
M Guharoy
M Hochstrasser
M Kanehisa
M Kanehisa
M Kanehisa
M Vidal
M Zhao
MA McCoy
Mainak Guharoy
MD Petroski
MD Petroski
ME Sowa
N Foray
N Mathias
N Zheng
P Tompa
P Tompa
P Tompa
P Tompa
P Tompa
Pallab Bhowmick
PE Ryan
PE Wright
Peter Tompa
PW Hildebrand
R Kiss
RG Smock
Rita Pancsa
RJ Deshaies
S Fishbain
S Vucetic
S Wiesner
SC Kales
SJ Demarest
SJ van Wijk
T Hagai
T Inobe
T Mittag
T Ravid
U Emekli
VN Uversky
W Humphrey
W Li
WY Mark
X Zhong
Y Haupt
Y Sheng
Y Xie
Z Dosztanyi
Z Dosztanyi
Z Dosztanyi
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013
Field of study

The ubiquitin-proteasome system plays a central role in cellular regulation and protein quality control (PQC). The system is built as a pyramid of increasing complexity, with two E1 (ubiquitin activating), few dozen E2 (ubiquitin conjugating) and several hundred E3 (ubiquitin ligase) enzymes. By collecting and analyzing E3 sequences from the KEGG BRITE database and literature, we assembled a coherent dataset of 563 human E3s and analyzed their various physical features. We found an increase in structural disorder of the system with multiple disorder predictors (IUPred - E1: 5.97%, E2: 17.74%, E3: 20.03%). E3s that can bind E2 and substrate simultaneously (single subunit E3, ssE3) have significantly higher disorder (22.98%) than E3s in which E2 binding (multi RING-finger, mRF, 0.62%), scaffolding (6.01%) and substrate binding (adaptor/substrate recognition subunits, 17.33%) functions are separated. In ssE3s, the disorder was localized in the substrate/adaptor binding domains, whereas the E2-binding RING/HECT-domains were structured. To demonstrate the involvement of disorder in E3 function, we applied normal modes and molecular dynamics analyses to show how a disordered and highly flexible linker in human CBL (an E3 that acts as a regulator of several tyrosine kinase-mediated signalling pathways) facilitates long-range conformational changes bringing substrate and E2-binding domains towards each other and thus assisting in ubiquitin transfer. E3s with multiple interaction partners (as evidenced by data in STRING) also possess elevated levels of disorder (hubs, 22.90% vs. non-hubs, 18.36%). Furthermore, a search in PDB uncovered 21 distinct human E3 interactions, in 7 of which the disordered region of E3s undergoes induced folding (or mutual induced folding) in the presence of the partner. In conclusion, our data highlights the primary role of structural disorder in the functions of E3 ligases that manifests itself in the substrate/adaptor binding functions as well as the mechanism of ubiquitin transfer by long-range conformational transitions. © 2013 Bhowmick et al

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Repository of the Academy's Library

FigShare

Simplified Method to Predict Mutual Interactions of Human Transcription Factors Based on Their Primary Structure

Author: A Ben Hur
A Ceol
A Ramani
A Remenyi
A van Dijk
A Varshavsky
B Aranda
B Breitkreutz
B Lemon
Boris Jankovic
C Camacho
C Chen
D Caffrey
D GuhaThakurta
E Wingender
F Browne
GJ McLachlan
H Almuallim
I Donaldson
I Guyon
J Bock
J Capra
J Espadaler
J Hoskins
J Shen
J Wang
JJ Chung
JM Vaquerizas
Joaquín Dopazo
L Matthews
L Yu
M Guharoy
M Guharoy
M Kato
M McDowall
N Banerjee
P Aloy
P Aloy
R Hoffmann
R Jansen
S Hannenhalli
S Kawashima
S Lee
S Lo
S Orchard
S Pitre
S Teichmann
Sebastian Schmeier
T Dandekar
T Lee
T Ravasi
U Ogmen
V Matys
VB Bajić
Vladimir B. Bajic
W Kim
W Valdar
X Chen
X Li
X Wu
X Yu
X Yu
Y Guo
Z Hu
Z Zhu
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Background: Physical interactions between transcription factors (TFs) are necessary for forming regulatory protein complexes and thus play a crucial role in gene regulation. Currently, knowledge about the mechanisms of these TF interactions is incomplete and the number of known TF interactions is limited. Computational prediction of such interactions can help identify potential new TF interactions as well as contribute to better understanding the complex machinery involved in gene regulation. Methodology: We propose here such a method for the prediction of TF interactions. The method uses only the primary sequence information of the interacting TFs, resulting in a much greater simplicity of the prediction algorithm. Through an advanced feature selection process, we determined a subset of 97 model features that constitute the optimized model in the subset we considered. The model, based on quadratic discriminant analysis, achieves a prediction accuracy of 85.39 % on a blind set of interactions. This result is achieved despite the selection for the negative data set of only those TF from the same type of proteins, i.e. TFs that function in the same cellular compartment (nucleus) and in the same type of molecular process (transcription initiation). Such selection poses significant challenges for developing models with high specificity, but at the same time better reflects real-world problems. Conclusions: The performance of our predictor compares well to those of much more complex approaches for predicting TF and general protein-protein interactions, particularly when taking the reduced complexity of model utilisation into account

CiteSeerX

Public Library of Science (PLOS)

Crossref

PubMed Central

Solution Structure of the Iron−Sulfur Cluster Cochaperone HscB and Its Binding Surface for the Iron−Sulfur Assembly Scaffold Protein IscU†‡

Author: Agar J. N.
Agar J. N.
Andrew A. J.
Bax A.
Beinert H.
Bertini I.
Bogan A. A.
Bonomi F.
Caffrey D. R.
Chandramouli K.
Cheng H.
Chenna R.
Cornilescu G.
Cornilescu G.
Cupp-Vickery J. R.
Delaglio F.
Dutkiewicz R.
Farmer B. T.
Farrow N. A.
Guharoy M.
Hill R. B.
Hoff K. G.
Hoff K. G.
Huth J. R.
Jansson M.
Johnson D.
Jones S.
Kay L. E.
Kay L. E.
Kostic M.
Lill R.
Lill R.
Mansy S. S.
Mühlenhoff U.
Nuth M.
Pellecchia M.
Pellecchia M.
Qian Y. Q.
Ramelot T. A.
Sibille N.
Silberg J. J.
Szyperski T.
Tjandra N.
Urbina H. D.
Vickery L. E.
Wu S.-p.
Zhao Q.
Publication venue: American Chemical Society
Publication date: 01/01/2008
Field of study

ABSTRACT: The interaction between IscU and HscB is critical for successful assembly of iron-sulfur clusters. NMR experiments were performed on HscB to investigate which of its residues might be part of the IscU binding surface. Residual dipolar couplings ( 1 DHN and 1 DCRHR) indicated that the crystal structure of HscB [Cupp-Vickery, J. R., and Vickery, L. E. (2000) Crystal structure of Hsc20, a J-type cochaperone from Escherichia coli, J. Mol. Biol. 304, 835-845] faithfully represents its solution state. NMR relaxation rates ( 15 N R1, R2) and 1 H- 15 N heteronuclear NOE values indicated that HscB is rigid along its entire backbone except for three short regions which exhibit flexibility on a fast time scale. Changes in the NMR spectrum of HscB upon addition of IscU mapped to the J-domain/C-domain interface, the interdomain linker, and the C-domain. Sequence conservation is low in the interface and in the linker, and NMR changes observed for these residues likely result from indirect effects of IscU binding. NMR changes observed in the conserved patch of residues in the C-domain (L92, M93, L96, E97, E100, E104, and F153) were suggestive of a direct interaction with IscU. To test this, we replaced several of these residues with alanine and assayed for the ability of HscB to interact with IscU and to stimulate HscA ATPase activity. HscB(L92A,M93A,F153A) and HscB(E97A,E100A,E104A) both showed decreased binding affinity for IscU; the (L92A,M93A,F153A) substitution also strongly perturbed the allosteric interactio

CiteSeerX

Crossref

PubMed Central

Predicting protein-protein interface residues using local surface structural similarity

Author: A Porollo
A Rossi
B Liu
B Ma
BA Shoemaker
C Yan
Drena Dobbs
F Wu
H Chen
H Hwang
H Naveed
H Neuvirth
HM Berman
HX Zhou
I Ezkurdia
J Fernández-Recio
J Janin
J Konc
J Konc
J Yu
JE Dayhoff
JL Chung
JL Chung
JR Bradford
K Henrick
L Bartoli
L Giot
M Guharoy
M Sikić
N Carl
N Carl
N Tuncbag
N Tuncbag
NJ Krogan
P Baldi
P Fariselli
QC Zhang
QC Zhang
R Liu
R Nussinov
RA Jordan
Rafael A Jordan
S Hubbard
S Jones
S Jones
S Li
S Liang
S Qin
SJ de Vries
Vasant Honavar
X Li
Y Murakami
Y Ofran
Yasser EL-Manzalawy
Publication venue: BioMed Central
Publication date: 01/01/2012
Field of study

Abstract Background Identification of the residues in protein-protein interaction sites has a significant impact in problems such as drug discovery. Motivated by the observation that the set of interface residues of a protein tend to be conserved even among remote structural homologs, we introduce <it>PrISE</it>, a family of local structural similarity-based computational methods for predicting protein-protein interface residues. Results We present a novel representation of the surface residues of a protein in the form of structural elements. Each structural element consists of a central residue and its surface neighbors. The <it>PrISE </it>family of interface prediction methods uses a representation of structural elements that captures the atomic composition and accessible surface area of the residues that make up each structural element. Each of the members of the <it>PrISE </it>methods identifies for each structural element in the query protein, a collection of <it>similar </it>structural elements in its repository of structural elements and weights them according to their similarity with the structural element of the query protein. <it>PrISEL </it>relies on the similarity between structural elements (i.e. local structural similarity). <it>PrISEG </it>relies on the similarity between protein surfaces (i.e. general structural similarity). <it>PrISEC</it>, combines local structural similarity and general structural similarity to predict interface residues. These predictors label the central residue of a structural element in a query protein as an interface residue if a weighted majority of the structural elements that are similar to it are interface residues, and as a non-interface residue otherwise. The results of our experiments using three representative benchmark datasets show that the <it>PrISEC </it>outperforms <it>PrISEL </it>and <it>PrISEG</it>; and that <it>PrISEC </it>is highly competitive with state-of-the-art structure-based methods for predicting protein-protein interface residues. Our comparison of <it>PrISEC </it>with <it>PredUs</it>, a recently developed method for predicting interface residues of a query protein based on the known interface residues of its (global) structural homologs, shows that performance superior or comparable to that of <it>PredUs </it>can be obtained using only local surface structural similarity. <it>PrISEC </it>is available as a Web server at <url>http://prise.cs.iastate.edu/</url> Conclusions Local surface structural similarity based methods offer a simple, efficient, and effective approach to predict protein-protein interface residues.</p

Digital Repository @ Iowa State University (ISU)

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Beauty Is in the Eye of the Beholder: Proteins Can Recognize Binding Sites of Homologous Proteins in More than One Way

Understanding the mechanisms of protein–protein interaction is a fundamental problem with many practical applications. The fact that different proteins can bind similar partners suggests that convergently evolved binding interfaces are reused in different complexes. A set of protein complexes composed of non-homologous domains interacting with homologous partners at equivalent binding sites was collected in 2006, offering an opportunity to investigate this point. We considered 433 pairs of protein–protein complexes from the ABAC database (AB and AC binary protein complexes sharing a homologous partner A) and analyzed the extent of physico-chemical similarity at the atomic and residue level at the protein–protein interface. Homologous partners of the complexes were superimposed using Multiprot, and similar atoms at the interface were quantified using a five class grouping scheme and a distance cut-off. We found that the number of interfacial atoms with similar properties is systematically lower in the non-homologous proteins than in the homologous ones. We assessed the significance of the similarity by bootstrapping the atomic properties at the interfaces. We found that the similarity of binding sites is very significant between homologous proteins, as expected, but generally insignificant between the non-homologous proteins that bind to homologous partners. Furthermore, evolutionarily conserved residues are not colocalized within the binding sites of non-homologous proteins. We could only identify a limited number of cases of structural mimicry at the interface, suggesting that this property is less generic than previously thought. Our results support the hypothesis that different proteins can interact with similar partners using alternate strategies, but do not support convergent evolution

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Sequence-based identification of interface residues by an integrative profile combining hydrophobic and evolutionary information

Author: A Porollo
AJ Bordner
B Wang
B Wang
BD Alberts
C Cortes
C Sander
ED Levy
F Glaser
F Pazos
H Chen
H Zhou
HM Berman
HS Wong
I Ezkurdia
I Res
J Chung
J Janin
J Kittler
J Kyte
J Mihel
JC Bezdek
Jinyan Li
JR Bradford
JR Bradford
KS Thorn
L Lo Conte
LI Kuncheva
LK Hansen
M Charton
M Guharoy
M Sikic
N H
P Baldi
P Chakrabarti
P Chen
P Cherepanov
P Cherepanov
P Fariselli
Peng Chen
Q Dong
R Singh
RA Laskowski
RD Pascual-Marqui
RM Kini
RP Bahadur
RP Bahadur
S Jones
S Jones
S Jones
SJ de Vries
T Friedrich
T Kohonen
TA Larsen
TJ Bollenbach
Uni-Prot-Consortium
V Chelliah
W Kauzmann
X Du
X Gallet
XW Chen
Y Murakami
Y Ofran
Y Ofran
Y Ofran
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Protein-protein interactions play essential roles in protein function determination and drug design. Numerous methods have been proposed to recognize their interaction sites, however, only a small proportion of protein complexes have been successfully resolved due to the high cost. Therefore, it is important to improve the performance for predicting protein interaction sites based on primary sequence alone. Results We propose a new idea to construct an integrative profile for each residue in a protein by combining its hydrophobic and evolutionary information. A support vector machine (SVM) ensemble is then developed, where SVMs train on different pairs of positive (interface sites) and negative (non-interface sites) subsets. The subsets having roughly the same sizes are grouped in the order of accessible surface area change before and after complexation. A self-organizing map (SOM) technique is applied to group similar input vectors to make more accurate the identification of interface residues. An ensemble of ten-SVMs achieves an MCC improvement by around 8% and F1 improvement by around 9% over that of three-SVMs. As expected, SVM ensembles constantly perform better than individual SVMs. In addition, the model by the integrative profiles outperforms that based on the sequence profile or the hydropathy scale alone. As our method uses a small number of features to encode the input vectors, our model is simpler, faster and more accurate than the existing methods. Conclusions The integrative profile by combining hydrophobic and evolutionary information contributes most to the protein-protein interaction prediction. Results show that evolutionary context of residue with respect to hydrophobicity makes better the identification of protein interface residues. In addition, the ensemble of SVM classifiers improves the prediction performance. Availability Datasets and software are available at <url>http://mail.ustc.edu.cn/~bigeagle/BMCBioinfo2010/index.htm</url>.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

OPUS - University of Technology Sydney

PubMed Central

Silica nanoparticles enhance autophagic activity, disturb endothelial cell homeostasis and impair angiogenesis

Author: A Roy
AA Keller
C He
CA Pope 3rd
CH Cho
DR Gold
DR Green
E VandenBerg
H Gerhardt
H Li
HL Liu
J Du
J Duan
J Duan
J Duan
J Xu
JO Pyo
Junchao Duan
JY Lee
L Matassoni
L Murr
L Sun
L Tian
M Di Gioacchino
M Ehrenberg
M Guharoy
M Inoue
M Kotecki
MI Khan
N Ferrara
N Mizushima
O Attar-Schneider
P Dromparis
P Kumar
Peili Huang
R Tabibiazar
R-J Teng
RD Brook
S Hackenberg
S Hussain
S Mostowy
S Wakui
Shuangqing Peng
SM Dudek
ST Stern
T Lee JE
United States Government Accountability Office
W Martinet
W Shen
X Huang
X Wang
Xianqing Zhou
Y Shao
Y Yu
Y Yu
Yang Li
Yang Yu
Yongbo Yu
Z Li
Z Ungvari
Zhiwei Sun
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Using Shifts in Amino Acid Frequency and Substitution Rate to Identify Latent Structural Characters in Base-Excision Repair Enzymes

Author: A Bateman
A del Sol
A Gutteridge
A Gutteridge
A Marchler-Bauer
A Stamatakis
AB Robertson
AH Elcock
AN Barclay
AR Panchenko
AT Laurie
B Kolaczkowski
B Reva
C Branden
C Notredame
DA Kraut
DE Pumo
DM Standley
DO Zharkov
DO Zharkov
DO Zharkov
DP Brown
DR Caffrey
DT Jones
E Deu
E Hodis
E Martz
E Youn
EA Gaucher
EC Friedberg
F Coste
G Casari
G Golan
G Nimrod
GJ Naylor
H Hirano
I Mihalek
IN Sarkar
J Felsenstein
J Ko
J Pei
JA Capra
JA Capra
JC Fromme
Jeffrey P. Bond
JM Koshi
K Imamura
K Katoh
K Pereira de Jesus
KD Pruitt
KP Peters
KY Kropachev
L Rabow
LA Mirny
LE Limbird
M Clamp
M Guharoy
M Landau
M Rogacheva
M Saparbaev
M Sugahara
MJ Ondrechen
N Galtier
N Miyatake
NV Petrova
O Lichtarge
O Rahat
O Schueler-Furman
OM Sidorkina
OV Kalinina
P Aloy
P Amara
P Lio
P Lopez
P Marttinen
Q Cheng
R Gilboa
R Landgraf
RA George
Ramiro Barrantes-Reynolds
S Ahmad
S Burgess
S Doublie
S Gribaldo
S Henikoff
S Madabushi
S Sankararaman
S Wolfram
SD Kathe
Sebastian D. Fugmann
SF Altschul
SS Hannenhalli
SS Wallace
Susan S. Wallace
SV Kuznetsov
V Bandaru
V Ruano-Rubio
WL Delano
X Gu
X Gu
X Gu
X Gu
X Gu
X Gu
X Gu
Z Yang
Publication venue: Public Library of Science
Publication date: 06/10/2011
Field of study

Protein evolution includes the birth and death of structural motifs. For example, a zinc finger or a salt bridge may be present in some, but not all, members of a protein family. We propose that such transitions are manifest in sequence phylogenies as concerted shifts in substitution rates of amino acids that are neighbors in a representative structure. First, we identified rate shifts in a quartet from the Fpg/Nei family of base excision repair enzymes using a method developed by Xun Gu and coworkers. We found the shifts to be spatially correlated, more precisely, associated with a flexible loop involved in bacterial Fpg substrate specificity. Consistent with our result, sequences and structures provide convincing evidence that this loop plays a very different role in other family members. Second, then, we developed a method for identifying latent protein structural characters (LSC) given a set of homologous sequences based on Gu's method and proximity in a high-resolution structure. Third, we identified LSC and assigned states of LSC to clades within the Fpg/Nei family of base excision repair enzymes. We describe seven LSC; an accompanying Proteopedia page (http://proteopedia.org/wiki/index.php/Fpg_Nei_Protein_Family) describes these in greater detail and facilitates 3D viewing. The LSC we found provided a surprisingly complete picture of the interaction of the protein with the DNA capturing familiar examples, such as a Zn finger, as well as more subtle interactions. Their preponderance is consistent with an important role as phylogenetic characters. Phylogenetic inference based on LSC provided convincing evidence of independent losses of Zn fingers. Structural motifs may serve as important phylogenetic characters and modeling transitions involving structural motifs may provide a much deeper understanding of protein evolution

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Assessment of protein-protein interfaces in cryo-EM derived assemblies

Author: A Kryshtafovych
A Kryshtafovych
A Patwardhan
AA Bogan
AJ McCoy
AP Joseph
AP Joseph
AP Joseph
AP Pandurangan
BA Barad
C Yan
D Guzenko
D Russel
DR Caffrey
E Cukuroglu
E Krissinel
EF Pettersen
F Glaser
FB Sheinerman
G Kuzu
G Pintilie
HM Berman
HR Saibil
I Farabella
IMA Nooren
J Zhang
JM de la Rosa-Trevín
LL Conte
M Gao
M Guharoy
MC Lawrence
MD Winn
MG Prisant
P Chakrabarti
P Emsley
R Chen
R Dintyala
R Norel
RC Edgar
RC Edgar
S Jones
S Malhotra
S Malhotra
S Viswanath
S Xia
SF Altschul
T Burnley
T Pupko
VB Chen
WSJ Valdar
X Bai
Y Ofran
Y Tsuchiya
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2021
Field of study

Structures of macromolecular assemblies derived from cryo-EM maps often contain errors that become more abundant with decreasing resolution. Despite efforts in the cryo-EM community to develop metrics for map and atomistic model validation, thus far, no specific scoring metrics have been applied systematically to assess the interface between the assembly subunits. Here, we comprehensively assessed protein–protein interfaces in macromolecular assemblies derived by cryo-EM. To this end, we developed Protein Interface-score (PI-score), a density-independent machine learning-based metric, trained using the features of protein–protein interfaces in crystal structures. We evaluated 5873 interfaces in 1053 PDB-deposited cryo-EM models (including SARS-CoV-2 complexes), as well as the models submitted to CASP13 cryo-EM targets and the EM model challenge. We further inspected the interfaces associated with low-scores and found that some of those, especially in intermediate-to-low resolution (worse than 4 Å) structures, were not captured by density-based assessment scores. A combined score incorporating PI-score and fit-to-density score showed discriminatory power, allowing our method to provide a powerful complementary assessment tool for the ever-increasing number of complexes solved by cryo-EM

Crossref

Birkbeck Institutional Research Online

ePubs: the open archive for STFC research publications