79 research outputs found

A computational framework to empower probabilistic protein design

Author: Fromer Menachem
Yanover Chen
Publication venue: Oxford University Press
Publication date: 01/01/2008

Motivation: The task of engineering a protein to perform a target biological function is known as protein design. A commonly used paradigm casts this functional design problem as a structural one, assuming a fixed backbone. In probabilistic protein design, positional amino acid probabilities are used to create a random library of sequences to be simultaneously screened for biological activity. Clearly, certain choices of probability distributions will be more successful in yielding functional sequences. However, since the number of sequences is exponential in protein length, computational optimization of the distribution is difficult
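As a rough illustration of the idea in this abstract, the sketch below draws a random sequence library from per-position amino acid probabilities and shows why the sequence space grows exponentially with length. The probability table, the function names, and the assumption that positions are sampled independently are illustrative choices, not details taken from the paper.

```python
import random

# Hypothetical per-position amino acid probabilities for a 3-residue design
# (illustrative values only, not taken from the paper).
POSITION_PROBS = [
    {"A": 0.5, "G": 0.5},
    {"L": 0.7, "I": 0.2, "V": 0.1},
    {"K": 0.6, "R": 0.4},
]


def sample_library(position_probs, n_sequences, seed=0):
    """Draw sequences by sampling each position independently."""
    rng = random.Random(seed)
    library = []
    for _ in range(n_sequences):
        residues = [
            rng.choices(list(p.keys()), weights=list(p.values()))[0]
            for p in position_probs
        ]
        library.append("".join(residues))
    return library


def sequence_probability(seq, position_probs):
    """Probability of one sequence under the independent-position model."""
    prob = 1.0
    for residue, p in zip(seq, position_probs):
        prob *= p.get(residue, 0.0)
    return prob


def sequence_space_size(n_positions, alphabet_size=20):
    """Number of possible sequences, which is exponential in length."""
    return alphabet_size ** n_positions


library = sample_library(POSITION_PROBS, n_sequences=5)
```

With 20 amino acids, `sequence_space_size(100)` is already 20 to the power 100, so the sequences cannot be enumerated directly when optimizing the distribution.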

CiteSeerX

PubMed Central

Inferring PDZ Domain Multi-Mutant Binding Preferences from Single-Mutant Data

Author: A Ceol
A Ernst
BZ Harris
C Nourry
C Vogel
Chen Yanover
E Beitz
E Kim
Elena Zaslavsky
G Stolovitzky
JR Chen
M Venkatarajan
MA Stiffler
Mark Isalan
N Habib
P Beltrao
Philip Bradley
R Tonikian
T Beuming
T Hertz
T Pawson
Publication venue: Public Library of Science
Publication date: 30/09/2010

Many important cellular protein interactions are mediated by peptide recognition domains. The ability to predict a domain's binding specificity directly from its primary sequence is essential to understanding the complexity of protein-protein interaction networks. One such recognition domain is the PDZ domain, functioning in scaffold proteins that facilitate formation of signaling networks. Predicting the PDZ domain's binding specificity was a part of the DREAM4 Peptide Recognition Domain challenge, the goal of which was to describe, as position weight matrices, the specificity profiles of five multi-mutant ERBB2IP-1 domains. We developed a method that derives multi-mutant binding preferences by generalizing the effects of single point mutations on the wild type domain's binding specificities. Our approach, trained on publicly available ERBB2IP-1 single-mutant phage display data, combined linear regression-based prediction for ligand positions whose specificity is determined by few PDZ positions, and single-mutant position weight matrix averaging for all other ligand columns. The success of our method as the winning entry of the DREAM4 competition, as well as its superior performance over a general PDZ-ligand binding model, demonstrates the advantages of training a model on a well-selected domain-specific data set
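The position weight matrix averaging step mentioned in this abstract can be sketched as follows. This is a minimal illustration that assumes each matrix is a list of columns mapping residues to frequencies and that all single-mutant matrices are weighted equally; the data values and the function name are hypothetical, and the regression-based part of the method is not shown.

```python
def average_pwm_columns(pwms):
    """Average several PWMs column by column.

    Each PWM is a list of columns; each column maps residue -> frequency.
    Residues missing from a column are treated as frequency 0.
    """
    n_columns = len(pwms[0])
    averaged = []
    for col in range(n_columns):
        residues = set()
        for pwm in pwms:
            residues.update(pwm[col])
        column = {
            r: sum(pwm[col].get(r, 0.0) for pwm in pwms) / len(pwms)
            for r in sorted(residues)
        }
        averaged.append(column)
    return averaged


# Hypothetical single-mutant PWMs with two ligand columns each.
mutant_a = [{"V": 0.8, "L": 0.2}, {"T": 1.0}]
mutant_b = [{"V": 0.4, "I": 0.6}, {"T": 0.5, "S": 0.5}]

multi_mutant_estimate = average_pwm_columns([mutant_a, mutant_b])
```

Because each input column sums to 1, each averaged column also sums to 1, so the result is still a valid frequency matrix.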

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Ranitidine Use and Incident Cancer in a Multinational Cohort

Author: Duarte-Salles Talita
Falconer Thomas
Hripcsak George
Hsu Jason C.
Hsu Min Huei
Kim Yeesuk
Ko Heejoo
Lee Hang Lak
Nguyen Phung Anh
Park Chan Hyuk
Park Rae Woong
Posada Jose D.
Pratt Nicole
Prieto-Alhambra Daniel
Reich Christin G.
Seager Sarah
Seo Seung In
Shah Nigam H.
Shin Woon Geon
Suchard Marc A.
Van Zandt Mui
Yanover Chen
You Seng Chan
Publication date: 19/09/2023

Importance: Ranitidine, the most widely used histamine-2 receptor antagonist (H2RA), was withdrawn because of N-nitrosodimethylamine impurity in 2020. Given the worldwide exposure to this drug, the potential risk of cancer development associated with the intake of known carcinogens is an important epidemiological concern. Objective: To examine the comparative risk of cancer associated with the use of ranitidine vs other H2RAs. Design, Setting, and Participants: This new-user active comparator international network cohort study was conducted using 3 health claims and 9 electronic health record databases from the US, the United Kingdom, Germany, Spain, France, South Korea, and Taiwan. Large-scale propensity score (PS) matching was used to minimize confounding of the observed covariates with negative control outcomes. Empirical calibration was performed to account for unobserved confounding. All databases were mapped to a common data model. Database-specific estimates were combined using random-effects meta-analysis. Participants included individuals aged at least 20 years with no history of cancer who used H2RAs for more than 30 days from January 1986 to December 2020, with a 1-year washout period. Data were analyzed from April to September 2021. Exposure: The main exposure was use of ranitidine vs other H2RAs (famotidine, lafutidine, nizatidine, and roxatidine). Main Outcomes and Measures: The primary outcome was incidence of any cancer, except nonmelanoma skin cancer. Secondary outcomes included all cancer except thyroid cancer, 16 cancer subtypes, and all-cause mortality. Results: Among 1 183 999 individuals in 11 databases, 909 168 individuals (mean age, 56.1 years; 507 316 [55.8%] women) were identified as new users of ranitidine, and 274 831 individuals (mean age, 58.0 years; 145 935 [53.1%] women) were identified as new users of other H2RAs. 
Crude incidence rates of cancer were 14.30 events per 1000 person-years (PYs) in ranitidine users and 15.03 events per 1000 PYs among other H2RA users. After PS matching, cancer risk was similar in ranitidine compared with other H2RA users (incidence, 15.92 events per 1000 PYs vs 15.65 events per 1000 PYs; calibrated meta-analytic hazard ratio, 1.04; 95% CI, 0.97-1.12). No significant associations were found between ranitidine use and any secondary outcomes after calibration. Conclusions and Relevance: In this cohort study, ranitidine use was not associated with an increased risk of cancer compared with the use of other H2RAs. Further research is needed on the long-term association of ranitidine with cancer development.
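The abstract states that database-specific estimates were combined using random-effects meta-analysis but does not name the estimator. The sketch below assumes the common DerSimonian-Laird method applied to log hazard ratios, with a normal-approximation 95% interval; the function name and inputs are hypothetical, and the study's empirical calibration step is not reproduced.

```python
import math


def random_effects_pool(log_hrs, std_errs):
    """Pool log hazard ratios with a DerSimonian-Laird random-effects model.

    Returns (pooled HR, lower 95% bound, upper 95% bound).
    """
    # Fixed-effect (inverse-variance) weights and estimate.
    w = [1.0 / se ** 2 for se in std_errs]
    fixed = sum(wi * y for wi, y in zip(w, log_hrs)) / sum(w)

    # Cochran's Q and the between-database variance tau^2.
    q = sum(wi * (y - fixed) ** 2 for wi, y in zip(w, log_hrs))
    df = len(log_hrs) - 1
    c = sum(w) - sum(wi ** 2 for wi in w) / sum(w)
    tau2 = max(0.0, (q - df) / c) if c > 0 else 0.0

    # Random-effects weights add tau^2 to each within-database variance.
    w_star = [1.0 / (se ** 2 + tau2) for se in std_errs]
    pooled = sum(wi * y for wi, y in zip(w_star, log_hrs)) / sum(w_star)
    se_pooled = math.sqrt(1.0 / sum(w_star))

    return (
        math.exp(pooled),
        math.exp(pooled - 1.96 * se_pooled),
        math.exp(pooled + 1.96 * se_pooled),
    )
```

When the database estimates agree closely, tau^2 is estimated as zero and the result matches a fixed-effect inverse-variance pool; larger disagreement widens the interval.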

EUR Research Repository

PepDist: A New Framework for Protein-Peptide Binding Prediction based on Learning Peptide Distance Functions

Author: A Bar-Hilel
A Sette
AP Dempster
AY Hung
CA Janeway
Chen Yanover
D Klein
DR Flower
DR Madden
E Xing
H Mamitsuka
HG Rammensee
JW Yewdell
K Gulukota
K Wagstaff
K Yu
M Andersen
M Bhasin
M Bilenko
MS Venkatarajan
N Shental
O Schueler-Furman
P Donnes
PA Reche
RE Schapire
S Buus
T Bailey
T Hertz
Tomer Hertz
U Wiedemann
V Brusic
Publication venue: BioMed Central
Publication date: 01/01/2006

BACKGROUND: Many different aspects of cellular signalling, trafficking and targeting mechanisms are mediated by interactions between proteins and peptides. Representative examples are MHC-peptide complexes in the immune system. Developing computational methods for protein-peptide binding prediction is therefore an important task with applications to vaccine and drug design. METHODS: Previous learning approaches address the binding prediction problem using traditional margin based binary classifiers. In this paper we propose PepDist: a novel approach for predicting binding affinity. Our approach is based on learning peptide-peptide distance functions. Moreover, we suggest to learn a single peptide-peptide distance function over an entire family of proteins (e.g. MHC class I). This distance function can be used to compute the affinity of a novel peptide to any of the proteins in the given family. In order to learn these peptide-peptide distance functions, we formalize the problem as a semi-supervised learning problem with partial information in the form of equivalence constraints. Specifically, we propose to use DistBoost [1,2], which is a semi-supervised distance learning algorithm. RESULTS: We compare our method to various state-of-the-art binding prediction algorithms on MHC class I and MHC class II datasets. In almost all cases, our method outperforms all of its competitors. One of the major advantages of our novel approach is that it can also learn an affinity function over proteins for which only small amounts of labeled peptides exist. In these cases, our method's performance gain, when compared to other computational methods, is even more pronounced. We have recently uploaded the PepDist webserver which provides binding prediction of peptides to 35 different MHC class I alleles. The webserver which can be found at is powered by a prediction engine which was trained using the framework presented in this paper. 
CONCLUSION: The results obtained suggest that learning a single distance function over an entire family of proteins achieves higher prediction accuracy than learning a set of binary classifiers for each of the proteins separately. We also show the importance of obtaining information on experimentally determined non-binders. Learning with real non-binders generalizes better than learning with randomly generated peptides that are assumed to be non-binders. This suggests that information about non-binding peptides should also be published and made publicly available

Crossref

Springer - Publisher Connector

PubMed Central

Extensive protein and DNA backbone sampling improves structure-based specificity prediction for C2H2 zinc fingers

Author: Bateman
Benos
Cahill
Chen Yanover
Chevalier
Cho
Choo
Contreras-Moreira
Crooks
Desjarlais
Dickerson
Donald
Doody
Elrod-Erickson
Emerson
Endres
Fu
Grigoryan
Gromiha
Habib
Havranek
Holbrook
Jamal Rahi
Janin
Joachimiak
Johnson
Jolma
Joshi
Kaplan
Kono
Kortemme
Kuhlman
Lafontaine
Lazaridis
Li
Liu
Maeder
Morozov
O'Flanagan
Olson
Ordiz
Paillard
Persikov
Philip Bradley
Philippakis
Quintana
Reddy
Renda
Rohl
Rohs
Sander
Siggers
Simons
Steffen
Temiz
Wang
Wolfe
Zhu
Publication venue: Oxford University Press

Sequence-specific DNA recognition by gene regulatory proteins is critical for proper cellular functioning. The ability to predict the DNA binding preferences of these regulatory proteins from their amino acid sequence would greatly aid in reconstruction of their regulatory interactions. Structural modeling provides one route to such predictions: by building accurate molecular models of regulatory proteins in complex with candidate binding sites, and estimating their relative binding affinities for these sites using a suitable potential function, it should be possible to construct DNA binding profiles. Here, we present a novel molecular modeling protocol for protein-DNA interfaces that borrows conformational sampling techniques from de novo protein structure prediction to generate a diverse ensemble of structural models from small fragments of related and unrelated protein-DNA complexes. The extensive conformational sampling is coupled with sequence space exploration so that binding preferences for the target protein can be inferred from the resulting optimized DNA sequences. We apply the algorithm to predict binding profiles for a benchmark set of eleven C2H2 zinc finger transcription factors, five of known and six of unknown structure. The predicted profiles are in good agreement with experimental binding data; furthermore, examination of the modeled structures gives insight into observed binding preferences

Crossref

PubMed Central

A computational method for designing diverse linear epitopes including citrullinated peptides with desired binding affinities to intravenous immunoglobulin

Author: Alessandra Tiengo
B Yao
BG Pierce
Bjoern Ziems
C Lundegaard
C Meydan
C Peri
C Yanover
Carl Kingsford
CH Teo
Felix Steinbeck
GL Zhang
Gustavo Stolovitzky
Hans-Jürgen Thiesen
J Chen
J Gao
J Kittler
JE Larsen
Julio Saez-Rodriguez
JV Kringelum
JW Ponder
K Newton
L Huang
L Nanni
LJ Wee
M Hecker
M Luštrek
M Nielsen
M Ojala
Mitja Luštrek
N Barbarini
Nicola Barbarini
NP Boghossian
P Pudil
P Wang
Peter Lorenz
R Chen
Raquel Norel
Riccardo Bellazzi
Rob Patro
Robert J. Prill
S Gupta
S Henikoff
S Kawashima
S Saha
SY-H Lin
V Brusic
W Zhang
X Hu
Y EL-Manzalawy
Y Wang
Y Zhao
Publication venue: Springer Science and Business Media LLC

Crossref

Tradeoff Between Stability and Multispecificity in the Design of Promiscuous Proteins

Author: A Barabasi
A del Sol
A Houdusse
B Kuhlman
BI Dahiyat
BM Beadle
C Dodge
C Yanover
CJ Tsai
CM Kraemer-Pecore
CM Summa
CT Saunders
CY Chen
D Chin
D Reichmann
DB Gordon
DD Boehr
DN Bolon
E Beitz
E Yosef
EL Humphris
F Ding
G Grigoryan
GD Friedland
GE Crooks
Gx Xie
H Jeong
IN Berezovsky
J Gsponer
J Karanicolas
J Mason
JDJ Han
JE Donald
JJ Havranek
JM Shifman
Julia M. Shifman
L Li
Leonid A. Mirny
M Fromer
M Ikura
M Schneider
M Shimaoka
M Zhang
MA Schumacher
Menachem Fromer
N Tokuriki
NA Rosenberg
O Keskin
O Sharabi
P Carbonell
P Pagel
RL Dunbrack
S Kirkpatrick
S Kumar
S Sankararaman
SH Gellman
T Kortemme
U Alon
V Potapov
W Meador
WL Delano
X Fu
X Hu
Z Hu
Publication venue: Public Library of Science
Publication date: 01/12/2009

Natural proteins often partake in several highly specific protein-protein interactions. They are thus subject to multiple opposing forces during evolutionary selection. To be functional, such multispecific proteins need to be stable in complex with each interaction partner, and, at the same time, to maintain affinity toward all partners. How is this multispecificity acquired through natural evolution? To answer this compelling question, we study a prototypical multispecific protein, calmodulin (CaM), which has evolved to interact with hundreds of target proteins. Starting from high-resolution structures of sixteen CaM-target complexes, we employ state-of-the-art computational methods to predict a hundred CaM sequences best suited for interaction with each individual CaM target. Then, we design CaM sequences most compatible with each possible combination of two, three, and all sixteen targets simultaneously, producing almost 70,000 low energy CaM sequences. By comparing these sequences and their energies, we gain insight into how nature has managed to find the compromise between the need for favorable interaction energies and the need for multispecificity. We observe that designing for more partners simultaneously yields CaM sequences that better match natural sequence profiles, thus emphasizing the importance of such strategies in nature. Furthermore, we show that the CaM binding interface can be nicely partitioned into positions that are critical for the affinity of all CaM-target complexes and those that are molded to provide interaction specificity. We reveal several basic categories of sequence-level tradeoffs that enable the compromise necessary for the promiscuity of this protein. We also thoroughly quantify the tradeoff between interaction energetics and multispecificity and find that facilitating seemingly competing interactions requires only a small deviation from optimal energies. 
We conclude that multispecific proteins have been subjected to a rigorous optimization process that has fine-tuned their sequences for interactions with a precise set of targets, thus conferring their multiple cellular functions

Crossref

Directory of Open Access Journals

PubMed Central

Computational Design of a PDZ Domain Peptide Inhibitor that Rescues CFTR Activity

Author: A Leaver-Fay
A Piserchio
A Taddei
AJW te Velthuis
AR Leach
B Brannetti
B Kuhlman
BD Allen
BI Dahiyat
BR Brooks
BR Donald
Bruce R. Donald
C Chen
C Lee
C Yanover
CA Smith
CL Kingsford
D Saro
DA Case
DB Gordon
Dean R. Madden
DM Cholon
DN Sheppard
DT Jones
E Althaus
E Bruscia
E Hong
E Kim
FV Goor
Giorgio Colombo
GK Hom
H Kamisetty
HM Sampson
I Georgiev
IN Berezovsky
J Cheng
J Desmet
J Janin
J Reina
J Thomas
J Zhang
JM Word
JR Desjarlais
JW Ponder
KA Reynolds
KM Frey
Kyle E. Roberts
L Vouilleme
LA Joachimiak
M Dayhoff
M Fromer
M Gilson
M Wolde
MD Altman
MJ Gorczynski
N Pedemonte
P Gainza
P Humbert
P Koehl
Patrick R. Cushing
PR Cushing
Prisca Boisguerin
R Goldstein
RL Dunbrack
SC Lovell
SJ Weiner
SM Lippow
SM Rowe
T Lazaridis
T Ma
U Wiedemann
WB Guggino
X Jiang
Y Li
Publication venue: Public Library of Science
Publication date: 01/01/2012

The cystic fibrosis transmembrane conductance regulator (CFTR) is an epithelial chloride channel mutated in patients with cystic fibrosis (CF). The most prevalent CFTR mutation, ΔF508, blocks folding in the endoplasmic reticulum. Recent work has shown that some ΔF508-CFTR channel activity can be recovered by pharmaceutical modulators (“potentiators” and “correctors”), but ΔF508-CFTR can still be rapidly degraded via a lysosomal pathway involving the CFTR-associated ligand (CAL), which binds CFTR via a PDZ interaction domain. We present a study that goes from theory, to new structure-based computational design algorithms, to computational predictions, to biochemical testing and ultimately to epithelial-cell validation of novel, effective CAL PDZ inhibitors (called “stabilizers”) that rescue ΔF508-CFTR activity. To design the “stabilizers”, we extended our structural ensemble-based computational protein redesign algorithm to encompass protein-protein and protein-peptide interactions. The computational predictions achieved high accuracy: all of the top-predicted peptide inhibitors bound well to CAL. Furthermore, when compared to state-of-the-art CAL inhibitors, our design methodology achieved higher affinity and increased binding efficiency. The designed inhibitor with the highest affinity for CAL (kCAL01) binds six-fold more tightly than the previous best hexamer (iCAL35), and 170-fold more tightly than the CFTR C-terminus. We show that kCAL01 has physiological activity and can rescue chloride efflux in CF patient-derived airway epithelial cells. Since stabilizers address a different cellular CF defect from potentiators and correctors, our inhibitors provide an additional therapeutic pathway that can be used in conjunction with current methods

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare