Predicting the Tolerated Sequences for Proteins and Protein Interfaces Using RosettaBackrub Flexible Backbone Design

Authors: A Ernst; A Leaver-Fay; AE Sauer-Eriksson; B Kuhlman; CA Rohl; CA Smith; CA Voigt; Colin A. Smith; CT Saunders; DJ Mandell; DM Fowler; EL Humphris; F Ding; G Fuh; G Pál; GD Friedland; GP Smith; HL Schmidt; I Georgiev; IW Davis; JD Bloom; JD Kotz; JJ Havranek; JR Desjarlais; KM Frey; MD Distefano; N Metropolis; N Ollikainen; N Pokala; NJ Marini; PB Harbury; R Tonikian; RL Dunbrack; RP Laura; SM Larson; T Clackson; T Kortemme; Tanja Kortemme; TP Treynor; Vladimir N. Uversky; X Fu; X Hu; XI Ambroggio

Publication date: 18 July 2011
Publisher: Public Library of Science

Abstract

Predicting the set of sequences that are tolerated by a protein or protein interface, while maintaining a desired function, is useful for characterizing protein interaction specificity and for computationally designing sequence libraries to engineer proteins with new functions. Here we provide a general method, a detailed set of protocols, and several benchmarks and analyses for estimating tolerated sequences using flexible backbone protein design implemented in the Rosetta molecular modeling software suite. The input to the method is at least one experimentally determined three-dimensional protein structure or high-quality model. The starting structure(s) are expanded or refined into a conformational ensemble using Monte Carlo simulations consisting of backrub backbone and side chain moves in Rosetta. The method then uses a combination of simulated annealing and genetic algorithm optimization methods to enrich for low-energy sequences for the individual members of the ensemble. To emphasize certain functional requirements (e.g., forming a binding interface), interactions between and within parts of the structure (e.g., domains) can be reweighted in the scoring function. Results from each backbone structure are merged together to create a single estimate for the tolerated sequence space. We provide an extensive description of the protocol and its parameters, all source code, example analysis scripts, and three tests applying this method to finding sequences predicted to stabilize proteins or protein interfaces. The generality of this method makes many other applications possible, for example, stabilizing interactions with small molecules, DNA, or RNA. Through the use of within-domain reweighting and/or multistate design, it may also be possible to use this method to find sequences that stabilize particular protein conformations or binding interactions over others.
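
The merging step described in the abstract, in which results from each backbone structure are combined into a single estimate of the tolerated sequence space, can be illustrated with a minimal Python sketch. This is not the Rosetta implementation or the authors' analysis script. The function names (`backbone_profile`, `merge_profiles`), the Boltzmann-style weighting of sequences by score, the `kt` parameter, the equal weighting of backbones, and the toy sequences and scores are all assumptions made for illustration.

```python
import math
from collections import defaultdict

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def backbone_profile(scored_sequences, kt=1.0):
    """Per-position amino acid frequencies for one backbone (hypothetical helper).

    scored_sequences: list of (sequence, score) pairs, lower score = better.
    Each sequence is weighted by exp(-(score - best) / kt); this weighting
    scheme and the kt value are assumptions, not taken from the abstract.
    """
    best = min(score for _, score in scored_sequences)
    length = len(scored_sequences[0][0])
    counts = [defaultdict(float) for _ in range(length)]
    total = 0.0
    for seq, score in scored_sequences:
        weight = math.exp(-(score - best) / kt)
        total += weight
        for pos, aa in enumerate(seq):
            counts[pos][aa] += weight
    return [{aa: c[aa] / total for aa in AMINO_ACIDS} for c in counts]


def merge_profiles(profiles):
    """Average per-backbone profiles into one profile (equal backbone weights assumed)."""
    n = len(profiles)
    length = len(profiles[0])
    return [
        {aa: sum(p[pos][aa] for p in profiles) / n for aa in AMINO_ACIDS}
        for pos in range(length)
    ]


# Toy data: two backbones, two designed positions, made-up scores.
ensemble = {
    "backbone_1": [("AL", -10.0), ("AV", -9.0)],
    "backbone_2": [("GL", -8.0), ("AL", -8.0)],
}
profile = merge_profiles([backbone_profile(s) for s in ensemble.values()])
```

Each entry of `profile` is a dictionary of amino acid frequencies for one designed position. Averaging normalized per-backbone profiles keeps a backbone that yields many sequences from dominating the merged estimate, which is one possible design choice among several.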
