
Public Library of Science (PLOS)

FigShare

Protein Design Using Continuous Rotamers

Author: AE Eriksson
AR Leach
B Kuhlman
BI Dahiyat
BR Donald
Bruce R. Donald
C Chen
C Wang
DA Pearlman
DB Gordon
DJ Huggins
G Wang
I Georgiev
J Desmet
J Word
JM Word
JT Kellis Jr
K Raha
KE Roberts
KM Frey
KW Kaufmann
Kyle E. Roberts
L Jiang
MJ Gorczynski
NA Pierce
Pablo Gainza
R Abagyan
R Goldstein
R Lilien
RH Lilien
S Henikoff
S Hubbard
Sarah A. Teichmann
SC Lovell
SM Lippow
T Harder
T Kortemme
T Lazaridis
VB Chen
W Sheffler
X Hu
Y Dehouck
Publication venue: Public Library of Science
Publication date: 01/01/2012

Optimizing amino acid conformation and identity is a central problem in computational protein design. Protein design algorithms must allow realistic protein flexibility during this optimization, or they may fail to find the lowest-energy sequence. Most design algorithms implement side-chain flexibility by allowing the side chains to move between a small set of discrete, low-energy states, which we call rigid rotamers. In this work we show that allowing continuous side-chain flexibility (which we call continuous rotamers) greatly improves protein flexibility modeling. We present a large-scale study that compares the sequences and best energy conformations in 69 protein-core redesigns using a rigid-rotamer model versus a continuous-rotamer model. We show that in nearly all of our redesigns the sequence found by the continuous-rotamer model is different from, and has a lower energy than, the one found by the rigid-rotamer model. Moreover, the sequences found by the continuous-rotamer model are more similar to the native sequences. We then show that the seemingly easy solution of sampling more rigid rotamers within the continuous region is not a practical alternative to a continuous-rotamer model: at computationally feasible resolutions, using more rigid rotamers was never better than a continuous-rotamer model and almost always resulted in higher energies. Finally, we present a new protein design algorithm based on the dead-end elimination (DEE) algorithm, which we call iMinDEE, that makes the use of continuous rotamers feasible in larger systems. iMinDEE guarantees finding the optimal answer while pruning the search space with nearly the same efficiency as DEE. Availability: Software is available under the GNU Lesser General Public License v3. Contact the authors for source code.
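The abstract above builds on dead-end elimination. As a rough illustration only, the following sketch shows the classic rigid-rotamer Goldstein pruning criterion, not iMinDEE itself (which works with energy bounds over continuous rotamers). The function names, the energy-table layout, and the toy energies are all assumptions made for this example.

```python
from itertools import product


def goldstein_prune(self_e, pair_e):
    """Return, per position, the set of rotamers that survive Goldstein DEE.

    self_e[i][r]          self energy of rotamer r at position i
    pair_e[(i, j)][r][s]  pairwise energy of rotamer r at i and s at j (i < j)
    """
    n = len(self_e)
    alive = [set(range(len(self_e[i]))) for i in range(n)]

    def pair(i, r, j, s):
        # Look up the pairwise energy regardless of argument order.
        return pair_e[(i, j)][r][s] if i < j else pair_e[(j, i)][s][r]

    changed = True
    while changed:  # repeat until a full pass prunes nothing
        changed = False
        for i in range(n):
            for r in list(alive[i]):
                for t in list(alive[i]):
                    if t == r:
                        continue
                    # Smallest possible energy gap between r and t
                    # over every choice of neighbouring rotamers.
                    gap = self_e[i][r] - self_e[i][t]
                    for j in range(n):
                        if j != i:
                            gap += min(pair(i, r, j, s) - pair(i, t, j, s)
                                       for s in alive[j])
                    if gap > 0:  # t always beats r, so r is a dead end
                        alive[i].discard(r)
                        changed = True
                        break
    return alive


def brute_force_gmec(self_e, pair_e):
    """Exhaustively find the lowest-energy assignment (for checking only)."""
    n = len(self_e)
    best = None
    for conf in product(*(range(len(e)) for e in self_e)):
        energy = sum(self_e[i][conf[i]] for i in range(n))
        energy += sum(pair_e[(i, j)][conf[i]][conf[j]]
                      for i in range(n) for j in range(i + 1, n))
        if best is None or energy < best[0]:
            best = (energy, conf)
    return best


# Toy two-position problem with made-up energies.
toy_self = [[0.0, 5.0], [0.0, 1.0]]
toy_pair = {(0, 1): [[0.0, 0.0], [1.0, 1.0]]}
print(goldstein_prune(toy_self, toy_pair))
print(brute_force_gmec(toy_self, toy_pair))
```

Because a rotamer is pruned only when another rotamer at the same position is strictly better in every context, the global minimum energy conformation always survives pruning in this rigid setting.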

CiteSeerX

FigShare

Cost Function Networks to Solve Large Computational Protein Design Problems

Author: Allouche David
Barbe Sophie
De Givry Simon
Katsirelos George
Lebbah Yahia
Loudni Samir
Ouali Abdelkader
Schiex Thomas
Simoncini David
Zytnicki Matthias
Publication venue: Springer Fachmedien Wiesbaden GmbH
Publication date: 01/01/2019


A Generic Program for Multistate Protein Design

Some protein design tasks cannot be modeled by the traditional single-state design strategy of finding a sequence that is optimal for a single fixed backbone. Such cases require multistate design, where a single sequence is threaded onto multiple backbones (states) and evaluated for its strengths and weaknesses on each backbone. For example, to design a protein that can switch between two specific conformations, it is necessary to find a sequence that is compatible with both backbone conformations. We present in this paper a generic implementation of multistate design that is suited for a wide range of protein design tasks and demonstrate in silico its capabilities on two design tasks: one of redesigning an obligate homodimer into an obligate heterodimer such that the new monomers would not homodimerize, and one of redesigning a promiscuous interface to bind to only a single partner and to no longer bind the rest of its partners. Both tasks contained negative design in that multistate design was asked to find sequences that would produce high energies for several of the states being modeled. Success at negative design was assessed by computationally redocking the undesired protein-pair interactions; we found that multistate design's accuracy improved as the diversity of conformations for the undesired protein-pair interactions increased. The paper concludes with a discussion of the pitfalls of negative design, which has proven considerably more challenging than positive design.
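The idea of scoring one sequence against several states, some desired and some undesired, can be sketched with a toy fitness function. This is a minimal illustration under strong assumptions, not the paper's implementation: each state is reduced to a hypothetical per-position energy table, and the fitness is simply the summed energy on the desired states minus the summed energy on the undesired states.

```python
from itertools import product


def state_energy(seq, table):
    """Energy of a sequence threaded onto one state (toy additive model)."""
    return sum(table[pos][aa] for pos, aa in enumerate(seq))


def multistate_fitness(seq, positive, negative):
    """Lower is better: low energy on positive states, high on negative ones."""
    wanted = sum(state_energy(seq, t) for t in positive)
    unwanted = sum(state_energy(seq, t) for t in negative)
    return wanted - unwanted


def best_sequence(alphabet, length, positive, negative):
    """Exhaustively search all sequences (only feasible for tiny examples)."""
    return min(product(alphabet, repeat=length),
               key=lambda s: multistate_fitness(s, positive, negative))


# Made-up energy tables: one desired state and one undesired state.
desired = [{"A": -1.0, "L": -2.0}, {"A": -1.0, "L": 0.0}]
undesired = [{"A": 0.0, "L": -3.0}, {"A": 0.0, "L": 0.0}]

print(best_sequence("AL", 2, [desired], []))           # single-state design
print(best_sequence("AL", 2, [desired], [undesired]))  # with negative design
```

In this toy case the single-state optimum also scores well on the undesired state, so adding the negative state changes which sequence is chosen.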

CiteSeerX

Carolina Digital Repository

Algorithm for backrub motions in protein design

Author: B. R. Donald
Conti
D. C. Richardson
D. Keedy
Davis
De Maeyer
Desmet
Dunbrack
Esposito
Georgiev
Goldstein
Gordon
Harbury
Havranek
Hu
I. Georgiev
J. S. Richardson
Jiang
Jin
Korkegian
Kortemme
Kuhlman
Lasters
Leach
Lilien
Looger
Lovell
Malakauskas
Mendes
Neuenschwander
Nukaga
Pierce
Ponder
Richardson
Stevens
Street
SU
Vizcarra
Voigt
Yanover
Zanghellini
Publication venue: Oxford University Press

Motivation: The Backrub is a small but kinematically efficient side-chain-coupled local backbone motion frequently observed in atomic-resolution crystal structures of proteins. A backrub shifts the Cα–Cβ orientation of a given side-chain by rigid-body dipeptide rotation plus smaller individual rotations of the two peptides, with virtually no change in the rest of the protein. Backrubs can therefore provide a biophysically realistic model of local backbone flexibility for structure-based protein design. Previously, however, backrub motions were applied via manual interactive model-building, so their incorporation into a protein design algorithm (a simultaneous search over mutation and backbone/side-chain conformation space) was infeasible

Public Library of Science (PLOS)

Predicting the Tolerated Sequences for Proteins and Protein Interfaces Using RosettaBackrub Flexible Backbone Design

Author: A Ernst
A Leaver-Fay
AE Sauer-Eriksson
B Kuhlman
CA Rohl
CA Smith
CA Voigt
Colin A. Smith
CT Saunders
DJ Mandell
DM Fowler
EL Humphris
F Ding
G Fuh
G Pál
GD Friedland
GP Smith
HL Schmidt
I Georgiev
IW Davis
JD Bloom
JD Kotz
JJ Havranek
JR Desjarlais
KM Frey
MD Distefano
N Metropolis
N Ollikainen
N Pokala
NJ Marini
PB Harbury
R Tonikian
RL Dunbrack
RP Laura
SM Larson
T Clackson
T Kortemme
Tanja Kortemme
TP Treynor
Vladimir N. Uversky
X Fu
X Hu
XI Ambroggio
Publication venue: Public Library of Science
Publication date: 18/07/2011

Predicting the set of sequences that are tolerated by a protein or protein interface, while maintaining a desired function, is useful for characterizing protein interaction specificity and for computationally designing sequence libraries to engineer proteins with new functions. Here we provide a general method, a detailed set of protocols, and several benchmarks and analyses for estimating tolerated sequences using flexible backbone protein design implemented in the Rosetta molecular modeling software suite. The input to the method is at least one experimentally determined three-dimensional protein structure or high-quality model. The starting structure(s) are expanded or refined into a conformational ensemble using Monte Carlo simulations consisting of backrub backbone and side chain moves in Rosetta. The method then uses a combination of simulated annealing and genetic algorithm optimization methods to enrich for low-energy sequences for the individual members of the ensemble. To emphasize certain functional requirements (e.g. forming a binding interface), interactions between and within parts of the structure (e.g. domains) can be reweighted in the scoring function. Results from each backbone structure are merged together to create a single estimate for the tolerated sequence space. We provide an extensive description of the protocol and its parameters, all source code, example analysis scripts and three tests applying this method to finding sequences predicted to stabilize proteins or protein interfaces. The generality of this method makes many other applications possible, for example stabilizing interactions with small molecules, DNA, or RNA. Through the use of within-domain reweighting and/or multistate design, it may also be possible to use this method to find sequences that stabilize particular protein conformations or binding interactions over others
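The merging step described above (combining results from each backbone into one estimate of tolerated sequence space) might be pictured as building a per-position amino acid frequency profile. The sketch below is an assumption about one simple way to do this and is not the Rosetta protocol; the function name and the example sequences are hypothetical.

```python
from collections import Counter


def merge_tolerated_sequences(per_backbone):
    """Pool sequences from all backbones into per-position frequencies.

    per_backbone: list of lists of equal-length sequence strings,
                  one inner list per ensemble member.
    Returns a list with one {amino_acid: frequency} dict per position.
    """
    seqs = [s for group in per_backbone for s in group]
    if not seqs:
        raise ValueError("no sequences to merge")
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("all sequences must have the same length")

    profile = []
    for i in range(length):
        counts = Counter(s[i] for s in seqs)
        total = sum(counts.values())
        profile.append({aa: n / total for aa, n in counts.items()})
    return profile


# Hypothetical low-energy sequences from two ensemble members.
example = [["AL", "AV"], ["GL", "AL"]]
print(merge_tolerated_sequences(example))
```

Pooling every sequence with equal weight is the simplest choice; a real protocol could weight sequences by energy before merging.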

Scientific Publications of the University of Toulouse II Le Mirail

INRIA a CCSD electronic archive server

HAL-INSA Toulouse

Cooperative Particle Swarm Optimization for Combinatorial Problems

Author: Lapizco Encinas Grecia del Carmen
Publication date: 01/01/2009

A particularly successful line of research for numerical optimization is the well-known computational paradigm particle swarm optimization (PSO). In the PSO framework, candidate solutions are represented as particles that have a position and a velocity in a multidimensional search space. The direct representation of a candidate solution as a point that flies through hyperspace (i.e., R^n) seems to strongly predispose the PSO toward continuous optimization. However, while some attempts have been made towards developing PSO algorithms for combinatorial problems, these techniques usually encode candidate solutions as permutations instead of points in search space and rely on additional local search algorithms. In this dissertation, I present extensions to PSO that, by incorporating a cooperative strategy, allow the PSO to solve combinatorial problems. The central hypothesis is that by allowing a set of particles, rather than one single particle, to represent a candidate solution, combinatorial problems can be solved by collectively constructing solutions. The cooperative strategy partitions the problem into components where each component is optimized by an individual particle. Particles move in continuous space and communicate through a feedback mechanism. This feedback mechanism guides them in the assessment of their individual contribution to the overall solution. Three new PSO-based algorithms are proposed. Shared-space CCPSO and multi-space CCPSO provide two new cooperative strategies to split the combinatorial problem, and both models are tested on proven NP-hard problems. Multimodal CCPSO extends these combinatorial PSO algorithms to efficiently sample the search space in problems with multiple global optima. Shared-space CCPSO was evaluated on an abductive problem-solving task: the construction of a parsimonious set of independent hypotheses in diagnostic problems with direct causal links between disorders and manifestations. Multi-space CCPSO was used to solve a protein structure prediction subproblem, side-chain packing. Both models are evaluated against the provably optimal solutions, and results show that both proposed PSO algorithms are able to find optimal or near-optimal solutions. The exploratory ability of multimodal CCPSO is assessed by evaluating both the quality and diversity of the solutions obtained in a protein sequence design problem, a highly multimodal problem. These results provide evidence that extended PSO algorithms are capable of dealing with combinatorial problems without having to hybridize the PSO with other local search techniques or sacrifice the concept of particles moving throughout a continuous search space.
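For readers unfamiliar with the basic PSO framework mentioned above (particles with a position and a velocity in R^n), here is a minimal sketch of the standard global-best PSO for continuous minimization. It is not the cooperative CCPSO variants proposed in the dissertation. The parameter values (inertia 0.7, acceleration coefficients 1.5) and the sphere test function are assumptions chosen for illustration.

```python
import random


def pso_minimize(f, dim, bounds, n_particles=30, iters=200,
                 w=0.7, c1=1.5, c2=1.5, seed=0):
    """Minimize f over R^dim with a plain global-best particle swarm."""
    rng = random.Random(seed)
    lo, hi = bounds

    # Each particle has a position, a velocity, and a personal best.
    pos = [[rng.uniform(lo, hi) for _ in range(dim)]
           for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]
    pbest_val = [f(p) for p in pos]

    g = min(range(n_particles), key=pbest_val.__getitem__)
    gbest, gbest_val = pbest[g][:], pbest_val[g]

    for _ in range(iters):
        for k in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Inertia + pull toward personal best + pull toward swarm best.
                vel[k][d] = (w * vel[k][d]
                             + c1 * r1 * (pbest[k][d] - pos[k][d])
                             + c2 * r2 * (gbest[d] - pos[k][d]))
                pos[k][d] += vel[k][d]
            val = f(pos[k])
            if val < pbest_val[k]:
                pbest[k], pbest_val[k] = pos[k][:], val
                if val < gbest_val:
                    gbest, gbest_val = pos[k][:], val
    return gbest, gbest_val


def sphere(x):
    """Simple convex test function with its minimum of 0 at the origin."""
    return sum(v * v for v in x)


best_x, best_val = pso_minimize(sphere, dim=2, bounds=(-5.0, 5.0))
print(best_x, best_val)
```

The update rule operates directly on real-valued coordinates, which is why, as the abstract notes, plain PSO is naturally suited to continuous problems and needs extension for combinatorial ones.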

Digital Repository at the University of Maryland

ALGORITHMS FOR SELECTING BREAKPOINT LOCATIONS TO OPTIMIZE DIVERSITY IN PROTEIN ENGINEERING BY SITE-DIRECTED PROTEIN RECOMBINATION

Publication venue: World Scientific Pub Co Pte Lt