116 research outputs found
Four small puzzles that Rosetta doesn't solve
A complete macromolecule modeling package must be able to solve the simplest
structure prediction problems. Despite recent successes in high resolution
structure modeling and design, the Rosetta software suite fares poorly on
deceptively small protein and RNA puzzles, some as small as four residues. To
illustrate these problems, this manuscript presents extensive Rosetta results
for four well-defined test cases: the 20-residue mini-protein Trp cage, an even
smaller disulfide-stabilized conotoxin, the reactive loop of a serine protease
inhibitor, and a UUCG RNA tetraloop. In contrast to previous Rosetta studies,
several lines of evidence indicate that conformational sampling is not the
major bottleneck in modeling these small systems. Instead, approximations and
omissions in the Rosetta all-atom energy function currently preclude
discriminating experimentally observed conformations from de novo models at
atomic resolution. These molecular "puzzles" should serve as useful model
systems for developers wishing to make foundational improvements to this
powerful modeling suite.Comment: Published in PLoS One as a manuscript for the RosettaCon 2010 Special
Collectio
Atomic-accuracy prediction of protein loop structures through an RNA-inspired ansatz
Consistently predicting biopolymer structure at atomic resolution from
sequence alone remains a difficult problem, even for small sub-segments of
large proteins. Such loop prediction challenges, which arise frequently in
comparative modeling and protein design, can become intractable as loop lengths
exceed 10 residues and if surrounding side-chain conformations are erased. This
article introduces a modeling strategy based on a 'stepwise ansatz', recently
developed for RNA modeling, which posits that any realistic all-atom molecular
conformation can be built up by residue-by-residue stepwise enumeration. When
harnessed to a dynamic-programming-like recursion in the Rosetta framework, the
resulting stepwise assembly (SWA) protocol enables enumerative sampling of a 12
residue loop at a significant but achievable cost of thousands of CPU-hours. In
a previously established benchmark, SWA recovers crystallographic conformations
with sub-Angstrom accuracy for 19 of 20 loops, compared to 14 of 20 by KIC
modeling with a comparable expenditure of computational power. Furthermore, SWA
gives high accuracy results on an additional set of 15 loops highlighted in the
biological literature for their irregularity or unusual length. Successes
include cis-Pro touch turns, loops that pass through tunnels of other
side-chains, and loops of lengths up to 24 residues. Remaining problem cases
are traced to inaccuracies in the Rosetta all-atom energy function. In five
additional blind tests, SWA achieves sub-Angstrom accuracy models, including
the first such success in a protein/RNA binding interface, the YbxF/kink-turn
interaction in the fourth RNA-puzzle competition. These results establish
all-atom enumeration as a systematic approach to protein structure that can
leverage high performance computing and physically realistic energy functions
to more consistently achieve atomic resolution.Comment: Identity of four-loop blind test protein and parts of figures 5 have
been omitted in this preprint to ensure confidentiality of the protein
structure prior to its public releas
Prediction of native-state hydrogen exchange from perfectly funneled energy landscapes
Simulations based on perfectly funneled energy landscapes often capture many of the kinetic features of protein folding. We examined whether simulations based on funneled energy functions can also describe fluctuations in native-state protein ensembles. We quantitatively compared the site-specific local stability determined from structure-based folding simulations, with hydrogen exchange protection factors measured experimentally for ubiquitin, chymotrypsin inhibitor 2, and staphylococcal nuclease. Different structural definitions for the open and closed states based on the number of native contacts for each residue, as well as the hydrogen-bonding state, or a combination of both criteria were evaluated. The predicted exchange patterns agree with the experiments under native conditions, indicating that protein topology indeed has a dominant effect on the exchange kinetics. Insights into the simplest mechanistic interpretation of the amide exchange process were thus obtained.Fil: Craig, Patricio Oliver. Fundación Instituto Leloir; Argentina. Consejo Nacional de Investigaciones CientÃficas y Técnicas. Oficina de Coordinación Administrativa Parque Centenario. Instituto de Investigaciones Bioquimicas de Buenos Aires; Argentina. University of California San Diego. Department of Chemistry and Biochemistry; Estados UnidosFil: Lätzer, Joachim. Rutgers University. BioMaPS Institute; Estados UnidosFil: Weinkam, Patrick. University of California at San Francisco. Department of Bioengineering and Therapeutic Sciences; Estados UnidosFil: Hoffman, Ryan M. B.. University Of California At San Diego; Estados UnidosFil: Ferreiro, Diego. Universidad de Buenos Aires. Facultad de Ciencias Exactas y Naturales; Argentina. Consejo Nacional de Investigaciones CientÃficas y Técnicas. Oficina de Coordinación Administrativa Ciudad Universitaria. Instituto de QuÃmica Biológica de la Facultad de Ciencias Exactas y Naturales; ArgentinaFil: Komives, Elizabeth A.. University Of California At San Diego; Estados UnidosFil: Wolynes, Peter G.. University Of California At San Diego; Estados Unido
Characterizing Protein-Protein Interactions with the Fragment Molecular Orbital Method
Proteins are vital components of living systems, serving as building blocks, molecular machines, enzymes, receptors, ion channels, sensors, and transporters. Protein-protein interactions (PPIs) are a key part of their function. There are more than 645,000 reported disease-relevant PPIs in the human interactome, but drugs have been developed for only 2% of these targets. The advances in PPI-focused drug discovery are highly dependent on the availability of structural data and accurate computational tools for analysis of this data. Quantum mechanical approaches are often too expensive computationally, but the fragment molecular orbital (FMO) method offers an excellent solution that combines accuracy, speed and the ability to reveal key interactions that would otherwise be hard to detect. FMO provides essential information for PPI drug discovery, namely, identification of key interactions formed between residues of two proteins, including their strength (in kcal/mol) and their chemical nature (electrostatic or hydrophobic). In this chapter, we have demonstrated how three different FMO-based approaches (pair interaction energy analysis (PIE analysis), subsystem analysis (SA) and analysis of protein residue networks (PRNs)) have been applied to study PPI in three protein-protein complexes
New Structural and Functional Contexts of the Dx[DN]xDG Linear Motif: Insights into Evolution of Calcium-Binding Proteins
Binding of calcium ions (Ca2+) to proteins can have profound effects on their structure and function. Common roles of calcium binding include structure stabilization and regulation of activity. It is known that diverse families – EF-hands being one of at least twelve – use a Dx[DN]xDG linear motif to bind calcium in near-identical fashion. Here, four novel structural contexts for the motif are described. Existing experimental data for one of them, a thermophilic archaeal subtilisin, demonstrate for the first time a role for Dx[DN]xDG-bound calcium in protein folding. An integrin-like embedding of the motif in the blade of a β-propeller fold – here named the calcium blade – is discovered in structures of bacterial and fungal proteins. Furthermore, sensitive database searches suggest a common origin for the calcium blade in β-propeller structures of different sizes and a pan-kingdom distribution of these proteins. Factors favouring the multiple convergent evolution of the motif appear to include its general Asp-richness, the regular spacing of the Asp residues and the fact that change of Asp into Gly and vice versa can occur though a single nucleotide change. Among the known structural contexts for the Dx[DN]xDG motif, only the calcium blade and the EF-hand are currently found intracellularly in large numbers, perhaps because the higher extracellular concentration of Ca2+ allows for easier fixing of newly evolved motifs that have acquired useful functions. The analysis presented here will inform ongoing efforts toward prediction of similar calcium-binding motifs from sequence information alone
- …