9 research outputs found
How structure defines affinity in protein-protein interactions.
Protein-protein interactions (PPI) in nature are conveyed by a multitude of binding modes involving various surfaces, secondary structure elements and intermolecular interactions. This diversity results in PPI binding affinities that span more than nine orders of magnitude. Several early studies attempted to correlate PPI binding affinities to various structure-derived features with limited success. The growing number of high-resolution structures, the appearance of more precise methods for measuring binding affinities and the development of new computational algorithms enable more thorough investigations in this direction. Here, we use a large dataset of PPI structures with the documented binding affinities to calculate a number of structure-based features that could potentially define binding energetics. We explore how well each calculated biophysical feature alone correlates with binding affinity and determine the features that could be used to distinguish between high-, medium- and low- affinity PPIs. Furthermore, we test how various combinations of features could be applied to predict binding affinity and observe a slow improvement in correlation as more features are incorporated into the equation. In addition, we observe a considerable improvement in predictions if we exclude from our analysis low-resolution and NMR structures, revealing the importance of capturing exact intermolecular interactions in our calculations. Our analysis should facilitate prediction of new interactions on the genome scale, better characterization of signaling networks and design of novel binding partners for various target proteins
Improvement in R-value for high-resolution structures.
<p>Barplot displaying correlation (R-value) between different biophysical features and K<sub>d</sub> when using only high-resolution structures (red bars) and all structures (grey bars).</p
Dependence of Kd on various single biophysical features.
<p>(A) Change in the accessible interface surface area (ASA); (B) ΔASA normalized to the total interface area; (C) percent of non-polar change in the accessible surface area; (D) the total number of interfacial H bonds, (E) the number of intermolecular interfacial H bonds, (F) the number of intra-molecular H bonds; (G) Van der Waals energy; (H) volume of cavities; (I) number of hotspots; (J) electrostatic columbic energy; (K) RMSD between bound and unbound structures for interface Cαs; (L) percentage of rotamers that do not change conformation upon binding. Each point represents one PDB file in the database and the line corresponds to a linear fit to all data points in the database.</p
Incorporating more features in the prediction improves correlation with K<sub>d</sub> and ROC analysis.
<p>The best possible weights were obtained to combine the features into one equation using a linear fit to the experimental data. X-axis shows the number of features used to predict K<sub>d</sub> and to discriminate between the two groups. Y-axes shows the best value obtained for each number of features used in the equation. The analysis was performed on all structures in the database (filled circles) and on high-resolution structures only (red stars). (A) AUC were evaluated on high- vs low-affinity (red), medium- vs low-affinity (green) and medium- vs high-affinity (blue) PPIs (B) Pearson's correlation coefficient for all dataset (filled circles) and for high-resolution structures only (red stars).</p
Amino acid interface propensities.
<p>(A) Amino acid propensities to be in an interface compared to protein surface calculated according to <a href="http://www.plosone.org/article/info:doi/10.1371/journal.pone.0110085#pone.0110085-Jones1" target="_blank">[5]</a> (B) Amino acid propensities for high-affinity (black) and low-affinity (grey) complexes.</p
Receiver Operator Characteristic Analysis.
<p>The graph shows the true positive rate vs. false positive rate in discriminating high- from low-affinity PPIs (red line), medium- from low-affinity PPIs (green line) and high- from medium-affinity PPIs (blue line) for each feature. Each point represents a particular cut-off value used to discriminate between the two groups. Features included in the figure are (A) ΔASA, (B) ΔASA/ASA, (C) Van der Waals energy, (D) the total number of interfacial H bonds, (E) the number of intermolecular interfacial H bonds, (F) the number of intra-molecular H bonds; (G) Percentage of rotamers that do not change conformation upon binding; and (H) the number of hotspots.</p
Alteration of the C-terminal ligand specificity of the Erbin PDZ domain by allosteric mutational effects
Modulation of protein binding specificity is important for basic biology and for applied science. Here we explore how binding specificity is conveyed in PDZ (postsynaptic density protein-95/discs large/zonula occludens-1) domains, small interaction modules that recognize various proteins by binding to an extended C terminus. Our goal was to engineer variants of the Erbin PDZ domain with altered specificity for the most C-terminal position (position 0) where a Val is strongly preferred by the wild-type domain. We constructed a library of PDZ domains by randomizing residues in direct contact with position 0 and in a loop that is close to but does not contact position 0. We used phage display to select for PDZ variants that bind to 19 peptide ligands differing only at position 0. To verify that each obtained PDZ domain exhibited the correct binding specificity, we selected peptide ligands for each domain. Despite intensive efforts, we were only able to evolve Erbin PDZ domain variants with selectivity for the aliphatic C-terminal side chains Val, Ile and Leu. Interestingly, many PDZ domains with these three distinct specificities contained identical amino acids at positions that directly contact position 0 but differed in the loop that does not contact position 0. Computational modeling of the selected PDZ domains shows how slight conformational changes in the loop region propagate to the binding site and result in different binding specificities. Our results demonstrate that second-sphere residues could be crucial in determining protein binding specificity