2,728 research outputs found
Metazoans evolved by taking domains from soluble proteins to expand intercellular communication network.
A central question in animal evolution is how multicellular animals evolved from unicellular ancestors. We hypothesize that membrane proteins must be key players in the development of multicellularity because they are well positioned to form the cell-cell contacts and to provide the intercellular communication required for the creation of complex organisms. Here we find that a major mechanism for the necessary increase in membrane protein complexity in the transition from non-metazoan to metazoan life was the new incorporation of domains from soluble proteins. The membrane proteins that have incorporated soluble domains in metazoans are enriched in many of the functions unique to multicellular organisms such as cell-cell adhesion, signaling, immune defense and developmental processes. They also show enhanced protein-protein interaction (PPI) network complexity and centrality, suggesting an important role in the cellular diversification found in complex organisms. Our results expose an evolutionary mechanism that contributed to the development of higher life forms
Cooperative "folding transition" in the sequence space facilitates function-driven evolution of protein families
In the protein sequence space, natural proteins form clusters of families
which are characterized by their unique native folds whereas the great majority
of random polypeptides are neither clustered nor foldable to unique structures.
Since a given polypeptide can be either foldable or unfoldable, a kind of
"folding transition" is expected at the boundary of a protein family in the
sequence space. By Monte Carlo simulations of a statistical mechanical model of
protein sequence alignment that coherently incorporates both short-range and
long-range interactions as well as variable-length insertions to reproduce the
statistics of the multiple sequence alignment of a given protein family, we
demonstrate the existence of such transition between natural-like sequences and
random sequences in the sequence subspaces for 15 domain families of various
folds. The transition was found to be highly cooperative and two-state-like.
Furthermore, enforcing or suppressing consensus residues on a few of the
well-conserved sites enhanced or diminished, respectively, the natural-like
pattern formation over the entire sequence. In most families, the key sites
included ligand binding sites. These results suggest some selective pressure on
the key residues, such as ligand binding activity, may cooperatively facilitate
the emergence of a protein family during evolution. From a more practical
aspect, the present results highlight an essential role of long-range effects
in precisely defining protein families, which are absent in conventional
sequence models.Comment: 13 pages, 7 figures, 2 tables (a new subsection added
A physical model for PDZ-domain/peptide interactions
The PDZ domain is an interaction motif that recognizes and binds the C-terminal peptides of target proteins. PDZ domains are ubiquitous in nature and help assemble multiprotein complexes that control cellular organization and signaling cascades. We present an optimized energy function to predict the binding free energy (ΔΔG) of PDZ domain/peptide interactions computationally. Geometry-optimized models of PDZ domain/peptide interfaces were built using Rosetta, and protein and peptide side chain and backbone degrees of freedom are minimized simultaneously. Using leave-one-out cross-validation, Rosetta’s energy function is adjusted to reproduce experimentally determined ΔΔG values with a correlation coefficient of 0.66 and a standard deviation of 0.79 kcal mol−1. The energy function places an increased weight on hydrogen bonding interactions when compared to a previously developed method to analyze protein/protein interactions. Binding free enthalpies (ΔΔH) and entropies (ΔS) are predicted with reduced accuracies of R = 0.60 and R = 0.17, respectively. The computational method improves prediction of PDZ domain specificity from sequence and allows design of novel PDZ domain/peptide interactions
DomPep—A General Method for Predicting Modular Domain-Mediated Protein-Protein Interactions
Protein-protein interactions (PPIs) are frequently mediated by the binding of a modular domain in one protein to a short, linear peptide motif in its partner. The advent of proteomic methods such as peptide and protein arrays has led to the accumulation of a wealth of interaction data for modular interaction domains. Although several computational programs have been developed to predict modular domain-mediated PPI events, they are often restricted to a given domain type. We describe DomPep, a method that can potentially be used to predict PPIs mediated by any modular domains. DomPep combines proteomic data with sequence information to achieve high accuracy and high coverage in PPI prediction. Proteomic binding data were employed to determine a simple yet novel parameter Ligand-Binding Similarity which, in turn, is used to calibrate Domain Sequence Identity and Position-Weighted-Matrix distance, two parameters that are used in constructing prediction models. Moreover, DomPep can be used to predict PPIs for both domains with experimental binding data and those without. Using the PDZ and SH2 domain families as test cases, we show that DomPep can predict PPIs with accuracies superior to existing methods. To evaluate DomPep as a discovery tool, we deployed DomPep to identify interactions mediated by three human PDZ domains. Subsequent in-solution binding assays validated the high accuracy of DomPep in predicting authentic PPIs at the proteome scale. Because DomPep makes use of only interaction data and the primary sequence of a domain, it can be readily expanded to include other types of modular domains
Putting into Practice Domain-Linear Motif Interaction Predictions for Exploration of Protein Networks
PDZ domains recognise short sequence motifs at the extreme C-termini of proteins. A model based on microarray data has been recently published for predicting the binding preferences of PDZ domains to five residue long C-terminal sequences. Here we investigated the potential of this predictor for discovering novel protein interactions that involve PDZ domains. When tested on real negative data assembled from published literature, the predictor displayed a high false positive rate (FPR). We predicted and experimentally validated interactions between four PDZ domains derived from the human proteins MAGI1 and SCRIB and 19 peptides derived from human and viral C-termini of proteins. Measured binding intensities did not correlate with prediction scores, and the high FPR of the predictor was confirmed. Results indicate that limitations of the predictor may arise from an incomplete model definition and improper training of the model. Taking into account these limitations, we identified several novel putative interactions between PDZ domains of MAGI1 and SCRIB and the C-termini of the proteins FZD4, ARHGAP6, NET1, TANC1, GLUT7, MARCH3, MAS, ABC1, DLL1, TMEM215 and CYSLTR2. These proteins are localised to the membrane or suggested to act close to it and are often involved in G protein signalling. Furthermore, we showed that, while extension of minimal interacting domains or peptides toward tandem constructs or longer peptides never suppressed their ability to interact, the measured affinities and inferred specificity patterns often changed significantly. This suggests that if protein fragments interact, the full length proteins are also likely to interact, albeit possibly with altered affinities and specificities. Therefore, predictors dealing with protein fragments are promising tools for discovering protein interaction networks but their application to predict binding preferences within networks may be limited
Recommended from our members
Predicting Function and Structure using Bioinformatics Protocols:Study of the Intracellular Regions of the Jagged and Delta Protein Families
The type I membrane-spanning proteins Jagged (Jagged-i and -2) and Delta (Delta-l, - 3 and -4) are the human ligands of Notch receptors, which mediate key signaling events in cell differentiation and morphogenesis. The Jagged and Delta proteins are composed of a relatively large extracellular region and of a 100-150 residue, yet uncharacterized cytoplasmic tail, which has been recently found to be important in Notch bi-directional signaling. We applied bioinformatics methods to analyze the intracellular region of human Notch ligands, and to predict their structural and functional properties. We searched databases for orthologues, and found that while the intracellular region is evolutionaiy well conserved within the same ligand type, a wide variability is observed in different ligands. No significant similarity was found between the intracellular region of Jagged and Delta and proteins of known 3D structure. Globularity and disorder predictions indeed suggest that these regions are largely unstructured. However, secondary structure predictions show that these regions have some propensity to form local secondary structure elements. Functional predictions based on pattern recognition imply that the specificity in the Notch machinery response might be related to specific post-translational modifications and binding motifs in the ligand cytoplasmic tail, rather than to specific interactions between the receptors and the extracellular region of the ligands. We also speculate that, given the unusual amino acid composition, the cytoplasmic tail of Jagged and Delta might be involved in zinc binding
Inferring PDZ Domain Multi-Mutant Binding Preferences from Single-Mutant Data
Many important cellular protein interactions are mediated by peptide recognition domains. The ability to predict a domain's binding specificity directly from its primary sequence is essential to understanding the complexity of protein-protein interaction networks. One such recognition domain is the PDZ domain, functioning in scaffold proteins that facilitate formation of signaling networks. Predicting the PDZ domain's binding specificity was a part of the DREAM4 Peptide Recognition Domain challenge, the goal of which was to describe, as position weight matrices, the specificity profiles of five multi-mutant ERBB2IP-1 domains. We developed a method that derives multi-mutant binding preferences by generalizing the effects of single point mutations on the wild type domain's binding specificities. Our approach, trained on publicly available ERBB2IP-1 single-mutant phage display data, combined linear regression-based prediction for ligand positions whose specificity is determined by few PDZ positions, and single-mutant position weight matrix averaging for all other ligand columns. The success of our method as the winning entry of the DREAM4 competition, as well as its superior performance over a general PDZ-ligand binding model, demonstrates the advantages of training a model on a well-selected domain-specific data set
- …