137 research outputs found

    Sequential, structural and functional properties of protein complexes are defined by how folding and binding intertwine.

    Intrinsically Disordered Proteins (IDPs) fulfill critical biological roles without having the potential to fold on their own. While lacking inherent structure, the majority of IDPs do reach a folded state via interaction with a protein partner, presenting a deep entanglement of the folding and binding processes. Protein disorder has been recognized as a major determinant of several properties of proteins, such as sequence, adopted structure upon binding, and function. Yet a detailed description of how the binding process is reflected in these features is generally lacking. Here, we defined three categories of protein complexes depending on the unbound structural state of the interactors, and analyzed them in detail. Strikingly, we found that the properties of interactors in terms of sequence and adopted structure are defined not only by the intrinsic structural state of the protein itself, but also to a comparable extent by the structural state of the binding partner. The three different types of interactions are also regulated through divergent molecular tactics of post-translational modifications. This not only widens the range of biologically relevant sequence and structure spaces defined by ordered proteins, but also presents distinct molecular mechanisms compatible with specific biological processes, separately for each interaction type. The distinct attributes of different binding modes identified in this study can help to understand how various types of interactions serve as building blocks for the assembly of tightly regulated and highly intertwined regulatory networks.

    The N-terminal intrinsically disordered domain of mgm101p is localized to the mitochondrial nucleoid.

    The mitochondrial genome maintenance gene, MGM101, is essential for yeasts that depend on mitochondrial DNA replication. Previously, in Saccharomyces cerevisiae, it has been found that the carboxy-terminal two-thirds of Mgm101p has a functional core. Furthermore, there is a high level of amino acid sequence conservation in this region across widely diverse species. By contrast, the amino-terminal region, which is also essential for function, does not have recognizable conservation. Using a bioinformatic approach, we find that the functional core from yeast and a corresponding region of Mgm101p from the coral Acropora millepora have an ordered structure, while the N-terminal domains of sequences from yeast and coral are predicted to be disordered. To examine whether ordered and disordered domains of Mgm101p have specific or general functions, we made chimeric proteins from yeast and coral by swapping the two regions. We find, by an in vivo assay in S. cerevisiae, that the ordered domain of A. millepora can functionally replace the yeast core region but the disordered domain of the coral protein cannot substitute for its yeast counterpart. Mgm101p is found in the mitochondrial nucleoid along with enzymes and proteins involved in mtDNA replication. By attaching green fluorescent protein to the N-terminal disordered domain of yeast Mgm101p, we find that GFP is still directed to the mitochondrial nucleoid, where full-length Mgm101p-GFP is targeted.

    The Role of Intrinsically Unstructured Proteins in Neurodegenerative Diseases

    The number and importance of intrinsically unstructured proteins (IUPs), known to be involved in various human disorders, are growing rapidly. To test for the general implications of intrinsic disorder in proteins involved in neurodegenerative diseases, disorder prediction tools were applied to three datasets comprising proteins involved in Huntington's disease (HD), Parkinson's disease (PD) and Alzheimer's disease (AD). The results show that, in general, proteins in the disease datasets possess significantly enhanced intrinsic unstructuredness. Most of these disordered proteins in the disease datasets are found to be involved in neuronal activities, signal transduction, apoptosis, intracellular traffic, cell differentiation, etc. These proteins are also found to have more interactors: as the proportion of disorder (i.e., the length of the unfolded stretch) increased, the size of the interaction network increased as well. All these observations suggest that “moonlighting”, i.e. the contextual acquisition of different (transient) structural conformations, may eventually allow these disordered proteins to act as network “hubs”, and thus they may have a crucial influence on the pathogenicity of neurodegenerative diseases.

    The Hepatitis E Virus Polyproline Region Is Involved in Viral Adaptation

    Genomes of hepatitis E virus (HEV), rubivirus and cutthroat virus (CTV) contain a region of high proline density and low amino acid (aa) complexity, named the polyproline region (PPR). In HEV genotypes 1, 3 and 4, it is the only region within the non-structural open reading frame (ORF1) with positive selection (4–10 codons with dN/dS>1). This region has the highest density of sites with homoplasy values >0.5. Genotypes 3 and 4 show a ∼3-fold increase in homoplastic density (HD) in the PPR compared to any other region in ORF1, whereas genotype 1 does not exhibit significant HD (p<0.0001). PPR sequence divergence was found to be 2-fold greater for HEV genotypes 3 and 4 than for genotype 1. The data suggest the PPR plays an important role in host-range adaptation. Although the PPR appears to be hypervariable and homoplastic, it retains as much phylogenetic signal as any other similarly sized region in ORF1, indicating that convergent evolution operates within the major HEV phylogenetic lineages. Analyses of sequence-based secondary structure and the tertiary structure identify the PPR as an intrinsically disordered region (IDR), implicating it in the regulation of replication. The identified propensity for disorder-to-order state transitions indicates the PPR is involved in protein-protein interactions. Furthermore, the PPR of all four HEV genotypes contains seven putative linear binding motifs for ligands involved in the regulation of a wide range of cellular signaling processes. Structure-based analysis of possible molecular functions of these motifs showed the PPR is prone to bind a wide variety of ligands. Collectively, these data suggest a role for the PPR in HEV adaptation. In particular, as an IDR, the PPR likely contributes to fine tuning of viral replication through protein-protein interactions and should be considered as a target for the development of novel anti-viral drugs.

    Proteins with Complex Architecture as Potential Targets for Drug Design: A Case Study of Mycobacterium tuberculosis

    Lengthy co-evolution of Homo sapiens and Mycobacterium tuberculosis, the main causative agent of tuberculosis, resulted in a dramatically successful pathogen species that presents a considerable challenge for modern medicine. The continuous and ever increasing appearance of multi-drug resistant mycobacteria necessitates the identification of novel drug targets and drugs with new mechanisms of action. However, further insights are needed to establish automated protocols for target selection based on the available complete genome sequences. In the present study, we perform complete proteome level comparisons between M. tuberculosis, mycobacteria, other prokaryotes and available eukaryotes based on protein domains, local sequence similarities and protein disorder. We show that the enrichment of certain domains in the genome can indicate an important function specific to M. tuberculosis. We identified two families, termed pkn and PE/PPE, that stand out in this respect. The common property of these two protein families is a complex domain organization that combines species-specific regions, commonly occurring domains and disordered segments. Besides highlighting promising novel drug target candidates in M. tuberculosis, the presented analysis can also be viewed as a general protocol to identify proteins involved in species-specific functions in a given organism. We conclude that target selection protocols should be extended to include proteins with complex domain architectures instead of focusing on sequentially unique and essential proteins only.

    The Critical Role of N- and C-Terminal Contact in Protein Stability and Folding of a Family 10 Xylanase under Extreme Conditions

    Stabilization strategies adopted by proteins under extreme conditions are very complex and involve various kinds of interactions. Recent studies have shown that a large proportion of proteins have their N- and C-terminal elements in close contact and suggested they play a role in protein folding and stability. However, the biological significance of this contact remains elusive. In the present study, we investigate the role of N- and C-terminal residue interaction using a family 10 xylanase (BSX) with a TIM-barrel structure that shows stability under high temperature, alkali pH, and protease and SDS treatment. Based on the crystal structure, an aromatic cluster was identified that involves Phe4, Trp6 and Tyr343 holding the N- and C-terminus together; this is a unique and important feature of this protein that might be crucial for folding and stability under poly-extreme conditions. Alanine substitution of Phe4, Trp6 and Tyr343 drastically decreased stability under all parameters studied. Importantly, substitution of Phe4 with Trp increased stability in SDS treatment. Mass spectrometry results of limited proteolysis further demonstrated that the Arg344 residue is highly susceptible to trypsin digestion in sensitive mutants such as ΔF4, W6A and Y343A, suggesting again that disruption of the Phe4-Trp6-Tyr343 (F-W-Y) cluster destabilizes the N- and C-terminal interaction. Our results underscore the importance of N- and C-terminal contact through aromatic interactions in protein folding and stability under extreme conditions, and these results may be useful to improve the stability of other proteins under suboptimal conditions.

    Mechanism of subunit interaction at ketosynthase-dehydratase junctions in trans-AT polyketide synthases

    Modular polyketide synthases (PKSs) produce numerous structurally complex natural products with diverse applications in medicine and agriculture. They typically consist of several multienzyme subunits that utilize structurally defined docking domains (DDs) at their N- and C-termini to ensure correct assembly into functional multi-protein complexes. Here we report a fundamentally different mechanism for subunit assembly in trans-AT modular PKSs at the junction between ketosynthase (KS) and dehydratase (DH) domains. This involves direct interaction of a largely unstructured docking domain (DD) at the C-terminus of the KS with the surface of the downstream DH. Acyl transfer assays and mechanism-based cross-linking established that the DD is required for the KS to communicate with the acyl carrier protein appended to the DH. Two distinct regions for binding of the DD to the DH were identified using NMR spectroscopy, carbene foot-printing and mutagenesis, providing a foundation for future elucidation of the molecular basis for interaction specificity.

    Intrinsically Disordered Proteins Display No Preference for Chaperone Binding In Vivo

    Intrinsically disordered/unstructured proteins (IDPs) are extremely sensitive to proteolysis in vitro, but show no enhanced degradation rates in vivo. Their existence and functioning may be explained if IDPs are preferentially associated with chaperones in the cell, which may offer protection against degradation by proteases. To test this inference, we took pairwise interaction data from high-throughput interaction studies and analyzed them to see if predicted disorder correlates with the tendency of proteins to bind chaperones. Our major finding is that disorder predicted by the IUPred algorithm actually shows a negative correlation with chaperone binding in E. coli, S. cerevisiae, and metazoan species. Since predicted disorder positively correlates with the tendency of partner binding in the interactome, the difference between the disorder of chaperone-binding and non-binding proteins is even more pronounced if normalized to their overall tendency to be involved in pairwise protein–protein interactions. We argue that chaperone binding is primarily required for folding of globular proteins, as reflected in an increased preference for chaperones among proteins in which at least one Pfam domain exists. In terms of the functional consequences of chaperone binding of mostly disordered proteins, we suggest that its primary purpose is not the assistance of folding, but the promotion of assembly with partners. In support of this conclusion, we show that IDPs that bind chaperones also tend to bind other proteins.

    Characterization of globulin storage proteins of a low prolamin cereal species in relation to celiac disease

    Brachypodium distachyon, a small annual grass with seed storage globulins as primary protein reserves, was used in our study to analyse the toxic nature of non-prolamin seed storage proteins related to celiac disease. The main storage proteins of B. distachyon are the 7S globulin type proteins and the 11S, 12S seed storage globulins similar to those of oat and rice. Immunoblot analyses using serum samples from celiac disease patients were carried out, followed by the identification of immune-responsive proteins using mass spectrometry. Serum samples from celiac patients on a gluten-free diet, from patients with Crohn's disease and from healthy subjects were used as controls. The identified proteins with intense serum-IgA reactivity belong to the 7S and 11-12S seed globulin family. Structure prediction and epitope prediction analyses confirmed the presence of celiac disease-related linear B cell epitope homologs and the presence of peptide regions with strong HLA-DQ8 and DQ2 binding capabilities. These results highlight that both MHC-II presentation and B cell response may develop not only against prolamins but also against seed storage globulins. This is the first study of the non-prolamin type seed storage proteins of Brachypodium from the perspective of celiac disease.

    TMFoldRec: a statistical potential-based transmembrane protein fold recognition tool.

    BACKGROUND: Transmembrane proteins (TMPs) are the key components of signal transduction, cell-cell adhesion, and energy and material transport into and out of cells. For a deep understanding of these processes, structure determination of transmembrane proteins is indispensable. However, due to technical difficulties, only a few transmembrane protein structures have been determined experimentally. Large-scale genomic sequencing provides increasing amounts of sequence information on the proteins and whole proteomes of living organisms, which poses a challenge for bioinformatics: how structural information should be gained from a sequence. RESULTS: Here, we present a novel method, TMFoldRec, for fold prediction of membrane segments in transmembrane proteins. TMFoldRec, which is based on statistical potentials, was tested on a benchmark set containing 124 TMP chains from the PDBTM database. Using a 10-fold jackknife method, the native folds were correctly identified in 77 % of the cases. This accuracy exceeds that of state-of-the-art methods. In addition, a key feature of the TMFoldRec algorithm is its ability to estimate the reliability of the prediction and to decide, with an accuracy of 70 %, whether the obtained lowest-energy structure is the native one. CONCLUSION: These results imply that the membrane-embedded parts of TMPs dictate the TM structures rather than the soluble parts. Moreover, the reliability scores attached to the predictions make our algorithm applicable to proteome-wide analyses. AVAILABILITY: The program is available upon request for academic use.