52 research outputs found

    Investigation of the causal etiology in a patient with T-B+NK+ immunodeficiency

    Get PDF
    Newborn screening for severe combined immunodeficiency (SCID) has not only accelerated diagnosis and improved treatment for affected infants, but also led to identification of novel genes required for human T cell development. A male proband had SCID newborn screening showing very low T cell receptor excision circles (TRECs), a biomarker for thymic output of nascent T cells. He had persistent profound T lymphopenia, but normal numbers of B and natural killer (NK) cells. Despite an allogeneic hematopoietic stem cell transplant from his brother, he failed to develop normal T cells. Targeted resequencing excluded known SCID genes; however, whole exome sequencing (WES) of the proband and parents revealed a maternally inherited X-linked missense mutation in MED14 (MED14V763A), a component of the mediator complex. Morpholino (MO)-mediated loss of MED14 function attenuated T cell development in zebrafish. Moreover, this arrest was rescued by ectopic expression of cDNA encoding the wild type human MED14 ortholog, but not by MED14V763A, suggesting that the variant impaired MED14 function. Modeling of the equivalent mutation in mouse (Med14V769A) did not disrupt T cell development at baseline. However, repopulation of peripheral T cells upon competitive bone marrow transplantation was compromised, consistent with the incomplete T cell reconstitution experienced by the proband upon transplantation with bone marrow from his healthy male sibling, who was found to have the same MED14V763A variant. Suspecting that the variable phenotypic expression between the siblings was influenced by further mutation(s), we sought to identify genetic variants present only in the affected proband. Indeed, WES revealed a mutation in the L1 cell adhesion molecule (L1CAMQ498H); however, introducing that mutation in vivo in mice did not disrupt T cell development. Consequently, immunodeficiency in the proband may depend upon additional, unidentified gene variants

    Nucleotide Binding Switches the Information Flow in Ras GTPases

    Get PDF
    The Ras superfamily comprises many guanine nucleotide-binding proteins (G proteins) that are essential to intracellular signal transduction. The guanine nucleotide-dependent intrinsic flexibility patterns of five G proteins were investigated in atomic detail through Molecular Dynamics simulations of the GDP- and GTP-bound states (SGDP and SGTP, respectively). For all the considered systems, the intrinsic flexibility of SGDP was higher than that of SGTP, suggesting that Guanine Exchange Factor (GEF) recognition and nucleotide switch require higher amplitude motions than effector recognition or GTP hydrolysis. Functional mode, dynamic domain, and interaction energy correlation analyses highlighted significant differences in the dynamics of small G proteins and Gα proteins, especially in the inactive state. Indeed, SGDP of Gαt, is characterized by a more extensive energy coupling between nucleotide binding site and distal regions involved in GEF recognition compared to small G proteins, which attenuates in the active state. Moreover, mechanically distinct domains implicated in nucleotide switch could be detected in the presence of GDP but not in the presence of GTP. Finally, in small G proteins, functional modes are more detectable in the inactive state than in the active one and involve changes in solvent exposure of two highly conserved amino acids in switches I and II involved in GEF recognition. The average solvent exposure of these amino acids correlates in turn with the rate of GDP release, suggesting for them either direct or indirect roles in the process of nucleotide switch. Collectively, nucleotide binding changes the information flow through the conserved Ras-like domain, where GDP enhances the flexibility of mechanically distinct portions involved in nucleotide switch, and favors long distance allosteric communication (in Gα proteins), compared to GTP

    Lipid Exchange Mechanism of the Cholesteryl Ester Transfer Protein Clarified by Atomistic and Coarse-grained Simulations

    Get PDF
    Cholesteryl ester transfer protein (CETP) transports cholesteryl esters, triglycerides, and phospholipids between different lipoprotein fractions in blood plasma. The inhibition of CETP has been shown to be a sound strategy to prevent and treat the development of coronary heart disease. We employed molecular dynamics simulations to unravel the mechanisms associated with the CETP-mediated lipid exchange. To this end we used both atomistic and coarse-grained models whose results were consistent with each other. We found CETP to bind to the surface of high density lipoprotein (HDL) -like lipid droplets through its charged and tryptophan residues. Upon binding, CETP rapidly (in about 10 ns) induced the formation of a small hydrophobic patch to the phospholipid surface of the droplet, opening a route from the core of the lipid droplet to the binding pocket of CETP. This was followed by a conformational change of helix X of CETP to an open state, in which we found the accessibility of cholesteryl esters to the C-terminal tunnel opening of CETP to increase. Furthermore, in the absence of helix X, cholesteryl esters rapidly diffused into CETP through the C-terminal opening. The results provide compelling evidence that helix X acts as a lid which conducts lipid exchange by alternating the open and closed states. The findings have potential for the design of novel molecular agents to inhibit the activity of CETP

    A Combinatorial Approach to Detect Coevolved Amino Acid Networks in Protein Families of Variable Divergence

    Get PDF
    Communication between distant sites often defines the biological role of a protein: amino acid long-range interactions are as important in binding specificity, allosteric regulation and conformational change as residues directly contacting the substrate. The maintaining of functional and structural coupling of long-range interacting residues requires coevolution of these residues. Networks of interaction between coevolved residues can be reconstructed, and from the networks, one can possibly derive insights into functional mechanisms for the protein family. We propose a combinatorial method for mapping conserved networks of amino acid interactions in a protein which is based on the analysis of a set of aligned sequences, the associated distance tree and the combinatorics of its subtrees. The degree of coevolution of all pairs of coevolved residues is identified numerically, and networks are reconstructed with a dedicated clustering algorithm. The method drops the constraints on high sequence divergence limiting the range of applicability of the statistical approaches previously proposed. We apply the method to four protein families where we show an accurate detection of functional networks and the possibility to treat sets of protein sequences of variable divergence

    Rational Mutational Analysis of a Multidrug MFS Transporter CaMdr1p of Candida albicans by Employing a Membrane Environment Based Computational Approach

    Get PDF
    CaMdr1p is a multidrug MFS transporter of pathogenic Candida albicans. An over-expression of the gene encoding this protein is linked to clinically encountered azole resistance. In-depth knowledge of the structure and function of CaMdr1p is necessary for an effective design of modulators or inhibitors of this efflux transporter. Towards this goal, in this study, we have employed a membrane environment based computational approach to predict the functionally critical residues of CaMdr1p. For this, information theoretic scores which are variants of Relative Entropy (Modified Relative Entropy REM) were calculated from Multiple Sequence Alignment (MSA) by separately considering distinct physico-chemical properties of transmembrane (TM) and inter-TM regions. The residues of CaMdr1p with high REM which were predicted to be significantly important were subjected to site-directed mutational analysis. Interestingly, heterologous host Saccharomyces cerevisiae, over-expressing these mutant variants of CaMdr1p wherein these high REM residues were replaced by either alanine or leucine, demonstrated increased susceptibility to tested drugs. The hypersensitivity to drugs was supported by abrogated substrate efflux mediated by mutant variant proteins and was not attributed to their poor expression or surface localization. Additionally, by employing a distance plot from a 3D deduced model of CaMdr1p, we could also predict the role of these functionally critical residues in maintaining apparent inter-helical interactions to provide the desired fold for the proper functioning of CaMdr1p. Residues predicted to be critical for function across the family were also found to be vital from other previously published studies, implying its wider application to other membrane protein families

    A Mathematical Framework for Protein Structure Comparison

    Get PDF
    Comparison of protein structures is important for revealing the evolutionary relationship among proteins, predicting protein functions and predicting protein structures. Many methods have been developed in the past to align two or multiple protein structures. Despite the importance of this problem, rigorous mathematical or statistical frameworks have seldom been pursued for general protein structure comparison. One notable issue in this field is that with many different distances used to measure the similarity between protein structures, none of them are proper distances when protein structures of different sequences are compared. Statistical approaches based on those non-proper distances or similarity scores as random variables are thus not mathematically rigorous. In this work, we develop a mathematical framework for protein structure comparison by treating protein structures as three-dimensional curves. Using an elastic Riemannian metric on spaces of curves, geodesic distance, a proper distance on spaces of curves, can be computed for any two protein structures. In this framework, protein structures can be treated as random variables on the shape manifold, and means and covariance can be computed for populations of protein structures. Furthermore, these moments can be used to build Gaussian-type probability distributions of protein structures for use in hypothesis testing. The covariance of a population of protein structures can reveal the population-specific variations and be helpful in improving structure classification. With curves representing protein structures, the matching is performed using elastic shape analysis of curves, which can effectively model conformational changes and insertions/deletions. We show that our method performs comparably with commonly used methods in protein structure classification on a large manually annotated data set

    Machines vs. Ensembles: Effective MAPK Signaling through Heterogeneous Sets of Protein Complexes

    Get PDF
    A grant from the One-University Open Access Fund at the University of Kansas was used to defray the author’s publication fees in this Open Access journal. The Open Access Fund, administered by librarians from the KU, KU Law, and KUMC libraries, is made possible by contributions from the offices of KU Provost, KU Vice Chancellor for Research & Graduate Studies, and KUMC Vice Chancellor for Research. For more information about the Open Access Fund, please see http://library.kumc.edu/authors-fund.xml.Despite the importance of intracellular signaling networks, there is currently no consensus regarding the fundamental nature of the protein complexes such networks employ. One prominent view involves stable signaling machines with well-defined quaternary structures. The combinatorial complexity of signaling networks has led to an opposing perspective, namely that signaling proceeds via heterogeneous pleiomorphic ensembles of transient complexes. Since many hypotheses regarding network function rely on how we conceptualize signaling complexes, resolving this issue is a central problem in systems biology. Unfortunately, direct experimental characterization of these complexes has proven technologically difficult, while combinatorial complexity has prevented traditional modeling methods from approaching this question. Here we employ rule-based modeling, a technique that overcomes these limitations, to construct a model of the yeast pheromone signaling network. We found that this model exhibits significant ensemble character while generating reliable responses that match experimental observations. To contrast the ensemble behavior, we constructed a model that employs hierarchical assembly pathways to produce scaffold-based signaling machines. We found that this machine model could not replicate the experimentally observed combinatorial inhibition that arises when the scaffold is overexpressed. This finding provides evidence against the hierarchical assembly of machines in the pheromone signaling network and suggests that machines and ensembles may serve distinct purposes in vivo. In some cases, e.g. core enzymatic activities like protein synthesis and degradation, machines assembled via hierarchical energy landscapes may provide functional stability for the cell. In other cases, such as signaling, ensembles may represent a form of weak linkage, facilitating variation and plasticity in network evolution. The capacity of ensembles to signal effectively will ultimately shape how we conceptualize the function, evolution and engineering of signaling networks

    Intrinsic Structural Disorder Confers Cellular Viability on Oncogenic Fusion Proteins

    Get PDF
    Chromosomal translocations, which often generate chimeric proteins by fusing segments of two distinct genes, represent the single major genetic aberration leading to cancer. We suggest that the unifying theme of these events is a high level of intrinsic structural disorder, enabling fusion proteins to evade cellular surveillance mechanisms that eliminate misfolded proteins. Predictions in 406 translocation-related human proteins show that they are significantly enriched in disorder (43.3% vs. 20.7% in all human proteins), they have fewer Pfam domains, and their translocation breakpoints tend to avoid domain splitting. The vicinity of the breakpoint is significantly more disordered than the rest of these already highly disordered fusion proteins. In the unlikely event of domain splitting in fusion it usually spares much of the domain or splits at locations where the newly exposed hydrophobic surface area approximates that of an intact domain. The mechanisms of action of fusion proteins suggest that in most cases their structural disorder is also essential to the acquired oncogenic function, enabling the long-range structural communication of remote binding and/or catalytic elements. In this respect, there are three major mechanisms that contribute to generating an oncogenic signal: (i) a phosphorylation site and a tyrosine-kinase domain are fused, and structural disorder of the intervening region enables intramolecular phosphorylation (e.g., BCR-ABL); (ii) a dimerisation domain fuses with a tyrosine kinase domain and disorder enables the two subunits within the homodimer to engage in permanent intermolecular phosphorylations (e.g., TFG-ALK); (iii) the fusion of a DNA-binding element to a transactivator domain results in an aberrant transcription factor that causes severe misregulation of transcription (e.g. EWS-ATF). Our findings also suggest novel strategies of intervention against the ensuing neoplastic transformations

    An Atlas of the Thioredoxin Fold Class Reveals the Complexity of Function-Enabling Adaptations

    Get PDF
    The group of proteins that contain a thioredoxin (Trx) fold is huge and diverse. Assessment of the variation in catalytic machinery of Trx fold proteins is essential in providing a foundation for understanding their functional diversity and predicting the function of the many uncharacterized members of the class. The proteins of the Trx fold class retain common features—including variations on a dithiol CxxC active site motif—that lead to delivery of function. We use protein similarity networks to guide an analysis of how structural and sequence motifs track with catalytic function and taxonomic categories for 4,082 representative sequences spanning the known superfamilies of the Trx fold. Domain structure in the fold class is varied and modular, with 2.8% of sequences containing more than one Trx fold domain. Most member proteins are bacterial. The fold class exhibits many modifications to the CxxC active site motif—only 56.8% of proteins have both cysteines, and no functional groupings have absolute conservation of the expected catalytic motif. Only a small fraction of Trx fold sequences have been functionally characterized. This work provides a global view of the complex distribution of domains and catalytic machinery throughout the fold class, showing that each superfamily contains remnants of the CxxC active site. The unifying context provided by this work can guide the comparison of members of different Trx fold superfamilies to gain insight about their structure-function relationships, illustrated here with the thioredoxins and peroxiredoxins

    A Self-Organizing Algorithm for Modeling Protein Loops

    Get PDF
    Protein loops, the flexible short segments connecting two stable secondary structural units in proteins, play a critical role in protein structure and function. Constructing chemically sensible conformations of protein loops that seamlessly bridge the gap between the anchor points without introducing any steric collisions remains an open challenge. A variety of algorithms have been developed to tackle the loop closure problem, ranging from inverse kinematics to knowledge-based approaches that utilize pre-existing fragments extracted from known protein structures. However, many of these approaches focus on the generation of conformations that mainly satisfy the fixed end point condition, leaving the steric constraints to be resolved in subsequent post-processing steps. In the present work, we describe a simple solution that simultaneously satisfies not only the end point and steric conditions, but also chirality and planarity constraints. Starting from random initial atomic coordinates, each individual conformation is generated independently by using a simple alternating scheme of pairwise distance adjustments of randomly chosen atoms, followed by fast geometric matching of the conformationally rigid components of the constituent amino acids. The method is conceptually simple, numerically stable and computationally efficient. Very importantly, additional constraints, such as those derived from NMR experiments, hydrogen bonds or salt bridges, can be incorporated into the algorithm in a straightforward and inexpensive way, making the method ideal for solving more complex multi-loop problems. The remarkable performance and robustness of the algorithm are demonstrated on a set of protein loops of length 4, 8, and 12 that have been used in previous studies
    corecore