144 research outputs found

    Confound-leakage: confound removal in machine learning leads to leakage

    Get PDF
    BACKGROUND: Machine learning (ML) approaches are a crucial component of modern data analysis in many fields, including epidemiology and medicine. Nonlinear ML methods often achieve accurate predictions, for instance, in personalized medicine, as they are capable of modeling complex relationships between features and the target. Problematically, ML models and their predictions can be biased by confounding information present in the features. To remove this spurious signal, researchers often employ featurewise linear confound regression (CR). While this is considered a standard approach for dealing with confounding, possible pitfalls of using CR in ML pipelines are not fully understood. RESULTS: We provide new evidence that, contrary to general expectations, linear confound regression can increase the risk of confounding when combined with nonlinear ML approaches. Using a simple framework that uses the target as a confound, we show that information leaked via CR can increase null or moderate effects to near-perfect prediction. By shuffling the features, we provide evidence that this increase is indeed due to confound-leakage and not due to revealing of information. We then demonstrate the danger of confound-leakage in a real-world clinical application where the accuracy of predicting attention-deficit/hyperactivity disorder is overestimated using speech-derived features when using depression as a confound. CONCLUSIONS: Mishandling or even amplifying confounding effects when building ML models due to confound-leakage, as shown, can lead to untrustworthy, biased, and unfair predictions. Our expose of the confound-leakage pitfall and provided guidelines for dealing with it can help create more robust and trustworthy ML models

    Tah1 helix-swap dimerization prevents mixed Hsp90 co-chaperone complexes

    Get PDF
    Specific co-chaperone adaptors facilitate the recruitment of client proteins to the Hsp90 system. Tah1 binds the C-terminal conserved MEEVD motif of Hsp90, thus linking an eclectic set of client proteins to the R2TP complex for their assembly and regulation by Hsp90. Rather than the normal complement of seven α-helices seen in other tetratricopeptide repeat (TPR) domains, Tah1 unusually consists of the first five only. Consequently, the methionine of the MEEVD peptide remains exposed to solvent when bound by Tah1. In solution Tah1 appears to be predominantly monomeric, and recent structures have failed to explain how Tah1 appears to prevent the formation of mixed TPR domain-containing complexes such as Cpr6-(Hsp90)2-Tah1. To understand this further, the crystal structure of Tah1 in complex with the MEEVD peptide of Hsp90 was determined, which shows a helix swap involving the fifth α-helix between two adjacently bound Tah1 molecules. Dimerization of Tah1 restores the normal binding environment of the bound Hsp90 methionine residue by reconstituting a TPR binding site similar to that in seven-helix-containing TPR domain proteins. Dimerization also explains how other monomeric TPR-domain proteins are excluded from forming inappropriate mixed co-chaperone complexes

    Structural basis for phosphorylation-dependent recruitment of tel2 to hsp90 by pih1

    Get PDF
    Client protein recruitment to the Hsp90 system depends on cochaperones that bind the client and Hsp90 simultaneously and facilitate their interaction. Hsp90 involvement in the assembly of snoRNPs, RNA polymerases, PI3-kinase-like kinases, and chromatin remodeling complexes depends on the TTT (Tel2-Tti1-Tti2), and R2TP complexes-consisting of the AAA-ATPases Rvb1 and Rvb2, Tah1 (Spagh/RPAP3 in metazoa), and Pih1 (Pih1D1 in humans)-that together provide the connection to Hsp90. The biochemistry underlying R2TP function is still poorly understood. Pih1 in particular, at the heart of the complex, has not been described at a structural level, nor have the multiple protein-protein interactions it mediates been characterized. Here we present a structural and biochemical analysis of Hsp90-Tah1-Pih1, Hsp90-Spagh, and Pih1D1-Tel2 complexes that reveal a domain in Pih1D1 specific for binding CK2 phosphorylation sites, and together define the structural basis by which the R2TP complex connects the Hsp90 chaperone system to the TTT complex

    Crystal Structures of the ATPase Domains of Four Human Hsp70 Isoforms: HSPA1L/Hsp70-hom, HSPA2/Hsp70-2, HSPA6/Hsp70B', and HSPA5/BiP/GRP78

    Get PDF
    The 70-kDa heat shock proteins (Hsp70) are chaperones with central roles in processes that involve polypeptide remodeling events. Hsp70 proteins consist of two major functional domains: an N-terminal nucleotide binding domain (NBD) with ATPase activity, and a C-terminal substrate binding domain (SBD). We present the first crystal structures of four human Hsp70 isoforms, those of the NBDs of HSPA1L, HSPA2, HSPA5 and HSPA6. As previously with Hsp70 family members, all four proteins crystallized in a closed cleft conformation, although a slight cleft opening through rotation of subdomain IIB was observed for the HSPA5-ADP complex. The structures presented here support the view that the NBDs of human Hsp70 function by conserved mechanisms and contribute little to isoform specificity, which instead is brought about by the SBDs and by accessory proteins.This article can also be viewed as an enhanced version in which the text of the article is integrated with interactive 3D representations and animated transitions. Please note that a web plugin is required to access this enhanced functionality. Instructions for the installation and use of the web plugin are available in Text S1

    Structure of clathrin coat with bound Hsc70 and auxilin: mechanism of Hsc70-facilitated disassembly

    Get PDF
    The chaperone Hsc70 drives the clathrin assembly–disassembly cycle forward by stimulating dissociation of a clathrin lattice. A J-domain containing co-chaperone, auxilin, associates with a freshly budded clathrin-coated vesicle, or with an in vitro assembled clathrin coat, and recruits Hsc70 to its specific heavy-chain-binding site. We have determined by electron cryomicroscopy (cryoEM), at about 11 Å resolution, the structure of a clathrin coat (in the D6-barrel form) with specifically bound Hsc70 and auxilin. The Hsc70 binds a previously analysed site near the C-terminus of the heavy chain, with a stoichiometry of about one per three-fold vertex. Its binding is accompanied by a distortion of the clathrin lattice, detected by a change in the axial ratio of the D6 barrel. We propose that when Hsc70, recruited to a position close to its target by the auxilin J-domain, splits ATP, it clamps firmly onto its heavy-chain site and locks in place a transient fluctuation. Accumulation of the local strain thus imposed at multiple vertices can then lead to disassembly

    The Mammalian Disaggregase Machinery: Hsp110 Synergizes with Hsp70 and Hsp40 to Catalyze Protein Disaggregation and Reactivation in a Cell-Free System

    Get PDF
    Bacteria, fungi, protozoa, chromista and plants all harbor homologues of Hsp104, a AAA+ ATPase that collaborates with Hsp70 and Hsp40 to promote protein disaggregation and reactivation. Curiously, however, metazoa do not possess an Hsp104 homologue. Thus, whether animal cells renature large protein aggregates has long remained unclear. Here, it is established that mammalian cytosol prepared from different sources possesses a potent, ATP-dependent protein disaggregase and reactivation activity, which can be accelerated and stimulated by Hsp104. This activity did not require the AAA+ ATPase, p97. Rather, mammalian Hsp110 (Apg-2), Hsp70 (Hsc70 or Hsp70) and Hsp40 (Hdj1) were necessary and sufficient to slowly dissolve large disordered aggregates and recover natively folded protein. This slow disaggregase activity was conserved to yeast Hsp110 (Sse1), Hsp70 (Ssa1) and Hsp40 (Sis1 or Ydj1). Hsp110 must engage substrate, engage Hsp70, promote nucleotide exchange on Hsp70, and hydrolyze ATP to promote disaggregation of disordered aggregates. Similarly, Hsp70 must engage substrate and Hsp110, and hydrolyze ATP for protein disaggregation. Hsp40 must harbor a functional J domain to promote protein disaggregation, but the J domain alone is insufficient. Optimal disaggregase activity is achieved when the Hsp40 can stimulate the ATPase activity of Hsp110 and Hsp70. Finally, Hsp110, Hsp70 and Hsp40 fail to rapidly remodel amyloid forms of the yeast prion protein, Sup35, or the Parkinson's disease protein, alpha-synuclein. However, Hsp110, Hsp70 and Hsp40 enhanced the activity of Hsp104 against these amyloid substrates. Taken together, these findings suggest that Hsp110 fulfils a subset of Hsp104 activities in mammals. Moreover, they suggest that Hsp104 can collaborate with the mammalian disaggregase machinery to rapidly remodel amyloid conformers

    Role of Hsp70 ATPase Domain Intrinsic Dynamics and Sequence Evolution in Enabling its Functional Interactions with NEFs

    Get PDF
    Catalysis of ADP-ATP exchange by nucleotide exchange factors (NEFs) is central to the activity of Hsp70 molecular chaperones. Yet, the mechanism of interaction of this family of chaperones with NEFs is not well understood in the context of the sequence evolution and structural dynamics of Hsp70 ATPase domains. We studied the interactions of Hsp70 ATPase domains with four different NEFs on the basis of the evolutionary trace and co-evolution of the ATPase domain sequence, combined with elastic network modeling of the collective dynamics of the complexes. Our study reveals a subtle balance between the intrinsic (to the ATPase domain) and specific (to interactions with NEFs) mechanisms shared by the four complexes. Two classes of key residues are distinguished in the Hsp70 ATPase domain: (i) highly conserved residues, involved in nucleotide binding, which mediate, via a global hinge-bending, the ATPase domain opening irrespective of NEF binding, and (ii) not-conserved but co-evolved and highly mobile residues, engaged in specific interactions with NEFs (e.g., N57, R258, R262, E283, D285). The observed interplay between these respective intrinsic (pre-existing, structure-encoded) and specific (co-evolved, sequence-dependent) interactions provides us with insights into the allosteric dynamics and functional evolution of the modular Hsp70 ATPase domain

    [SWI+], the Prion Formed by the Chromatin Remodeling Factor Swi1, Is Highly Sensitive to Alterations in Hsp70 Chaperone System Activity

    Get PDF
    The yeast prion [SWI+], formed of heritable amyloid aggregates of the Swi1 protein, results in a partial loss of function of the SWI/SNF chromatin-remodeling complex, required for the regulation of a diverse set of genes. Our genetic analysis revealed that [SWI+] propagation is highly dependent upon the action of members of the Hsp70 molecular chaperone system, specifically the Hsp70 Ssa, two of its J-protein co-chaperones, Sis1 and Ydj1, and the nucleotide exchange factors of the Hsp110 family (Sse1/2). Notably, while all yeast prions tested thus far require Sis1, [SWI+] is the only one known to require the activity of Ydj1, the most abundant J-protein in yeast. The C-terminal region of Ydj1, which contains the client protein interaction domain, is required for [SWI+] propagation. However, Ydj1 is not unique in this regard, as another, closely related J-protein, Apj1, can substitute for it when expressed at a level approaching that of Ydj1. While dependent upon Ydj1 and Sis1 for propagation, [SWI+] is also highly sensitive to overexpression of both J-proteins. However, this increased prion-loss requires only the highly conserved 70 amino acid J-domain, which serves to stimulate the ATPase activity of Hsp70 and thus to stabilize its interaction with client protein. Overexpression of the J-domain from Sis1, Ydj1, or Apj1 is sufficient to destabilize [SWI+]. In addition, [SWI+] is lost upon overexpression of Sse nucleotide exchange factors, which act to destabilize Hsp70's interaction with client proteins. Given the plethora of genes affected by the activity of the SWI/SNF chromatin-remodeling complex, it is possible that this sensitivity of [SWI+] to the activity of Hsp70 chaperone machinery may serve a regulatory role, keeping this prion in an easily-lost, meta-stable state. Such sensitivity may provide a means to reach an optimal balance of phenotypic diversity within a cell population to better adapt to stressful environments

    Genetic and pharmacological inhibition of CDK9 drives neutrophil apoptosis to resolve inflammation in zebrafish in vivo

    Get PDF
    Neutrophilic inflammation is tightly regulated and subsequently resolves to limit tissue damage and promote repair. When the timely resolution of inflammation is dysregulated, tissue damage and disease results. One key control mechanism is neutrophil apoptosis, followed by apoptotic cell clearance by phagocytes such as macrophages. Cyclin-dependent kinase (CDK) inhibitor drugs induce neutrophil apoptosis in vitro and promote resolution of inflammation in rodent models. Here we present the first in vivo evidence, using pharmacological and genetic approaches, that CDK9 is involved in the resolution of neutrophil-dependent inflammation. Using live cell imaging in zebrafish with labelled neutrophils and macrophages, we show that pharmacological inhibition, morpholino-mediated knockdown and CRISPR/cas9-mediated knockout of CDK9 enhances inflammation resolution by reducing neutrophil numbers via induction of apoptosis after tailfin injury. Importantly, knockdown of the negative regulator La-related protein 7 (LaRP7) increased neutrophilic inflammation. Our data show that CDK9 is a possible target for controlling resolution of inflammation

    Key mechanisms governing resolution of lung inflammation

    Get PDF
    Innate immunity normally provides excellent defence against invading microorganisms. Acute inflammation is a form of innate immune defence and represents one of the primary responses to injury, infection and irritation, largely mediated by granulocyte effector cells such as neutrophils and eosinophils. Failure to remove an inflammatory stimulus (often resulting in failed resolution of inflammation) can lead to chronic inflammation resulting in tissue injury caused by high numbers of infiltrating activated granulocytes. Successful resolution of inflammation is dependent upon the removal of these cells. Under normal physiological conditions, apoptosis (programmed cell death) precedes phagocytic recognition and clearance of these cells by, for example, macrophages, dendritic and epithelial cells (a process known as efferocytosis). Inflammation contributes to immune defence within the respiratory mucosa (responsible for gas exchange) because lung epithelia are continuously exposed to a multiplicity of airborne pathogens, allergens and foreign particles. Failure to resolve inflammation within the respiratory mucosa is a major contributor of numerous lung diseases. This review will summarise the major mechanisms regulating lung inflammation, including key cellular interplays such as apoptotic cell clearance by alveolar macrophages and macrophage/neutrophil/epithelial cell interactions. The different acute and chronic inflammatory disease states caused by dysregulated/impaired resolution of lung inflammation will be discussed. Furthermore, the resolution of lung inflammation during neutrophil/eosinophil-dominant lung injury or enhanced resolution driven via pharmacological manipulation will also be considered
    corecore