58 research outputs found

    FLORA: a novel method to predict protein function from structure in diverse superfamilies

    Get PDF
    Predicting protein function from structure remains an active area of interest, particularly for the structural genomics initiatives where a substantial number of structures are initially solved with little or no functional characterisation. Although global structure comparison methods can be used to transfer functional annotations, the relationship between fold and function is complex, particularly in functionally diverse superfamilies that have evolved through different secondary structure embellishments to a common structural core. The majority of prediction algorithms employ local templates built on known or predicted functional residues. Here, we present a novel method (FLORA) that automatically generates structural motifs associated with different functional sub-families (FSGs) within functionally diverse domain superfamilies. Templates are created purely on the basis of their specificity for a given FSG, and the method makes no prior prediction of functional sites, nor assumes specific physico-chemical properties of residues. FLORA is able to accurately discriminate between homologous domains with different functions and substantially outperforms (a 2–3 fold increase in coverage at low error rates) popular structure comparison methods and a leading function prediction method. We benchmark FLORA on a large data set of enzyme superfamilies from all three major protein classes (α, β, αβ) and demonstrate the functional relevance of the motifs it identifies. We also provide novel predictions of enzymatic activity for a large number of structures solved by the Protein Structure Initiative. Overall, we show that FLORA is able to effectively detect functionally similar protein domain structures by purely using patterns of structural conservation of all residues

    Comparative Structural Analysis of Lipid Binding START Domains

    Get PDF
    Steroidogenic acute regulatory (StAR) protein related lipid transfer (START) domains are small globular modules that form a cavity where lipids and lipid hormones bind. These domains can transport ligands to facilitate lipid exchange between biological membranes, and they have been postulated to modulate the activity of other domains of the protein in response to ligand binding. More than a dozen human genes encode START domains, and several of them are implicated in a disease.We report crystal structures of the human STARD1, STARD5, STARD13 and STARD14 lipid transfer domains. These represent four of the six functional classes of START domains.Sequence alignments based on these and previously reported crystal structures define the structural determinants of human START domains, both those related to structural framework and those involved in ligand specificity.This article can also be viewed as an enhanced version in which the text of the article is integrated with interactive 3D representations and animated transitions. Please note that a web plugin is required to access this enhanced functionality. Instructions for the installation and use of the web plugin are available in Text S1

    A flare in the optical spotted in the changing-look Seyfert NGC 3516

    Get PDF
    Context. We present observations from the short-term intensive optical campaign (from September 2019 to January 2020) of the changing-look Seyfert NGC 3516. This active galactic nucleus is known to have strong optical variability and has changed its type in the past. It has been in the low-activity state in the optical since 2013, with some rebrightening from the end of 2015 to the beginning of 2016, after which it remained dormant.Aims. We aim to study the photometric and spectral variability of NGC 3516 from the new observations in U- and B-bands and examine the profiles of the optical broad emission lines in order to demonstrate that this object may be entering a new state of activity.Methods. NGC 3516 has been monitored intensively for the past 4 months with an automated telescope in U and B filters, enabling accurate photometry of 0.01 precision. Spectral observations were triggered when an increase in brightness was spotted. We support our analysis of past-episodes of violent variability with the UV and X-ray long-term light curves constructed from the archival Swift/UVOT and Swift/XRT data.Results. An increase of the photometric magnitude is seen in both U and B filters to a maximum amplitude of 0.25 mag and 0.11 mag, respectively. During the flare, we observe stronger forbidden high-ionization iron lines ([FeVII] and [FeX]) than reported before, as well as the complex broad H alpha and H beta lines. This is especially seen in H alpha, which appears to be double-peaked. It seems that a very broad component of similar to 10 000 km s(-1) in width in the Balmer lines is appearing. The trends in the optical, UV, and X-ray light curves are similar, with the amplitudes of variability being significantly larger in the case of UV and X-ray bands.Conclusions. The increase of the continuum emission, the variability of the coronal lines, and the very broad component in the Balmer lines may indicate that the AGN of NGC 3516 is finally leaving the low-activity state in which it has been for the last similar to 3 years.</div

    Nucleotide Binding Switches the Information Flow in Ras GTPases

    Get PDF
    The Ras superfamily comprises many guanine nucleotide-binding proteins (G proteins) that are essential to intracellular signal transduction. The guanine nucleotide-dependent intrinsic flexibility patterns of five G proteins were investigated in atomic detail through Molecular Dynamics simulations of the GDP- and GTP-bound states (SGDP and SGTP, respectively). For all the considered systems, the intrinsic flexibility of SGDP was higher than that of SGTP, suggesting that Guanine Exchange Factor (GEF) recognition and nucleotide switch require higher amplitude motions than effector recognition or GTP hydrolysis. Functional mode, dynamic domain, and interaction energy correlation analyses highlighted significant differences in the dynamics of small G proteins and Gα proteins, especially in the inactive state. Indeed, SGDP of Gαt, is characterized by a more extensive energy coupling between nucleotide binding site and distal regions involved in GEF recognition compared to small G proteins, which attenuates in the active state. Moreover, mechanically distinct domains implicated in nucleotide switch could be detected in the presence of GDP but not in the presence of GTP. Finally, in small G proteins, functional modes are more detectable in the inactive state than in the active one and involve changes in solvent exposure of two highly conserved amino acids in switches I and II involved in GEF recognition. The average solvent exposure of these amino acids correlates in turn with the rate of GDP release, suggesting for them either direct or indirect roles in the process of nucleotide switch. Collectively, nucleotide binding changes the information flow through the conserved Ras-like domain, where GDP enhances the flexibility of mechanically distinct portions involved in nucleotide switch, and favors long distance allosteric communication (in Gα proteins), compared to GTP

    Using least median of squares for structural superposition of flexible proteins

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The conventional superposition methods use an ordinary least squares (LS) fit for structural comparison of two different conformations of the same protein. The main problem of the LS fit that it is sensitive to outliers, i.e. large displacements of the original structures superimposed.</p> <p>Results</p> <p>To overcome this problem, we present a new algorithm to overlap two protein conformations by their atomic coordinates using a robust statistics technique: least median of squares (LMS). In order to effectively approximate the LMS optimization, the forward search technique is utilized. Our algorithm can automatically detect and superimpose the rigid core regions of two conformations with small or large displacements. In contrast, most existing superposition techniques strongly depend on the initial LS estimating for the entire atom sets of proteins. They may fail on structural superposition of two conformations with large displacements. The presented LMS fit can be considered as an alternative and complementary tool for structural superposition.</p> <p>Conclusion</p> <p>The proposed algorithm is robust and does not require any prior knowledge of the flexible regions. Furthermore, we show that the LMS fit can be extended to multiple level superposition between two conformations with several rigid domains. Our fit tool has produced successful superpositions when applied to proteins for which two conformations are known. The binary executable program for Windows platform, tested examples, and database are available from <url>https://engineering.purdue.edu/PRECISE/LMSfit</url>.</p

    Structural analysis of the PATZ1 BTB domain homodimer

    Get PDF
    PATZ1 is a ubiquitously expressed transcriptional repressor belonging to the ZBTB family that is functionally expressed in T lymphocytes. PATZ1 targets the CD8 gene in lymphocyte development and interacts with the p53 protein to control genes that are important in proliferation and in the DNA-damage response. PATZ1 exerts its activity through an N-terminal BTB domain that mediates dimerization and co-repressor interactions and a C-terminal zinc-finger motif-containing domain that mediates DNA binding. Here, the crystal structures of the murine and zebrafish PATZ1 BTB domains are reported at 2.3 and 1.8 Å resolution, respectively. The structures revealed that the PATZ1 BTB domain forms a stable homodimer with a lateral surface groove, as in other ZBTB structures. Analysis of the lateral groove revealed a large acidic patch in this region, which contrasts with the previously resolved basic co-repressor binding interface of BCL6. A large 30-amino-acid glycine- and alanine-rich central loop, which is unique to mammalian PATZ1 amongst all ZBTB proteins, could not be resolved, probably owing to its flexibility. Molecular-dynamics simulations suggest a contribution of this loop to modulation of the mammalian BTB dimerization interface

    Perturbation-Response Scanning Reveals Ligand Entry-Exit Mechanisms of Ferric Binding Protein

    Get PDF
    We study apo and holo forms of the bacterial ferric binding protein (FBP) which exhibits the so-called ferric transport dilemma: it uptakes iron from the host with remarkable affinity, yet releases it with ease in the cytoplasm for subsequent use. The observations fit the “conformational selection” model whereby the existence of a weakly populated, higher energy conformation that is stabilized in the presence of the ligand is proposed. We introduce a new tool that we term perturbation-response scanning (PRS) for the analysis of remote control strategies utilized. The approach relies on the systematic use of computational perturbation/response techniques based on linear response theory, by sequentially applying directed forces on single-residues along the chain and recording the resulting relative changes in the residue coordinates. We further obtain closed-form expressions for the magnitude and the directionality of the response. Using PRS, we study the ligand release mechanisms of FBP and support the findings by molecular dynamics simulations. We find that the residue-by-residue displacements between the apo and the holo forms, as determined from the X-ray structures, are faithfully reproduced by perturbations applied on the majority of the residues of the apo form. However, once the stabilizing ligand (Fe) is integrated to the system in holo FBP, perturbing only a few select residues successfully reproduces the experimental displacements. Thus, iron uptake by FBP is a favored process in the fluctuating environment of the protein, whereas iron release is controlled by mechanisms including chelation and allostery. The directional analysis that we implement in the PRS methodology implicates the latter mechanism by leading to a few distant, charged, and exposed loop residues. Upon perturbing these, irrespective of the direction of the operating forces, we find that the cap residues involved in iron release are made to operate coherently, facilitating release of the ion

    Cross-Over between Discrete and Continuous Protein Structure Space: Insights into Automatic Classification and Networks of Protein Structures

    Get PDF
    Structural classifications of proteins assume the existence of the fold, which is an intrinsic equivalence class of protein domains. Here, we test in which conditions such an equivalence class is compatible with objective similarity measures. We base our analysis on the transitive property of the equivalence relationship, requiring that similarity of A with B and B with C implies that A and C are also similar. Divergent gene evolution leads us to expect that the transitive property should approximately hold. However, if protein domains are a combination of recurrent short polypeptide fragments, as proposed by several authors, then similarity of partial fragments may violate the transitive property, favouring the continuous view of the protein structure space. We propose a measure to quantify the violations of the transitive property when a clustering algorithm joins elements into clusters, and we find out that such violations present a well defined and detectable cross-over point, from an approximately transitive regime at high structure similarity to a regime with large transitivity violations and large differences in length at low similarity. We argue that protein structure space is discrete and hierarchic classification is justified up to this cross-over point, whereas at lower similarities the structure space is continuous and it should be represented as a network. We have tested the qualitative behaviour of this measure, varying all the choices involved in the automatic classification procedure, i.e., domain decomposition, alignment algorithm, similarity score, and clustering algorithm, and we have found out that this behaviour is quite robust. The final classification depends on the chosen algorithms. We used the values of the clustering coefficient and the transitivity violations to select the optimal choices among those that we tested. Interestingly, this criterion also favours the agreement between automatic and expert classifications. As a domain set, we have selected a consensus set of 2,890 domains decomposed very similarly in SCOP and CATH. As an alignment algorithm, we used a global version of MAMMOTH developed in our group, which is both rapid and accurate. As a similarity measure, we used the size-normalized contact overlap, and as a clustering algorithm, we used average linkage. The resulting automatic classification at the cross-over point was more consistent than expert ones with respect to the structure similarity measure, with 86% of the clusters corresponding to subsets of either SCOP or CATH superfamilies and fewer than 5% containing domains in distinct folds according to both SCOP and CATH. Almost 15% of SCOP superfamilies and 10% of CATH superfamilies were split, consistent with the notion of fold change in protein evolution. These results were qualitatively robust for all choices that we tested, although we did not try to use alignment algorithms developed by other groups. Folds defined in SCOP and CATH would be completely joined in the regime of large transitivity violations where clustering is more arbitrary. Consistently, the agreement between SCOP and CATH at fold level was lower than their agreement with the automatic classification obtained using as a clustering algorithm, respectively, average linkage (for SCOP) or single linkage (for CATH). The networks representing significant evolutionary and structural relationships between clusters beyond the cross-over point may allow us to perform evolutionary, structural, or functional analyses beyond the limits of classification schemes. These networks and the underlying clusters are available at http://ub.cbm.uam.es/research/ProtNet.ph

    Rapid Sampling of Molecular Motions with Prior Information Constraints

    Get PDF
    Proteins are active, flexible machines that perform a range of different functions. Innovative experimental approaches may now provide limited partial information about conformational changes along motion pathways of proteins. There is therefore a need for computational approaches that can efficiently incorporate prior information into motion prediction schemes. In this paper, we present PathRover, a general setup designed for the integration of prior information into the motion planning algorithm of rapidly exploring random trees (RRT). Each suggested motion pathway comprises a sequence of low-energy clash-free conformations that satisfy an arbitrary number of prior information constraints. These constraints can be derived from experimental data or from expert intuition about the motion. The incorporation of prior information is very straightforward and significantly narrows down the vast search in the typically high-dimensional conformational space, leading to dramatic reduction in running time. To allow the use of state-of-the-art energy functions and conformational sampling, we have integrated this framework into Rosetta, an accurate protocol for diverse types of structural modeling. The suggested framework can serve as an effective complementary tool for molecular dynamics, Normal Mode Analysis, and other prevalent techniques for predicting motion in proteins. We applied our framework to three different model systems. We show that a limited set of experimentally motivated constraints may effectively bias the simulations toward diverse predicates in an outright fashion, from distance constraints to enforcement of loop closure. In particular, our analysis sheds light on mechanisms of protein domain swapping and on the role of different residues in the motion

    Antiinflammatory Therapy with Canakinumab for Atherosclerotic Disease

    Get PDF
    Background: Experimental and clinical data suggest that reducing inflammation without affecting lipid levels may reduce the risk of cardiovascular disease. Yet, the inflammatory hypothesis of atherothrombosis has remained unproved. Methods: We conducted a randomized, double-blind trial of canakinumab, a therapeutic monoclonal antibody targeting interleukin-1β, involving 10,061 patients with previous myocardial infarction and a high-sensitivity C-reactive protein level of 2 mg or more per liter. The trial compared three doses of canakinumab (50 mg, 150 mg, and 300 mg, administered subcutaneously every 3 months) with placebo. The primary efficacy end point was nonfatal myocardial infarction, nonfatal stroke, or cardiovascular death. RESULTS: At 48 months, the median reduction from baseline in the high-sensitivity C-reactive protein level was 26 percentage points greater in the group that received the 50-mg dose of canakinumab, 37 percentage points greater in the 150-mg group, and 41 percentage points greater in the 300-mg group than in the placebo group. Canakinumab did not reduce lipid levels from baseline. At a median follow-up of 3.7 years, the incidence rate for the primary end point was 4.50 events per 100 person-years in the placebo group, 4.11 events per 100 person-years in the 50-mg group, 3.86 events per 100 person-years in the 150-mg group, and 3.90 events per 100 person-years in the 300-mg group. The hazard ratios as compared with placebo were as follows: in the 50-mg group, 0.93 (95% confidence interval [CI], 0.80 to 1.07; P = 0.30); in the 150-mg group, 0.85 (95% CI, 0.74 to 0.98; P = 0.021); and in the 300-mg group, 0.86 (95% CI, 0.75 to 0.99; P = 0.031). The 150-mg dose, but not the other doses, met the prespecified multiplicity-adjusted threshold for statistical significance for the primary end point and the secondary end point that additionally included hospitalization for unstable angina that led to urgent revascularization (hazard ratio vs. placebo, 0.83; 95% CI, 0.73 to 0.95; P = 0.005). Canakinumab was associated with a higher incidence of fatal infection than was placebo. There was no significant difference in all-cause mortality (hazard ratio for all canakinumab doses vs. placebo, 0.94; 95% CI, 0.83 to 1.06; P = 0.31). Conclusions: Antiinflammatory therapy targeting the interleukin-1β innate immunity pathway with canakinumab at a dose of 150 mg every 3 months led to a significantly lower rate of recurrent cardiovascular events than placebo, independent of lipid-level lowering. (Funded by Novartis; CANTOS ClinicalTrials.gov number, NCT01327846.
    corecore