40 research outputs found

    MIRRAGGE – Minimum Information Required for Reproducible AGGregation Experiments

    Reports on phase separation and amyloid formation for multiple proteins and aggregation-prone peptides are recurrently used to explore the molecular mechanisms associated with several human diseases. The information conveyed by these reports can be used directly in translational investigation, e.g., for the design of better drug screening strategies, or be compiled in databases for benchmarking novel aggregation-predicting algorithms. Given that minute protocol variations determine different outcomes of protein aggregation assays, there is a strong urge for standardized descriptions of the different types of aggregates and the detailed methods used in their production. In an attempt to address this need, we assembled the Minimum Information Required for Reproducible Aggregation Experiments (MIRRAGGE) guidelines, considering first-principles and the established literature on protein self-assembly and aggregation. This consensus information aims to cover the major and subtle determinants of experimental reproducibility while avoiding excessive technical details that are of limited practical interest for non-specialized users. The MIRRAGGE table (template available in Supplementary Information) is useful as a guide for the design of new studies and as a checklist during submission of experimental reports for publication. 
Full disclosure of relevant information also enables other researchers to reproduce results correctly and facilitates systematic data deposition into curated databases. This work was supported by (i) the European Regional Development Fund (ERDF) through the COMPETE 2020 Operational Programme for Competitiveness and Internationalisation (POCI), Portugal 2020, and by Portuguese funds through FCT, Fundação para a Ciência e a Tecnologia (FCT/MCTES), in the framework of grants POCI-01-0145-FEDER-031173, POCI-01-0145-FEDER-007274, POCI-01-0145-FEDER-031323 (“Institute for Research and Innovation in Health Sciences”), UID/Multi/04046/2013 (BioISI) and PTDC/NEUNMC/2138/2014 (to CMG). SV was funded by the Spanish Ministry of Economy and Competitiveness (BIO2016-78310-R) and by ICREA (ICREA-Academia 2015). ZG and ZB were funded by the Slovak research agencies VEGA (02/0145/17, 02/0030/18) and APVV (APVV-18-0284). RS was funded by VEGA 02/0163/19. DEO was funded by the Lundbeck Foundation (grant no. R276-2018-671) and the Independent Research Foundation Denmark | Natural Sciences (grant no. 8021-00208B). AP's research was supported by the UK Dementia Research Institute (RE1 3556) and by ARUK (ARUK-PG2019B-020).

    Improved general regression network for protein domain boundary prediction

    Background: Protein domains present some of the most useful information that can be used to understand protein structure and function. Recent research on protein domain boundary prediction has been mainly based on widely known machine learning techniques, such as Artificial Neural Networks and Support Vector Machines. In this study, we propose a new machine learning model (IGRN) that can achieve accurate and reliable classification with significantly reduced computation. The IGRN was trained using a PSSM (Position Specific Scoring Matrix), secondary structure, solvent accessibility information and an inter-domain linker index to detect possible domain boundaries for a target sequence. Results: The proposed model achieved an average prediction accuracy of 67% on the Benchmark_2 dataset for domain boundary identification in multi-domain proteins and showed superior predictive performance and generalisation ability among the most widely used neural network models. On the CASP7 benchmark dataset, it reached 70.10% prediction accuracy, comparable to existing domain boundary predictors such as DOMpro, DomPred, DomSSEA, DomCut and DomainDiscovery. Conclusion: The proposed model compares favourably with other existing machine learning based methods as well as with widely known domain boundary predictors on two benchmark datasets, and excels in the identification of domain boundaries in terms of model bias, generalisation and computational requirements. © 2008 Yoo et al; licensee BioMed Central Ltd.
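For readers unfamiliar with the model family, the following is a minimal sketch of a plain general regression network (a Gaussian-kernel weighted average of training targets), not the authors' IGRN. The abstract does not describe the improvements, the feature encoding or the decision threshold, so the function name `grnn_predict`, the smoothing parameter `sigma`, the 0.5 cut-off and the toy two-dimensional feature vectors are all assumptions made for illustration.

```python
import math


def grnn_predict(train_x, train_y, query, sigma=0.5):
    """Return the kernel-weighted average of train_y for one query vector.

    Hypothetical helper: a basic general regression network, where each
    training pattern contributes a Gaussian weight based on its squared
    Euclidean distance to the query.
    """
    weights = []
    for x in train_x:
        sq_dist = sum((a - b) ** 2 for a, b in zip(x, query))
        weights.append(math.exp(-sq_dist / (2.0 * sigma ** 2)))
    total = sum(weights)
    if total == 0.0:
        # Every kernel underflowed; fall back to the mean training target.
        return sum(train_y) / len(train_y)
    return sum(w * y for w, y in zip(weights, train_y)) / total


# Toy data (assumed): two features per residue window, label 1.0 = boundary.
train_x = [[0.0, 0.0], [0.1, 0.2], [1.0, 1.0], [0.9, 1.1]]
train_y = [0.0, 0.0, 1.0, 1.0]

score = grnn_predict(train_x, train_y, [0.95, 1.0])
is_boundary = score >= 0.5  # assumed threshold, not taken from the paper
```

Because the prediction is a single pass over stored patterns with no iterative weight fitting, a network of this kind needs little computation to train; whether that is the source of the reduced computation reported in the abstract is not stated there.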

    A microscale protein NMR sample screening pipeline

    As part of efforts to develop improved methods for NMR protein sample preparation and structure determination, the Northeast Structural Genomics Consortium (NESG) has implemented an NMR screening pipeline for protein target selection, construct optimization, and buffer optimization, incorporating efficient microscale NMR screening of proteins using a micro-cryoprobe. The process is feasible because the newest generation probe requires only small amounts of protein, typically 30–200 μg in 8–35 μl volume. Extensive automation has been made possible by the combination of database tools, mechanization of key process steps, and the use of a micro-cryoprobe that gives excellent data while requiring little optimization and manual setup. In this perspective, we describe the overall process used by the NESG for screening NMR samples as part of a sample optimization process, assessing optimal construct design and solution conditions, as well as for determining protein rotational correlation times in order to assess protein oligomerization states. Database infrastructure has been developed to allow for flexible implementation of new screening protocols and harvesting of the resulting output. The NESG micro NMR screening pipeline has also been used for detergent screening of membrane proteins. Descriptions of the individual steps in the NESG NMR sample design, production, and screening pipeline are presented in the format of a standard operating procedure

    Rules Governing Selective Protein Carbonylation

    BACKGROUND: Carbonyl derivatives are mainly formed by direct metal-catalysed oxidation (MCO) attacks on the amino-acid side chains of proline, arginine, lysine and threonine residues. For reasons unknown, only some proteins are prone to carbonylation. METHODOLOGY/PRINCIPAL FINDINGS: We used mass spectrometry analysis to identify carbonylated sites in BSA that had undergone in vitro MCO and in 23 carbonylated proteins in Escherichia coli. The presence of a carbonylated site rendered the neighbouring carbonylatable site more prone to carbonylation. Most carbonylated sites were present within hot spots of carbonylation. These observations led us to suggest rules for identifying sites more prone to carbonylation. We used these rules to design an in silico model (available at http://www.lcb.cnrs-mrs.fr/CSPD/), allowing an effective and accurate prediction of sites and of proteins more prone to carbonylation in the E. coli proteome. CONCLUSIONS/SIGNIFICANCE: We observed that proteins evolve to either selectively maintain or lose predicted hot spots of carbonylation depending on their biological function. As our predictive model also allows efficient detection of carbonylated proteins in Bacillus subtilis, we believe that our model may be extended to direct MCO attacks in all organisms.
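The idea of a carbonylation hot spot can be illustrated with a rough sketch that flags sequence windows dense in the four carbonylatable residues named above (R, K, P, T). This is not the authors' CSPD model: the abstract does not state the actual rules, so the function names, the window length of 10 and the minimum of 4 sites per window are hypothetical choices made only for this example.

```python
CARBONYLATABLE = set("RKPT")  # Arg, Lys, Pro, Thr, as listed in the abstract


def carbonylatable_positions(seq):
    """Return 0-based indices of residues that can be carbonylated."""
    return [i for i, aa in enumerate(seq.upper()) if aa in CARBONYLATABLE]


def candidate_hot_spots(seq, window=10, min_sites=4):
    """Return merged (start, end) half-open spans dense in R/K/P/T.

    Hypothetical rule: a window is flagged when it holds at least
    `min_sites` carbonylatable residues; overlapping or touching
    flagged windows are merged into one span.
    """
    seq = seq.upper()
    spans = []
    for start in range(0, max(len(seq) - window + 1, 1)):
        chunk = seq[start:start + window]
        count = sum(aa in CARBONYLATABLE for aa in chunk)
        if count >= min_sites:
            end = start + len(chunk)
            if spans and start <= spans[-1][1]:
                spans[-1] = (spans[-1][0], end)  # extend the previous span
            else:
                spans.append((start, end))
    return spans


# Toy sequence (assumed): a cluster of six carbonylatable residues in poly-Ala.
toy = "AAAAAAAAAARKPTRKAAAAAAAAAAAA"
sites = carbonylatable_positions(toy)
spots = candidate_hot_spots(toy)
```

A density scan of this kind captures only the clustering aspect; the reported effect of an already carbonylated site on its neighbour would need experimental site data that a sequence-only sketch does not have.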

    Multiple Routes and Milestones in the Folding of HIV–1 Protease Monomer

    Proteins fold on a time scale incompatible with a mechanism of random search in conformational space thus indicating that somehow they are guided to the native state through a funneled energetic landscape. At the same time the heterogeneous kinetics suggests the existence of several different folding routes. Here we propose a scenario for the folding mechanism of the monomer of HIV–1 protease in which multiple pathways and milestone events coexist. A variety of computational approaches supports this picture. These include very long all-atom molecular dynamics simulations in explicit solvent, an analysis of the network of clusters found in multiple high-temperature unfolding simulations and a complete characterization of free-energy surfaces carried out using a structure-based potential at atomistic resolution and a combination of metadynamics and parallel tempering. Our results confirm that the monomer in solution is stable toward unfolding and show that at least two unfolding pathways exist. In our scenario, the formation of a hydrophobic core is a milestone in the folding process which must occur along all the routes that lead this protein towards its native state. Furthermore, the ensemble of folding pathways proposed here substantiates a rational drug design strategy based on inhibiting the folding of HIV–1 protease

    An Evolutionary Trade-Off between Protein Turnover Rate and Protein Aggregation Favors a Higher Aggregation Propensity in Fast Degrading Proteins

    We previously showed the existence of selective pressure against protein aggregation by the enrichment of aggregation-opposing ‘gatekeeper’ residues at strategic places along the sequence of proteins. Here we analyzed the relationship between protein lifetime and protein aggregation by combining experimentally determined turnover rates, expression data, structural data and chaperone interaction data on a set of more than 500 proteins. We find that selective pressure on protein sequences against aggregation is not homogeneous: short-living proteins on average have a higher aggregation propensity and fewer chaperone interactions than long-living proteins. We also find that short-living proteins are more often associated with deposition diseases. These findings suggest that the efficient degradation of high-turnover proteins is sufficient to preclude aggregation, but also that factors that inhibit proteasomal activity, such as physiological ageing, will primarily affect the aggregation of short-living proteins.

    Amyloidogenic Regions and Interaction Surfaces Overlap in Globular Proteins Related to Conformational Diseases

    Protein aggregation underlies a wide range of human disorders. The polypeptides involved in these pathologies might be intrinsically unstructured or display a defined 3D-structure. Little is known about how globular proteins aggregate into toxic assemblies under physiological conditions, where they display an initially folded conformation. Protein aggregation is, however, always initiated by the establishment of anomalous protein-protein interactions. Therefore, in the present work, we have explored the extent to which protein interaction surfaces and aggregation-prone regions overlap in globular proteins associated with conformational diseases. Computational analysis of the native complexes formed by these proteins shows that aggregation-prone regions do frequently overlap with protein interfaces. The spatial coincidence of interaction sites and aggregating regions suggests that the formation of functional complexes and the aggregation of their individual subunits might compete in the cell. Accordingly, single mutations affecting complex interface or stability usually result in the formation of toxic aggregates. It is suggested that the stabilization of existing interfaces in multimeric proteins or the formation of new complexes in monomeric polypeptides might become effective strategies to prevent disease-linked aggregation of globular proteins