83 research outputs found

    Mechanical Strength of 17 134 Model Proteins and Cysteine Slipknots

    Get PDF
    A new theoretical survey of proteins' resistance to constant speed stretching is performed for a set of 17 134 proteins as described by a structure-based model. The proteins selected have no gaps in their structure determination and consist of no more than 250 amino acids. Our previous studies have dealt with 7510 proteins of no more than 150 amino acids. The proteins are ranked according to the strength of the resistance. Most of the predicted top-strength proteins have not yet been studied experimentally. Architectures and folds which are likely to yield large forces are identified. New types of potent force clamps are discovered. They involve disulphide bridges and, in particular, cysteine slipknots. An effective energy parameter of the model is estimated by comparing the theoretical data on characteristic forces to the corresponding experimental values combined with an extrapolation of the theoretical data to the experimental pulling speeds. These studies provide guidance for future experiments on single molecule manipulation and should lead to selection of proteins for applications. A new class of proteins, involving cystein slipknots, is identified as one that is expected to lead to the strongest force clamps known. This class is characterized through molecular dynamics simulations.Comment: 40 pages, 13 PostScript figure

    Better together: Elements of successful scientific software development in a distributed collaborative community

    Get PDF
    Many scientific disciplines rely on computational methods for data analysis, model generation, and prediction. Implementing these methods is often accomplished by researchers with domain expertise but without formal training in software engineering or computer science. This arrangement has led to underappreciation of sustainability and maintainability of scientific software tools developed in academic environments. Some software tools have avoided this fate, including the scientific library Rosetta. We use this software and its community as a case study to show how modern software development can be accomplished successfully, irrespective of subject area. Rosetta is one of the largest software suites for macromolecular modeling, with 3.1 million lines of code and many state-of-the-art applications. Since the mid 1990s, the software has been developed collaboratively by the RosettaCommons, a community of academics from over 60 institutions worldwide with diverse backgrounds including chemistry, biology, physiology, physics, engineering, mathematics, and computer science. Developing this software suite has provided us with more than two decades of experience in how to effectively develop advanced scientific software in a global community with hundreds of contributors. Here we illustrate the functioning of this development community by addressing technical aspects (like version control, testing, and maintenance), community-building strategies, diversity efforts, software dissemination, and user support. We demonstrate how modern computational research can thrive in a distributed collaborative community. The practices described here are independent of subject area and can be readily adopted by other software development communities

    High Throughput Method for Protein Structure Prediction

    Get PDF

    OUP accepted manuscript

    No full text
    37FINAL_PUBLISHED20AT_PUBLICATIONPublikacja bezkosztow

    Modeling of a putative programmed cell death receptor

    No full text
    Multicellularity is a process strictly coupled to the mechanisms of programmed cell death (PCD). A homology between well-studied eukaryotic PCD proteins (human Apaf-1, Caenorhabditis elegans Ced-4) and yet not fully characterized prokaryotic PCD-like proteins has been reported. Wrap1 is a transmembrane protein, which is a component of a putative PCD apparatus in a multicellular cyanobacterium Nostoc punctiforme. It is equipped with a highly-repetitive interaction-mediating β- propeller domain, which suggests its possible antibody-like role. Upon activation, Apaf-1 and Ced-4 to form apotosomes, which trigger signaling pathways in apoptosis and innate immunity. Due to homology, Wrap1 is also expected to oligomerize. This study has been focused on the possible state of NTPase domains of two proteins and Wrap1 using Rosetta and AlphaFold2. Additionally, control modeling for Apaf-1 and Ced-4 (for which the oligomerization states are known) has been performed. The obtained results might serve as indication of the most probable state for the Wrap1 protein and its potential interaction partner

    Automated Protein Secondary Structure Assignment from Cα Positions Using Neural Networks

    No full text
    The assignment of secondary structure elements in protein conformations is necessary to interpret a protein model that has been established by computational methods. The process essentially involves labeling the amino acid residues with H (Helix), E (Strand), or C (Coil, also known as Loop). When particular atoms are absent from an input protein structure, the procedure becomes more complicated, especially when only the alpha carbon locations are known. Various techniques have been tested and applied to this problem during the last forty years. The application of machine learning techniques is the most recent trend. This contribution presents the HECA classifier, which uses neural networks to assign protein secondary structure types. The technique exclusively employs Cα coordinates. The Keras (TensorFlow) library was used to implement and train the neural network model. The BioShell toolkit was used to calculate the neural network input features from raw coordinates. The study’s findings show that neural network-based methods may be successfully used to take on structure assignment challenges when only Cα trace is available. Thanks to the careful selection of input features, our approach’s accuracy (above 97%) exceeded that of the existing methods
    corecore