83 research outputs found
Modeling Protein Structure, Dynamics and Thermodynamics with Reduced Representation of Conformational Space
Mechanical Strength of 17 134 Model Proteins and Cysteine Slipknots
A new theoretical survey of proteins' resistance to constant speed stretching
is performed for a set of 17 134 proteins as described by a structure-based
model. The proteins selected have no gaps in their structure determination and
consist of no more than 250 amino acids. Our previous studies have dealt with
7510 proteins of no more than 150 amino acids. The proteins are ranked
according to the strength of the resistance. Most of the predicted top-strength
proteins have not yet been studied experimentally. Architectures and folds
which are likely to yield large forces are identified. New types of potent
force clamps are discovered. They involve disulphide bridges and, in
particular, cysteine slipknots. An effective energy parameter of the model is
estimated by comparing the theoretical data on characteristic forces to the
corresponding experimental values combined with an extrapolation of the
theoretical data to the experimental pulling speeds. These studies provide
guidance for future experiments on single molecule manipulation and should lead
to selection of proteins for applications. A new class of proteins, involving
cystein slipknots, is identified as one that is expected to lead to the
strongest force clamps known. This class is characterized through molecular
dynamics simulations.Comment: 40 pages, 13 PostScript figure
Better together: Elements of successful scientific software development in a distributed collaborative community
Many scientific disciplines rely on computational methods for data analysis, model generation, and prediction. Implementing these methods is often accomplished by researchers with domain expertise but without formal training in software engineering or computer science. This arrangement has led to underappreciation of sustainability and maintainability of scientific software tools developed in academic environments. Some software tools have avoided this fate, including the scientific library Rosetta. We use this software and its community as a case study to show how modern software development can be accomplished successfully, irrespective of subject area. Rosetta is one of the largest software suites for macromolecular modeling, with 3.1 million lines of code and many state-of-the-art applications. Since the mid 1990s, the software has been developed collaboratively by the RosettaCommons, a community of academics from over 60 institutions worldwide with diverse backgrounds including chemistry, biology, physiology, physics, engineering, mathematics, and computer science. Developing this software suite has provided us with more than two decades of experience in how to effectively develop advanced scientific software in a global community with hundreds of contributors. Here we illustrate the functioning of this development community by addressing technical aspects (like version control, testing, and maintenance), community-building strategies, diversity efforts, software dissemination, and user support. We demonstrate how modern computational research can thrive in a distributed collaborative community. The practices described here are independent of subject area and can be readily adopted by other software development communities
Modeling of a putative programmed cell death receptor
Multicellularity is a process strictly coupled to the mechanisms of programmed cell death (PCD). A homology between well-studied eukaryotic PCD proteins (human Apaf-1, Caenorhabditis elegans Ced-4) and yet not fully characterized prokaryotic PCD-like proteins has been reported. Wrap1 is a transmembrane protein, which is a component of a putative PCD apparatus in a multicellular cyanobacterium Nostoc punctiforme. It is equipped with a highly-repetitive interaction-mediating β- propeller domain, which suggests its possible antibody-like role. Upon activation, Apaf-1 and Ced-4 to form apotosomes, which trigger signaling pathways in apoptosis and innate immunity. Due to homology, Wrap1 is also expected to oligomerize. This study has been focused on the possible state of NTPase domains of two proteins and Wrap1 using Rosetta and AlphaFold2. Additionally, control modeling for Apaf-1 and Ced-4 (for which the oligomerization states are known) has been performed. The obtained results might serve as indication of the most probable state for the Wrap1 protein and its potential interaction partner
Automated Protein Secondary Structure Assignment from Cα Positions Using Neural Networks
The assignment of secondary structure elements in protein conformations is necessary to interpret a protein model that has been established by computational methods. The process essentially involves labeling the amino acid residues with H (Helix), E (Strand), or C (Coil, also known as Loop). When particular atoms are absent from an input protein structure, the procedure becomes more complicated, especially when only the alpha carbon locations are known. Various techniques have been tested and applied to this problem during the last forty years. The application of machine learning techniques is the most recent trend. This contribution presents the HECA classifier, which uses neural networks to assign protein secondary structure types. The technique exclusively employs Cα coordinates. The Keras (TensorFlow) library was used to implement and train the neural network model. The BioShell toolkit was used to calculate the neural network input features from raw coordinates. The study’s findings show that neural network-based methods may be successfully used to take on structure assignment challenges when only Cα trace is available. Thanks to the careful selection of input features, our approach’s accuracy (above 97%) exceeded that of the existing methods
- …
