
    Stochastic kinetics of viral capsid assembly based on detailed protein structures

    We present a generic computational framework for the simulation of viral capsid assembly which is quantitative and specific. Starting from PDB files containing atomic coordinates, the algorithm builds a coarse-grained description of protein oligomers based on graph rigidity. These reduced protein descriptions are used in an extended Gillespie algorithm to investigate the stochastic kinetics of the assembly process. The association rates are obtained from a diffusive Smoluchowski equation for rapid coagulation, modified to account for water shielding and protein structure. The dissociation rates are derived by interpreting the splitting of oligomers as a process of graph partitioning akin to the escape from a multidimensional well. This modular framework is quantitative yet computationally tractable, with a small number of physically motivated parameters. The methodology is illustrated using two different viruses which are shown to follow quantitatively different assembly pathways. We also show how in this model the quasi-stationary kinetics of assembly can be described as a Markovian cascading process in which only a few intermediates and a small proportion of pathways are present. The observed pathways and intermediates can be related a posteriori to structural and energetic properties of the capsid oligomers.
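
    As a rough illustration of the stochastic kinetics mentioned in this abstract, the sketch below runs the standard (direct-method) Gillespie algorithm on a toy reversible dimerization, 2M <-> D. It is not the extended algorithm of the paper: the species names, the function names `gillespie` and `simulate_dimerization`, and the rate constants `k_on` and `k_off` are hypothetical placeholders, and the rates are not derived from a Smoluchowski equation or from graph partitioning.

```python
import math
import random


def gillespie(state, reactions, t_end, rng):
    """Direct-method Gillespie simulation.

    state:     dict mapping species name -> integer copy number
    reactions: list of (propensity_function, change_dict) pairs
    Returns a list of (time, state_snapshot) tuples.
    """
    t = 0.0
    state = dict(state)
    trajectory = [(t, dict(state))]
    while True:
        propensities = [fn(state) for fn, _ in reactions]
        total = sum(propensities)
        if total <= 0.0:
            break  # no reaction can fire any more
        # Waiting time to the next event is exponential with rate `total`.
        t += -math.log(1.0 - rng.random()) / total
        if t > t_end:
            break
        # Pick a reaction with probability proportional to its propensity.
        threshold = rng.random() * total
        running = 0.0
        for p, (_, change) in zip(propensities, reactions):
            running += p
            if threshold < running:
                for species, delta in change.items():
                    state[species] += delta
                break
        trajectory.append((t, dict(state)))
    return trajectory


def simulate_dimerization(n_monomers, k_on, k_off, t_end, rng):
    """Toy model (assumption): monomers M associate into dimers D and back."""
    reactions = [
        # 2M -> D: number of distinct monomer pairs times k_on
        (lambda s: k_on * s["M"] * (s["M"] - 1) / 2.0, {"M": -2, "D": +1}),
        # D -> 2M: each dimer dissociates independently with rate k_off
        (lambda s: k_off * s["D"], {"M": +2, "D": -1}),
    ]
    return gillespie({"M": n_monomers, "D": 0}, reactions, t_end, rng)


if __name__ == "__main__":
    traj = simulate_dimerization(50, 0.01, 0.1, 5.0, random.Random(0))
    print(len(traj) - 1, "events; final state:", traj[-1][1])
```

    A capsid model would replace the two toy reactions with one association and one dissociation channel per pair of oligomers, but the event loop itself stays the same.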

    Towards Prediction of Metabolic Products of Polyketide Synthases: An In Silico Analysis

    Sequence data arising from an increasing number of partial and complete genome projects is revealing the presence of the polyketide synthase (PKS) family of genes not only in microbes and fungi but also in plants and other eukaryotes. PKSs are huge multifunctional megasynthases that use a variety of biosynthetic paradigms to generate enormously diverse arrays of polyketide products that possess several pharmaceutically important properties. The remarkable conservation of these gene clusters across organisms offers abundant scope for obtaining novel insights into the PKS biosynthetic code by computational analysis. We have carried out a comprehensive in silico analysis of modular and iterative gene clusters to test whether chemical structures of the secondary metabolites can be predicted from PKS protein sequences. Here, we report the success of our method and demonstrate the feasibility of deciphering the putative metabolic products of uncharacterized PKS clusters found in newly sequenced genomes. Profile Hidden Markov Model analysis has revealed distinct sequence features that can distinguish modular PKS proteins from their iterative counterparts. For iterative PKS proteins, structural models of iterative ketosynthase (KS) domains have revealed novel correlations between the size of the polyketide products and the volume of the active site pocket. Furthermore, we have identified key residues in the substrate binding pocket that control the number of chain extensions in iterative PKSs. For modular PKS proteins, we describe for the first time an automated method based on crucial intermolecular contacts that can distinguish the correct biosynthetic order of substrate channeling from a large number of non-cognate combinatorial possibilities. Taken together, our in silico analysis provides valuable clues for formulating rules for predicting polyketide products of iterative as well as modular PKS clusters. 
These results have promising potential for discovery of novel natural products by genome mining and rational design of novel natural products.

    Atomic-accuracy prediction of protein loop structures through an RNA-inspired ansatz

    Consistently predicting biopolymer structure at atomic resolution from sequence alone remains a difficult problem, even for small sub-segments of large proteins. Such loop prediction challenges, which arise frequently in comparative modeling and protein design, can become intractable as loop lengths exceed 10 residues and if surrounding side-chain conformations are erased. This article introduces a modeling strategy based on a 'stepwise ansatz', recently developed for RNA modeling, which posits that any realistic all-atom molecular conformation can be built up by residue-by-residue stepwise enumeration. When harnessed to a dynamic-programming-like recursion in the Rosetta framework, the resulting stepwise assembly (SWA) protocol enables enumerative sampling of a 12 residue loop at a significant but achievable cost of thousands of CPU-hours. In a previously established benchmark, SWA recovers crystallographic conformations with sub-Angstrom accuracy for 19 of 20 loops, compared to 14 of 20 by KIC modeling with a comparable expenditure of computational power. Furthermore, SWA gives high accuracy results on an additional set of 15 loops highlighted in the biological literature for their irregularity or unusual length. Successes include cis-Pro touch turns, loops that pass through tunnels of other side-chains, and loops of lengths up to 24 residues. Remaining problem cases are traced to inaccuracies in the Rosetta all-atom energy function. In five additional blind tests, SWA achieves sub-Angstrom accuracy models, including the first such success in a protein/RNA binding interface, the YbxF/kink-turn interaction in the fourth RNA-puzzle competition. 
These results establish all-atom enumeration as a systematic approach to protein structure that can leverage high performance computing and physically realistic energy functions to more consistently achieve atomic resolution. Comment: Identity of four-loop blind test protein and parts of Figure 5 have been omitted in this preprint to ensure confidentiality of the protein structure prior to its public release.

    Development of novel orthogonal genetic circuits, based on extracytoplasmic function (ECF) σ factors

    The synthetic biology field aims to apply the engineering 'design-build-test-learn' cycle for the implementation of synthetic genetic circuits modifying the behavior of biological systems. In order to reach this goal, synthetic biology projects use a set of fully characterized biological parts that subsequently are assembled into complex synthetic circuits following a rational, model-driven design. However, even though the bottom-up design approach represents an optimal starting point to assay the behavior of the synthetic circuits under defined conditions, the rational design of such circuits is often restricted by the limited number of available DNA building blocks. These usually consist only of a handful of transcriptional regulators that additionally are often borrowed from natural biological systems. This, in turn, can lead to cross-reactions between the synthetic circuit and the host cell and eventually to loss of the original circuit function. Thus, one of the challenges in synthetic biology is to design synthetic circuits that perform the designated functions with minimal cross-reactions (orthogonality). To overcome the restrictions of the widely used transcriptional regulators, this project aims to apply extracytoplasmic function (ECF) σ factors in the design of novel orthogonal synthetic circuits. ECFs are the smallest and simplest alternative σ factors that recognize highly specific promoters. ECFs represent one of the most important mechanisms of signal transduction in bacteria; indeed, their activity is often controlled by anti-σ factors. Even though it was shown that the overexpression of heterologous anti-σ factors can generate an adverse effect on cell growth, they represent an attractive solution to control ECF activity. Finally, thousands of ECF σ factors are known to date; they are widespread among different bacterial phyla and can be identified, together with their cognate promoters and anti-σ factors, using bioinformatic approaches. 
All the above-mentioned features make ECF σ factors optimal candidates as core orthogonal regulators for the design of novel synthetic circuits. In this project, in order to establish ECF σ factors as standard building blocks in the synthetic biology field, we first established a high throughput experimental setup. This relies on microplate reader experiments performed using a highly sensitive luminescent reporter system. Luminescent reporters have a superior signal-to-noise ratio when compared to fluorescent reporters since they do not suffer from the high auto-fluorescence background of the bacterial cell. However, they also have a drawback represented by the constant light emission that can generate undesired cross-talk between neighboring wells on a microplate. To overcome this limitation, we developed a computational algorithm that corrects for luminescence bleed-through and estimates the “true” luminescence activity for each well of a microplate. We show that the correcting algorithm preserves low-level signals close to the background and that it is universally applicable to different experimental conditions. In order to simplify the assembly of large ECF-based synthetic circuits, we designed an ECF toolbox in E. coli. The toolbox allows for the combinatorial assembly of circuits into expression vectors, using a library of reusable genetic parts. Moreover, it also offers the possibility of integrating the newly generated synthetic circuits into four different phage attachment (att) sites present in the genome of E. coli. This allows for a flawless transition between plasmid-encoded and chromosomally integrated genetic circuits, expanding the possible genetic configurations of a given synthetic construct. Moreover, our results demonstrate that the four att sites are orthogonal in terms of the gene expression levels of the synthetic circuits. 
With the aim of rationally designing ECF-based synthetic circuits and taking advantage of the ECF toolbox, we characterized the dynamic behavior of a set of 15 ECF σ factors, their cognate promoters, and their respective anti-σ factors. Overall, we found that ECFs are non-toxic and functional and that they display different binding affinities for the cognate target promoters. Moreover, our results show that it is possible to optimize the output dynamic range of the ECF-based switches by changing the copy number of the ECFs and target promoters, thereby tuning the input/output signal ratio. Next, by combining up to three ECF-switches, we generated a set of “genetic-timer circuits”, the first synthetic circuits harboring more than one ECF. ECF-based timer circuits sequentially activate a series of target genes with increasing time delays; moreover, the behavior of the circuits can be predicted by a set of mathematical models. In order to improve the dynamic response of the ECF-based constructs, we introduced anti-σ factors in our synthetic circuits. By doing so we first confirmed that anti-σ factors can exert an adverse effect on the growth of E. coli; we therefore explored possible solutions. Our results demonstrate that anti-σ factor toxicity can be partially alleviated by generating truncated, soluble variants of the anti-σ factors and, eventually, completely abolished via chromosomal integration of the anti-σ factor-based circuits. Finally, after demonstrating that anti-σ factors can be used to generate a tunable time delay between ECF expression and target promoter activation, we designed ECF/AS-suicide circuits. Such circuits allow for the time-delayed cell-death of E. coli and will serve as a prototype for the further development of ECF/AS-based lysis circuits.
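
The abstract above mentions an algorithm that corrects luminescence bleed-through between neighboring microplate wells but does not describe how it works. The following is therefore only a minimal sketch of one plausible approach, under stated assumptions: each well is assumed to receive a fixed fraction `alpha` of the true signal of its four directly adjacent wells, and the true signals are recovered by fixed-point (Jacobi) iteration. The linear model, the value of `alpha`, and the function names `apply_bleed` and `correct_bleed` are all hypothetical and are not taken from the thesis.

```python
def neighbors(r, c, rows, cols):
    """Yield the coordinates of the up/down/left/right neighbors of a well."""
    for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
        rr, cc = r + dr, c + dc
        if 0 <= rr < rows and 0 <= cc < cols:
            yield rr, cc


def apply_bleed(true, alpha):
    """Forward model (assumption): observed = true + alpha * sum of neighbor signals."""
    rows, cols = len(true), len(true[0])
    return [
        [
            true[r][c]
            + alpha * sum(true[rr][cc] for rr, cc in neighbors(r, c, rows, cols))
            for c in range(cols)
        ]
        for r in range(rows)
    ]


def correct_bleed(observed, alpha, iterations=200):
    """Invert the forward model by Jacobi iteration.

    Each sweep re-estimates every well as its observed value minus the
    bleed-through predicted from the current neighbor estimates. The
    iteration converges when alpha < 0.25 (at most four neighbors per well).
    """
    rows, cols = len(observed), len(observed[0])
    estimate = [row[:] for row in observed]
    for _ in range(iterations):
        estimate = [
            [
                observed[r][c]
                - alpha
                * sum(estimate[rr][cc] for rr, cc in neighbors(r, c, rows, cols))
                for c in range(cols)
            ]
            for r in range(rows)
        ]
    return estimate


if __name__ == "__main__":
    plate = [[0.0] * 4 for _ in range(3)]
    plate[1][2] = 1000.0  # one bright well
    plate[0][0] = 5.0     # one dim well far from the bright one
    measured = apply_bleed(plate, alpha=0.05)
    recovered = correct_bleed(measured, alpha=0.05)
    print(round(measured[1][1], 3), round(recovered[1][1], 3))
```

In practice the bleed-through fraction would have to be calibrated, for example from a plate containing a single bright well, and light may also reach wells beyond the nearest neighbors, which this sketch ignores.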