10,313 research outputs found
Stochastic kinetics of viral capsid assembly based on detailed protein structures
We present a generic computational framework for the simulation of viral
capsid assembly which is quantitative and specific. Starting from PDB files
containing atomic coordinates, the algorithm builds a coarse grained
description of protein oligomers based on graph rigidity. These reduced protein
descriptions are used in an extended Gillespie algorithm to investigate the
stochastic kinetics of the assembly process. The association rates are obtained
from a diffusive Smoluchowski equation for rapid coagulation, modified to
account for water shielding and protein structure. The dissociation rates are
derived by interpreting the splitting of oligomers as a process of graph
partitioning akin to the escape from a multidimensional well. This modular
framework is quantitative yet computationally tractable, with a small number of
physically motivated parameters. The methodology is illustrated using two
different viruses which are shown to follow quantitatively different assembly
pathways. We also show how in this model the quasi-stationary kinetics of
assembly can be described as a Markovian cascading process in which only a few
intermediates and a small proportion of pathways are present. The observed
pathways and intermediates can be related a posteriori to structural and
energetic properties of the capsid oligomers
Towards Prediction of Metabolic Products of Polyketide Synthases: An In Silico Analysis
Sequence data arising from an increasing number of partial and complete genome projects is revealing the presence of the polyketide synthase (PKS) family of genes not only in microbes and fungi but also in plants and other eukaryotes. PKSs are huge multifunctional megasynthases that use a variety of biosynthetic paradigms to generate enormously diverse arrays of polyketide products that posses several pharmaceutically important properties. The remarkable conservation of these gene clusters across organisms offers abundant scope for obtaining novel insights into PKS biosynthetic code by computational analysis. We have carried out a comprehensive in silico analysis of modular and iterative gene clusters to test whether chemical structures of the secondary metabolites can be predicted from PKS protein sequences. Here, we report the success of our method and demonstrate the feasibility of deciphering the putative metabolic products of uncharacterized PKS clusters found in newly sequenced genomes. Profile Hidden Markov Model analysis has revealed distinct sequence features that can distinguish modular PKS proteins from their iterative counterparts. For iterative PKS proteins, structural models of iterative ketosynthase (KS) domains have revealed novel correlations between the size of the polyketide products and volume of the active site pocket. Furthermore, we have identified key residues in the substrate binding pocket that control the number of chain extensions in iterative PKSs. For modular PKS proteins, we describe for the first time an automated method based on crucial intermolecular contacts that can distinguish the correct biosynthetic order of substrate channeling from a large number of non-cognate combinatorial possibilities. Taken together, our in silico analysis provides valuable clues for formulating rules for predicting polyketide products of iterative as well as modular PKS clusters. These results have promising potential for discovery of novel natural products by genome mining and rational design of novel natural products
Atomic-accuracy prediction of protein loop structures through an RNA-inspired ansatz
Consistently predicting biopolymer structure at atomic resolution from
sequence alone remains a difficult problem, even for small sub-segments of
large proteins. Such loop prediction challenges, which arise frequently in
comparative modeling and protein design, can become intractable as loop lengths
exceed 10 residues and if surrounding side-chain conformations are erased. This
article introduces a modeling strategy based on a 'stepwise ansatz', recently
developed for RNA modeling, which posits that any realistic all-atom molecular
conformation can be built up by residue-by-residue stepwise enumeration. When
harnessed to a dynamic-programming-like recursion in the Rosetta framework, the
resulting stepwise assembly (SWA) protocol enables enumerative sampling of a 12
residue loop at a significant but achievable cost of thousands of CPU-hours. In
a previously established benchmark, SWA recovers crystallographic conformations
with sub-Angstrom accuracy for 19 of 20 loops, compared to 14 of 20 by KIC
modeling with a comparable expenditure of computational power. Furthermore, SWA
gives high accuracy results on an additional set of 15 loops highlighted in the
biological literature for their irregularity or unusual length. Successes
include cis-Pro touch turns, loops that pass through tunnels of other
side-chains, and loops of lengths up to 24 residues. Remaining problem cases
are traced to inaccuracies in the Rosetta all-atom energy function. In five
additional blind tests, SWA achieves sub-Angstrom accuracy models, including
the first such success in a protein/RNA binding interface, the YbxF/kink-turn
interaction in the fourth RNA-puzzle competition. These results establish
all-atom enumeration as a systematic approach to protein structure that can
leverage high performance computing and physically realistic energy functions
to more consistently achieve atomic resolution.Comment: Identity of four-loop blind test protein and parts of figures 5 have
been omitted in this preprint to ensure confidentiality of the protein
structure prior to its public releas
Development of novel orthogonal genetic circuits, based on extracytoplasmic function (ECF) σ factors
The synthetic biology field aims to apply the engineering 'design-build-test-learn' cycle for the implementation of synthetic genetic circuits modifying the behavior of biological systems. In order to reach this goal, synthetic biology projects use a set of fully characterized biological parts that subsequently are assembled into complex synthetic circuits following a rational, model-driven design. However, even though the bottom-up design approach represents an optimal starting point to assay the behavior of the synthetic circuits under defined conditions, the rational design of such circuits is often restricted by the limited number of available DNA building blocks. These usually consist only of a handful of transcriptional regulators that additionally are often borrowed from natural biological systems. This, in turn, can lead to cross-reactions between the synthetic circuit and the host cell and eventually to loss of the original circuit function. Thus, one of the challenges in synthetic biology is to design synthetic circuits that perform the designated functions with minor cross-reactions (orthogonality).
To overcome the restrictions of the widely used transcriptional regulators, this project aims to apply extracytoplasmic function (ECF) σ factors in the design novel orthogonal synthetic circuits. ECFs are the smallest and simplest alternative σ factors that recognize highly specific promoters. ECFs represent one of the most important mechanisms of signal transduction in bacteria, indeed, their activity is often controlled by anti-σ factors. Even though it was shown that the overexpression of heterologous anti-σ factors can generate an adverse effect on cell growth, they represent an attractive solution to control ECF activity. Finally, to date, we know thousands of ECF σ factors, widespread among different bacterial phyla, that are identifiable together with the cognate promoters and anti-σ factors, using bioinformatic approaches.
All the above-mentioned features make ECF σ factors optimal candidates as core orthogonal regulators for the design of novel synthetic circuits. In this project, in order to establish ECF σ factors as standard building blocks in the synthetic biology field, we first established a high throughput experimental setup. This relies on microplate reader experiments performed using a highly sensitive luminescent reporter system. Luminescent reporters have a superior signal-to-noise ratio when compared to fluorescent reporters since they do not suffer from the high auto-fluorescence background of the bacterial cell. However, they also have a drawback represented by the constant light emission that can generate undesired cross-talk between neighboring wells on a microplate. To overcome this limitation, we developed a computational algorithm that corrects for luminescence bleed-through and estimates the “true” luminescence activity for each well of a microplate. We show that the correcting algorithm preserves low-level signals close to the background and that it is universally applicable to different experimental conditions.
In order to simplify the assembly of large ECF-based synthetic circuits, we designed an ECF toolbox in E. coli. The toolbox allows for the combinatorial assembly of circuits into expression vectors, using a library of reusable genetic parts. Moreover, it also offers the possibility of integrating the newly generated synthetic circuits into four different phage attachment (att) sites present in the genome of E. coli. This allows for a flawless transition between plasmid-encoded and chromosomally integrated genetic circuits, expanding the possible genetic configurations of a given synthetic construct. Moreover, our results demonstrate that the four att sites are orthogonal in terms of the gene expression levels of the synthetic circuits.
With the purpose of rationally design ECF-based synthetic circuits and taking advantage of the ECF toolbox, we characterized the dynamic behavior of a set of 15 ECF σ factors, their cognate promoters, and relative anti-σs. Overall, we found that ECFs are non-toxic and functional and that they display different binding affinities for the cognate target promoters. Moreover, our results show that it is possible to optimize the output dynamic range of the ECF-based switches by changing the copy number of the ECFs and target promoters, thus, tuning the input/output signal ratio. Next, by combining up to three ECF-switches, we generated a set of “genetic-timer circuits”, the first synthetic circuits harboring more than one ECF. ECF-based timer circuits sequentially activate a series of target genes with increasing time delays, moreover, the behavior of the circuits can be predicted by a set of mathematical models.
In order to improve the dynamic response of the ECF-based constructs, we introduced anti-σ factors in our synthetic circuits. By doing so we first confirmed that anti-σ factors can exert an adverse effect on the growth of E. coli, thus we explored possible solutions. Our results demonstrate that anti-σ factors toxicity can be partially alleviated by generating truncated, soluble variants of the anti-σ factors and, eventually, completely abolished via chromosomal integration of the anti-σ factor-based circuits. Finally, after demonstrating that anti-σ factors can be used to generate a tunable time delay among ECF expression and target promoter activation, we designed ECF/AS-suicide circuits. Such circuits allow for the time-delayed cell-death of E. coli and will serve as a prototype for the further development of ECF/AS-based lysis circuits
- …