Predicting recombinant protein expression experiments using molecular dynamics simulation

Abstract

Soluble expression of de novo-designed proteins in Escherichia coli (E. coli) remains empirical. For given experimental conditions expression success is determined in part by protein primary sequence. This has been previously explored with varying success using a variety of statistical solubility prediction tools though without taking fold stability into account. In the present study, the three-dimensional structure of proteins in molecular dynamics (MD) simulations is used to predict expression as a new approach with a set of four-helix bundles. Stability-related parameters for ten structures were determined in a thermal unfolding MD simulation and used to build statistical models with a support vector machine (SVM) classifier. The most accurate models were identified by their performance on five independent four-helix bundle sequences. The final model provided accurate classification prediction for this test set and was successfully applied in a model challenge with two newly designed sequences. The combination of simulation-derived parameters and an SVM classifier has potential to predict recombinant expression outcome for this set of four-helix bundles. With further development, this approach of utilizing higher-dimensional protein structural information to predict expression may have potential to advance recombinant biotechnology through modern computational and statistical science

Similar works

Full text

thumbnail-image

UQ eSpace (University of Queensland)

redirect
Last time updated on 04/08/2016

This paper was published in UQ eSpace (University of Queensland).

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.