21 research outputs found
A Hierarchical Approach to Protein Molecular Evolution
Biological diversity has evolved despite the essentially infinite complexity
of protein sequence space. We present a hierarchical approach to the efficient
searching of this space and quantify the evolutionary potential of our approach
with Monte Carlo simulations. These simulations demonstrate that non-homologous
juxtaposition of encoded structure is the rate-limiting step in the production
of new tertiary protein folds. Non-homologous ``swapping'' of low energy
secondary structures increased the binding constant of a simulated protein by
relative to base substitution alone. Applications of our approach
include the generation of new protein folds and modeling the molecular
evolution of disease.Comment: 15 pages. 2 figures. LaTeX styl
Library Design in Combinatorial Chemistry by Monte Carlo Methods
Strategies for searching the space of variables in combinatorial chemistry
experiments are presented, and a random energy model of combinatorial chemistry
experiments is introduced. The search strategies, derived by analogy with the
computer modeling technique of Monte Carlo, effectively search the variable
space even in combinatorial chemistry experiments of modest size. Efficient
implementations of the library design and redesign strategies are feasible with
current experimental capabilities.Comment: 5 pages, 3 figure
Experimental Rugged Fitness Landscape in Protein Sequence Space
The fitness landscape in sequence space determines the process of biomolecular evolution. To plot the fitness landscape of protein function, we carried out in vitro molecular evolution beginning with a defective fd phage carrying a random polypeptide of 139 amino acids in place of the g3p minor coat protein D2 domain, which is essential for phage infection. After 20 cycles of random substitution at sites 12–130 of the initial random polypeptide and selection for infectivity, the selected phage showed a 1.7×10(4)-fold increase in infectivity, defined as the number of infected cells per ml of phage suspension. Fitness was defined as the logarithm of infectivity, and we analyzed (1) the dependence of stationary fitness on library size, which increased gradually, and (2) the time course of changes in fitness in transitional phases, based on an original theory regarding the evolutionary dynamics in Kauffman's n-k fitness landscape model. In the landscape model, single mutations at single sites among n sites affect the contribution of k other sites to fitness. Based on the results of these analyses, k was estimated to be 18–24. According to the estimated parameters, the landscape was plotted as a smooth surface up to a relative fitness of 0.4 of the global peak, whereas the landscape had a highly rugged surface with many local peaks above this relative fitness value. Based on the landscapes of these two different surfaces, it appears possible for adaptive walks with only random substitutions to climb with relative ease up to the middle region of the fitness landscape from any primordial or random sequence, whereas an enormous range of sequence diversity is required to climb further up the rugged surface above the middle region