459 research outputs found
Techniques for modeling and analyzing RNA and protein folding energy landscapes
RNA and protein molecules undergo a dynamic folding process that is important
to their function. Computational methods are critical for studying this folding pro-
cess because it is difficult to observe experimentally. In this work, we introduce
new computational techniques to study RNA and protein energy landscapes, includ-
ing a method to approximate an RNA energy landscape with a coarse graph (map)
and new tools for analyzing graph-based approximations of RNA and protein energy
landscapes. These analysis techniques can be used to study RNA and protein fold-
ing kinetics such as population kinetics, folding rates, and the folding of particular
subsequences. In particular, a map-based Master Equation (MME) method can be
used to analyze the population kinetics of the maps, while another map analysis tool,
map-based Monte Carlo (MMC) simulation, can extract stochastic folding pathways
from the map.
To validate the results, I compared our methods with other computational meth-
ods and with experimental studies of RNA and protein. I first compared our MMC
and MME methods for RNA with other computational methods working on the com-
plete energy landscape and show that the approximate map captures the major fea-
tures of a much larger (e.g., by orders of magnitude) complete energy landscape.
Moreover, I show that the methods scale well to large molecules, e.g., RNA with 200+ nucleotides. Then, I correlate the computational results with experimental
findings. I present comparisons with two experimental cases to show how I can pre-
dict kinetics-based functional rates of ColE1 RNAII and MS2 phage RNA and their
mutants using our MME and MMC tools respectively. I also show that the MME
and MMC tools can be applied to map-based approximations of protein energy energy
landscapes and present kinetics analysis results for several proteins
Techniques for modeling and analyzing RNA and protein folding energy landscapes
RNA and protein molecules undergo a dynamic folding process that is important
to their function. Computational methods are critical for studying this folding pro-
cess because it is difficult to observe experimentally. In this work, we introduce
new computational techniques to study RNA and protein energy landscapes, includ-
ing a method to approximate an RNA energy landscape with a coarse graph (map)
and new tools for analyzing graph-based approximations of RNA and protein energy
landscapes. These analysis techniques can be used to study RNA and protein fold-
ing kinetics such as population kinetics, folding rates, and the folding of particular
subsequences. In particular, a map-based Master Equation (MME) method can be
used to analyze the population kinetics of the maps, while another map analysis tool,
map-based Monte Carlo (MMC) simulation, can extract stochastic folding pathways
from the map.
To validate the results, I compared our methods with other computational meth-
ods and with experimental studies of RNA and protein. I first compared our MMC
and MME methods for RNA with other computational methods working on the com-
plete energy landscape and show that the approximate map captures the major fea-
tures of a much larger (e.g., by orders of magnitude) complete energy landscape.
Moreover, I show that the methods scale well to large molecules, e.g., RNA with 200+ nucleotides. Then, I correlate the computational results with experimental
findings. I present comparisons with two experimental cases to show how I can pre-
dict kinetics-based functional rates of ColE1 RNAII and MS2 phage RNA and their
mutants using our MME and MMC tools respectively. I also show that the MME
and MMC tools can be applied to map-based approximations of protein energy energy
landscapes and present kinetics analysis results for several proteins
Constraining Protein Structure Simulation with Aggregate Data Using Robotic Techniques
This approach to protein simulation uses Protein Data Bank information to construct useful, simple, geometric constraints that can be applied to a protein simulation. We compiled experimental data for proteins with between 30-90 residues and analyzed the relationship between their sizes, defined as the radius of a sphere that encloses the 3D structure; the maximum distance between any two residues and the number of residues in the protein. A significant relationship was found and the analysis was used to predict the ranges that the size and maximum distance between residues would have for a protein with a given number of residues. These ranges were used to constrain folding from secondary structures for proteins IROP and IHDD and, using a random path planning approach, produced results that were not terribly accurate, but quite fast, suggesting that the constraint would be most useful as an inexpensive addition to an existing technique
- …