965 research outputs found
Conformational Dynamics of Supramolecular Protein Assemblies in the EMDB
The Electron Microscopy Data Bank (EMDB) is a rapidly growing repository for
the dissemination of structural data from single-particle reconstructions of
supramolecular protein assemblies including motors, chaperones, cytoskeletal
assemblies, and viral capsids. While the static structure of these assemblies
provides essential insight into their biological function, their conformational
dynamics and mechanics provide additional important information regarding the
mechanism of their biological function. Here, we present an unsupervised
computational framework to analyze and store for public access the
conformational dynamics of supramolecular protein assemblies deposited in the
EMDB. Conformational dynamics are analyzed using normal mode analysis in the
finite element framework, which is used to compute equilibrium thermal
fluctuations, cross-correlations in molecular motions, and strain energy
distributions for 452 of the 681 entries stored in the EMDB at present. Results
for the viral capsid of hepatitis B, ribosome-bound termination factor RF2, and
GroEL are presented in detail and validated with all-atom based models. The
conformational dynamics of protein assemblies in the EMDB may be useful in the
interpretation of their biological function, as well as in the classification
and refinement of EM-based structures.Comment: Associated online data bank available at:
http://lcbb.mit.edu/~em-nmdb
An adaptive methodology to discretize and select features
A lot of significant data describing the behavior or/and actions of systems can be collected in several domains. These data
define some aspects, called features, that can be clustered in several classes. A qualitative or quantitative value for each
feature is stored from measurements or observations. In this paper, the problem of finding independent features for getting
the best accuracy on classification problems is considered. Obtaining these features is the main objective of this work,
where an automatic method to select features is proposed. The method extends the functionality of Ameva coefficient to
use it in other tasks of machine learning where it has not been defined.Ministerio de Ciencia e Innovación ARTEMISA TIN2009-14378-C02-01Junta de Andalucia Simon TIC-805
Unsupervised landmark analysis for jump detection in molecular dynamics simulations
Molecular dynamics is a versatile and powerful method to study diffusion in
solid-state ionic conductors, requiring minimal prior knowledge of equilibrium
or transition states of the system's free energy surface. However, the analysis
of trajectories for relevant but rare events, such as a jump of the diffusing
mobile ion, is still rather cumbersome, requiring prior knowledge of the
diffusive process in order to get meaningful results. In this work, we present
a novel approach to detect the relevant events in a diffusive system without
assuming prior information regarding the underlying process. We start from a
projection of the atomic coordinates into a landmark basis to identify the
dominant features in a mobile ion's environment. Subsequent clustering in
landmark space enables a discretization of any trajectory into a sequence of
distinct states. As a final step, the use of the smooth overlap of atomic
positions descriptor allows distinguishing between different environments in a
straightforward way. We apply this algorithm to ten Li-ionic systems and
conduct in-depth analyses of cubic LiLaZrO, tetragonal
LiGePS, and the -eucryptite LiAlSiO. We
compare our results to existing methods, underscoring strong points,
weaknesses, and insights into the diffusive behavior of the ionic conduction in
the materials investigated
Application Of Bayesian Networks In Consumer Service Industry
Gao, Yuan. M.S.I.E., Purdue University. December 2014. Application of Bayesian Networks in Consumer Service Industry. Major professor: Vincent G. Duffy The purpose of the present study is to explore the application of Bayesian networks in the consumer service industry to model causal relationships within complex risk factor structures using aggregate data. An analysis of the Hawaii tourism market was conducted to find out how visitor characteristics affect their behavior and experience as consumers during the trips, and influence the tourism market outcomes represented by measurable factors. Two hypotheses were proposed regarding the use of aggregate data and the influence of visitor origin, and were verified through the analysis. The source data came from the Hawaii Tourism Authority\u27s official website, including monthly tourists highlight reports over a period of 36 months. The analysis verified the hypotheses that visitor origin, as a symbol of cultural background, plays an important role in their behavior, preferences, decisions and experience in consuming. The results were validated both statistically and against literature and expert opinion. In the increasingly segmented tourism market, such findings can help tourism service providers improve consumer satisfaction and loyalty with assistance in policy-making, investment decision-making, resource planning, and strategic marketing
Collective variables between large-scale states in turbulent convection
The dynamics in a confined turbulent convection flow is dominated by multiple
long-lived macroscopic circulation states, which are visited subsequently by
the system in a Markov-type hopping process. In the present work, we analyze
the short transition paths between these subsequent macroscopic system states
by a data-driven learning algorithm that extracts the low-dimensional
transition manifold and the related new coordinates, which we term collective
variables, in the state space of the complex turbulent flow. We therefore
transfer and extend concepts for conformation transitions in stochastic
microscopic systems, such as in the dynamics of macromolecules, to a
deterministic macroscopic flow. Our analysis is based on long-term direct
numerical simulation trajectories of turbulent convection in a closed cubic
cell at a Prandtl number and Rayleigh numbers and
for a time lag of convective free-fall time units. The simulations
resolve vortices and plumes of all physically relevant scales resulting in a
state space spanned by more than 3.5 million degrees of freedom. The transition
dynamics between the large-scale circulation states can be captured by the
transition manifold analysis with only two collective variables which implies a
reduction of the data dimension by a factor of more than a million. Our method
demonstrates that cessations and subsequent reversals of the large-scale flow
are unlikely in the present setup and thus paves the way to the development of
efficient reduced-order models of the macroscopic complex nonlinear dynamical
system.Comment: 24 pages, 12 Figures, 1 tabl
- …