44 research outputs found
Detecting dynamic domains and local fluctuations in complex molecular systems via timelapse neighbors shuffling
Many complex molecular systems owe their properties to local dynamic
rearrangements or fluctuations that, despite the rise of machine learning (ML)
and sophisticated structural descriptors, remain often difficult to detect.
Here we show an ML framework based on a new descriptor, named Local
Environments and Neighbors Shuffling (LENS), which allows identifying dynamic
domains and detecting local fluctuations in a variety of systems via tracking
how much the surrounding of each molecular unit changes over time in terms of
neighbor individuals. Statistical analysis of the LENS time-series data allows
to blindly detect different dynamic domains within various types of molecular
systems with, e.g., liquid-like, solid-like, or diverse dynamics, and to track
local fluctuations emerging within them in an efficient way. The approach is
found robust, versatile, and, given the abstract definition of the LENS
descriptor, capable of shedding light on the dynamic complexity of a variety of
(not necessarily molecular) systems
Machine learning of microscopic structure-dynamics relationships in complex molecular systems
In many complex molecular systems, the macroscopic ensemble's properties are
controlled by microscopic dynamic events (or fluctuations) that are often
difficult to detect via pattern-recognition approaches. Discovering the
relationships between local structural environments and the dynamical events
originating from them would allow unveiling microscopic level
structure-dynamics relationships fundamental to understand the macroscopic
behavior of complex systems. Here we show that, by coupling advanced structural
(e.g., Smooth Overlap of Atomic Positions, SOAP) with local dynamical
descriptors (e.g., Local Environment and Neighbor Shuffling, LENS) in a unique
dataset, it is possible to improve both individual SOAP- and LENS-based
analyses, obtaining a more complete characterization of the system under study.
As representative examples, we use various molecular systems with diverse
internal structural dynamics. On the one hand, we demonstrate how the
combination of structural and dynamical descriptors facilitates decoupling
relevant dynamical fluctuations from noise, overcoming the intrinsic limits of
the individual analyses. Furthermore, machine learning approaches also allow
extracting from such combined structural/dynamical dataset useful
microscopic-level relationships, relating key local dynamical events (e.g.,
LENS fluctuations) occurring in the systems to the local structural (SOAP)
environments they originate from. Given its abstract nature, we believe that
such an approach will be useful in revealing hidden microscopic
structure-dynamics relationships fundamental to rationalize the behavior of a
variety of complex systems, not necessarily limited to the atomistic and
molecular scales
TimeSOAP: Tracking high-dimensional fluctuations in complex molecular systems via time variations of SOAP spectra
Many molecular systems and physical phenomena are controlled by local fluctuations and microscopic dynamical rearrangements of the constitutive interacting units that are often difficult to detect. This is the case, for example, of phase transitions, phase equilibria, nucleation events, and defect propagation, to mention a few. A detailed comprehension of local atomic environments and of their dynamic rearrangements is essential to understand such phenomena and also to draw structure-property relationships useful to unveil how to control complex molecular systems. Considerable progress in the development of advanced structural descriptors [e.g., Smooth Overlap of Atomic Position (SOAP), etc.] has certainly enhanced the representation of atomic-scale simulations data. However, despite such efforts, local dynamic environment rearrangements still remain difficult to elucidate. Here, exploiting the structurally rich description of atomic environments of SOAP and building on the concept of time-dependent local variations, we developed a SOAP-based descriptor, TimeSOAP (τSOAP), which essentially tracks time variations in local SOAP environments surrounding each molecule (i.e., each SOAP center) along ensemble trajectories. We demonstrate how analysis of the time-series τSOAP data and of their time derivatives allows us to detect dynamic domains and track instantaneous changes of local atomic arrangements (i.e., local fluctuations) in a variety of molecular systems. The approach is simple and general, and we expect that it will help shed light on a variety of complex dynamical phenomena
Unsupervised Data-Driven Reconstruction of Molecular Motifs in Simple to Complex Dynamic Micelles
The reshuffling mobility of molecular building blocks in self-assembled micelles is a key determinant of many their interesting properties, from emerging morphologies and surface compartmentalization, to dynamic reconfigurability and stimuli-responsiveness. However, the microscopic details of such complex structural dynamics are typically nontrivial to elucidate, especially in multicomponent assemblies. Here we show a machine-learning approach that allows us to reconstruct the structural and dynamic complexity of mono- and bicomponent surfactant micelles from high-dimensional data extracted from equilibrium molecular dynamics simulations. Unsupervised clustering of smooth overlap of atomic position (SOAP) data enables us to identify, in a set of multicomponent surfactant micelles, the dominant local molecular environments that emerge within them and to retrace their dynamics, in terms of exchange probabilities and transition pathways of the constituent building blocks. Tested on a variety of micelles differing in size and in the chemical nature of the constitutive self-assembling units, this approach effectively recognizes the molecular motifs populating them in an exquisitely agnostic and unsupervised way, and allows correlating them to their composition in terms of constitutive surfactant species
Impact evaluation of biomass used in small combustion activities sector on air emissions: Analyses of emissions from Alpine, Adriatic-Ionian and Danube EU macro-regions by using the EDGAR emissions inventory
The emissions from small stationary combustion activities sector, in particular from the energy needs for residential buildings, have significant shares in total emissions of EU28. Therefore, measures to mitigate the emissions from this less regulated sector related to implementation checking are needed. In this study, we analysed the changes in fuel mix for this sector over 1990-2012 period, the emissions and their distribution over the areas covered by European Union Strategy for Alpine macro-region (EUSALP), European Union Strategy for Adriatic and Ionian macro-region (EUSAIR) and European Union Strategy for Danube macro-region (EUSDR). The emissions gridmaps of fine particulate matter (PM2.5), black carbon (BC) and benzo(a)pyren (BaP) are presented for the year 2010; in specific circumstances, these pollutants are known to produce negative effects on health. For this research, we used the data and information of the Emissions Database for Global Atmospheric Research (EDGAR) versions v4.3.2 and v4.tox3.
Accurate emissions estimates are important to evaluate the impacts of fuel combustion in small stationary combustion activities sector on air quality, human health and crops. Inventories of GHGs, air pollutants and toxic pollutants included in EDGAR are developed by using, as input, fuel consumption from IEA (2014) and emissions factors from scientific literature and official guidebooks such as EMEP/EEA (2013). Working together with emissions inventory experts from selected countries in these macro-regions, the effects of improvements of fuel consumption statistics, biomass in particular, on emissions in the latest years have been quantified by comparing EDGAR data with national data.
Besides sectorial emissions estimation, the emissions distribution is also important in the inventory development process. In order to distribute emissions consistently for all countries included in Alpine, Adriatic-Ionian and Danube macro-regions, the EDGAR team upgraded the WEB-based gridding tool with a module for small stationary combustion activities. Emissions estimation and distribution are key elements in preparing a complete input for chemical transport models and further evaluate the impacts of these emissions on air quality, health and crops.
This report aims to provide the policy makers and scientists insights on the representativeness and uncertainty of local emissions from the residential sector that play an important role on air quality. These datasets can be used as input for the atmospheric chemical transport models for air pollutants and can illustrate the importance of emission inventory uncertainties and discrepancies.JRC.C.5-Air and Climat
Sampling Real‐Time Atomic Dynamics in Metal Nanoparticles by Combining Experiments, Simulations, and Machine Learning
Even at low temperatures, metal nanoparticles (NPs) possess atomic dynamics that are key for their properties but challenging to elucidate. Recent experimental advances allow obtaining atomic-resolution snapshots of the NPs in realistic regimes, but data acquisition limitations hinder the experimental reconstruction of the atomic dynamics present within them. Molecular simulations have the advantage that these allow directly tracking the motion of atoms over time. However, these typically start from ideal/perfect NP structures and, suffering from sampling limits, provide results that are often dependent on the initial/putative structure and remain purely indicative. Here, by combining state-of-the-art experimental and computational approaches, how it is possible to tackle the limitations of both approaches and resolve the atomistic dynamics present in metal NPs in realistic conditions is demonstrated. Annular dark-field scanning transmission electron microscopy enables the acquisition of ten high-resolution images of an Au NP at intervals of 0.6 s. These are used to reconstruct atomistic 3D models of the real NP used to run ten independent molecular dynamics simulations. Machine learning analyses of the simulation trajectories allow resolving the real-time atomic dynamics present within the NP. This provides a robust combined experimental/computational approach to characterize the structural dynamics of metal NPs in realistic conditions
Automatic Optimization of Lipid Models in the Martini Force Field Using SwarmCG.
After two decades of continued development of the Martini coarse-grained force field (CG FF), further refinment of the already rather accurate Martini lipid models has become a demanding task that could benefit from integrative data-driven methods. Automatic approaches are increasingly used in the development of accurate molecular models, but they typically make use of specifically designed interaction potentials that transfer poorly to molecular systems or conditions different than those used for model calibration. As a proof of concept, here, we employ SwarmCG, an automatic multiobjective optimization approach facilitating the development of lipid force fields, to refine specifically the bonded interaction parameters in building blocks of lipid models within the framework of the general Martini CG FF. As targets of the optimization procedure, we employ both experimental observables (top-down references: area per lipid and bilayer thickness) and all-atom molecular dynamics simulations (bottom-up reference), which respectively inform on the supra-molecular structure of the lipid bilayer systems and on their submolecular dynamics. In our training sets, we simulate at different temperatures in the liquid and gel phases up to 11 homogeneous lamellar bilayers composed of phosphatidylcholine lipids spanning various tail lengths and degrees of (un)saturation. We explore different CG representations of the molecules and evaluate improvements a posteriori using additional simulation temperatures and a portion of the phase diagram of a DOPC/DPPC mixture. Successfully optimizing up to ∼80 model parameters within still limited computational budgets, we show that this protocol allows the obtainment of improved transferable Martini lipid models. In particular, the results of this study demonstrate how a fine-tuning of the representation and parameters of the models may improve their accuracy and how automatic approaches, such as SwarmCG, may be very useful to this end. </p
Survey on carbapenem-resistant bacteria in pigs at slaughter and comparison with human clinical isolates in Italy
This study is focused on resistance to carbapenems and third-generation cephalosporins in Gram-negative microorganisms isolated from swine, whose transmission to humans via pork consumption cannot be excluded. In addition, the common carriage of carbapenem-resistant (CR) bacteria between humans and pigs was evaluated. Sampling involved 300 faecal samples collected from slaughtered pigs and 300 urine samples collected from 187 hospitalised patients in Parma Province (Italy). In swine, MIC testing confirmed resistance to meropenem for isolates of Pseudomonas aeruginosa and Pseudomonas oryzihabitans and resistance to cefotaxime and ceftazidime for Escherichia coli, Ewingella americana, Enterobacter agglomerans, and Citrobacter freundii. For Acinetobacter lwoffii, Aeromonas hydrofila, Burkolderia cepacia, Corynebacterium indologenes, Flavobacterium odoratum, and Stenotrophomonas maltophilia, no EUCAST MIC breakpoints were available. However, ESBL genes (blaCTXM-1, blaCTX-M-2, blaTEM-1, and blaSHV) and AmpC genes (blaCIT, blaACC, and blaEBC) were found in 38 and 16 isolates, respectively. P. aeruginosa was the only CR species shared by pigs (4/300 pigs; 1.3%) and patients (2/187; 1.1%). P. aeruginosa ST938 carrying blaPAO and blaOXA396 was detected in one pig as well as an 83-year-old patient. Although no direct epidemiological link was demonstrable, SNP calling and cgMLST showed a genetic relationship of the isolates (86 SNPs and 661 allele difference), thus suggesting possible circulation of CR bacteria between swine and humans
Prediction of early distant recurrence in upfront resectable pancreatic adenocarcinoma: A multidisciplinary, machine learning-based approach
Despite careful selection, the recurrence rate after upfront surgery for pancreatic adenocarcinoma can be very high. We aimed to construct and validate a model for the prediction of early distant recurrence (<12 months from index surgery) after upfront pancreaticoduodenectomy. After exclusions, 147 patients were retrospectively enrolled. Preoperative clinical and radiological (CT-based) data were systematically evaluated; moreover, 182 radiomics features (RFs) were extracted. Most significant RFs were selected using minimum redundancy, robustness against delineation uncertainty and an original machine learning bootstrap-based method. Patients were split into training (n = 94) and validation cohort (n = 53). Multivariable Cox regression analysis was first applied on the training cohort; the resulting prognostic index was then tested in the validation cohort. Clinical (serum level of CA19.9), radiological (necrosis), and radiomic (SurfAreaToVolumeRatio) features were significantly associated with the early resurge of distant recurrence. The model combining these three variables performed well in the training cohort (p = 0.0015,HR = 3.58,95%CI = 1.98–6.71) and was then confirmed in the validation cohort (p = 0.0178,HR = 5.06,95%CI = 1.75–14.58). The comparison of survival curves between low and high-risk patients showed a p-value <0.0001. Our model may help to better define resectability status, thus providing an actual aid for pancreatic adenocarcinoma patients’ management (upfront surgery vs. neoadjuvant chemotherapy). Independent validations are warranted