263 research outputs found
Neural networks in interpretation of electronic core-level spectra
We explore the applicability of artificial intelligence for molecular
structure - core-level spectrum interpretation. We focus on the electronic
Hamiltonian using the HO molecule in the classical-nuclei approximation as
our test system. For a systematic view we studied both predicting structures
from spectra and, vice versa, spectra from structures, using polynomial
approaches and neural networks. We find predicting spectra easier than
predicting structures, where a tighter grid of the spectrum improves
prediction. However, the accuracy of the structure prediction worsens when
moving outwards from the center of mass of the training set in the structural
parameter space
Machine learning in interpretation of electronic core-level spectra
Electronic transitions involving core-level orbitals offer a localized, atomic-site and element specific peek window into statistical systems such as molecular liquids. Although formally understood, the complex relation between structure and spectrum -- and the effect of statistical averaging of highly differing spectra of individual structures -- render the analysis of an ensemble-averaged core-level spectrum complicated. We explore the applicability of machine learning for molecular structure -- core-level spectrum interpretation. We focus on the electronic Hamiltonian using the \ce{H2O} molecule in the classical-nuclei approximation as our test system. For a systematic view we studied both predicting structures from spectra and, vice versa, spectra from structures, using polynomial approaches and neural networks. We find predicting spectra easier than predicting structures, where a tighter grid (even unphysical) of the spectrum improves prediction, possibly inviting for over-interpretation of the model. The accuracy of the structure prediction worsens when moving outwards from the center of mass of the training set in the structural parameter space, which can not be overcome by model selection based on generalizability.</p
Coronary artery disease, genetic risk and the metabolome in young individuals [version 2; referees: 2 approved]
Background: Genome-wide association studies have identified genetic variants associated with coronary artery disease (CAD) in adults – the leading cause of death worldwide. It often occurs later in life, but variants may impact CAD-relevant phenotypes early and throughout the life-course. Cohorts with longitudinal and genetic data on thousands of individuals are letting us explore the antecedents of this adult disease. Methods: 148 metabolites, with a focus on the lipidome, measured using nuclear magnetic resonance (1H-NMR) spectroscopy, and genotype data were available from 5,907 individuals at ages 7, 15, and 17 years from the Avon Longitudinal Study of Parents and Children (ALSPAC) cohort. Linear regression was used to assess the association between the metabolites and an adult-derived genetic risk score (GRS) of CAD comprising 146 variants. Individual variant-metabolite associations were also examined. Results: The CAD-GRS associated with 118 of 148 metabolites (false discovery rate [FDR] < 0.05), the strongest associations being with low-density lipoprotein (LDL) and atherogenic non-LDL subgroups. Nine of 146 variants in the GRS associated with one or more metabolites (FDR < 0.05). Seven of these are within lipid loci: rs11591147 PCSK9, rs12149545 HERPUD1-CETP, rs17091891 LPL, rs515135 APOB, rs602633 CELSR2-PSRC1, rs651821 APOA5, rs7412 APOE-APOC1. All associated with metabolites in the LDL or atherogenic non-LDL subgroups or both including aggregate cholesterol measures. The other two variants identified were rs112635299 SERPINA1 and rs2519093 ABO. Conclusions: Genetic variants that influence CAD risk in adults are associated with large perturbations in metabolite levels in individuals as young as seven. The variants identified are mostly within lipid-related loci and the metabolites they associated with are primarily linked to lipoproteins. Along with further research, this knowledge could allow for preventative measures, such as increased monitoring of at-risk individuals and perhaps treatment earlier in life, to be taken years before any symptoms of the disease arise
EpiMetal:an open-source graphical web browser tool for easy statistical analyses in epidemiology and metabolomics
MOTIVATION: An intuitive graphical interface that allows statistical analyses and visualizations of extensive data without any knowledge of dedicated statistical software or programming. IMPLEMENTATION: EpiMetal is a single-page web application written in JavaScript, to be used via a modern desktop web browser. GENERAL FEATURES: Standard epidemiological analyses and self-organizing maps for data-driven metabolic profiling are included. Multiple extensive datasets with an arbitrary number of continuous and category variables can be integrated with the software. Any snapshot of the analyses can be saved and shared with others via a www-link. We demonstrate the usage of EpiMetal using pilot data with over 500 quantitative molecular measures for each sample as well as in two large-scale epidemiological cohorts (N >10 000). AVAILABILITY: The software usage exemplar and the pilot data are open access online at [http://EpiMetal.computationalmedicine.fi]. MIT licensed source code is available at the Github repository at [https://github.com/amergin/epimetal]
Genetic and observational evidence : No independent role for cholesterol efflux over static high-density lipoprotein concentration measures in coronary heart disease risk assessment
Background Observational findings for high-density lipoprotein (HDL)-mediated cholesterol efflux capacity (HDL-CEC) and coronary heart disease (CHD) appear inconsistent, and knowledge of the genetic architecture of HDL-CEC is limited. Objectives A large-scale observational study on the associations of HDL-CEC and other HDL-related measures with CHD and the largest genome-wide association study (GWAS) of HDL-CEC. Participants/methods Six independent cohorts were included with follow-up data for 14,438 participants to investigate the associations of HDL-related measures with incident CHD (1,570 events). The GWAS of HDL-CEC was carried out in 20,372 participants. Results HDL-CEC did not associate with CHD when adjusted for traditional risk factors and HDL cholesterol (HDL-C). In contradiction, almost all HDL-related concentration measures associated consistently with CHD after corresponding adjustments. There were no genetic loci associated with HDL-CEC independent of HDL-C and triglycerides. Conclusion HDL-CEC is not unequivocally associated with CHD in contrast to HDL-C, apolipoprotein A-I, and most of the HDL subclass particle concentrations.Peer reviewe
Direct Estimation of HDL-Mediated Cholesterol Efflux Capacity from Serum
BACKGROUND: HDL-mediated cholesterol efflux capacity (HDL-CEC) is a functional attribute that may have a protective role in atherogenesis. However, the estimation of HDL-CEC is based on in vitro cell assays that are laborious and hamper large-scale phenotyping. METHODS: Here, we present a cost-effective high-throughput nuclear magnetic resonance (NMR) spectroscopy method to estimate HDL-CEC directly from serum. We applied the new method in a population-based study of 7603 individuals including 574 who developed incident coronary heart disease (CHD) during 15 years of follow-up, making this the largest quantitative study for HDL-CEC. RESULTS: As estimated by NMR-spectroscopy, a 1-SD higher HDL-CEC was associated with a lower risk of incident CHD (hazards ratio, 0.86; 95% CI, 0.79-0.93, adjusted for traditional risk factors and HDL-C). These findings are consistent with published associations based on in vitro cell assays. CONCLUSIONS: These corroborative large-scale findings provide further support for a potential protective role of HDL-CEC in CHD and substantiate this new method and its future applications. (C) 2019 American Association for Clinical ChemistryPeer reviewe
Recommended from our members
Assessing multivariate gene-metabolome associations with rare variants using Bayesian reduced rank regression
Motivation: A typical genome-wide association study searches for associations between single nucleotide polymorphisms (SNPs) and a univariate phenotype. However, there is a growing interest to investigate associations between genomics data and multivariate phenotypes, for example, in gene expression or metabolomics studies. A common approach is to perform a univariate test between each genotype–phenotype pair, and then to apply a stringent significance cutoff to account for the large number of tests performed. However, this approach has limited ability to uncover dependencies involving multiple variables. Another trend in the current genetics is the investigation of the impact of rare variants on the phenotype, where the standard methods often fail owing to lack of power when the minor allele is present in only a limited number of individuals. Results: We propose a new statistical approach based on Bayesian reduced rank regression to assess the impact of multiple SNPs on a high-dimensional phenotype. Because of the method’s ability to combine information over multiple SNPs and phenotypes, it is particularly suitable for detecting associations involving rare variants. We demonstrate the potential of our method and compare it with alternatives using the Northern Finland Birth Cohort with 4702 individuals, for whom genome-wide SNP data along with lipoprotein profiles comprising 74 traits are available. We discovered two genes (XRCC4 and MTHFD2L) without previously reported associations, which replicated in a combined analysis of two additional cohorts: 2390 individuals from the Cardiovascular Risk in Young Finns study and 3659 individuals from the FINRISK study. Availability and implementation: R-code freely available for download at http://users.ics.aalto.fi/pemartti/gene_metabolome/. Contact: [email protected]; [email protected] Supplementary information: Supplementary data are available at Bioinformatics online
Characterization of the metabolic profile associated with serum 25-hydroxyvitamin D : a cross-sectional analysis in population-based data
Background: Numerous observational studies have observed associations between vitamin D deficiency and cardiometabolic diseases, but these findings might be confounded by obesity. A characterization of the metabolic profile associated with serum 25-hydroxyvitamin D [25(OH)D] levels, in general and stratified by abdominal obesity, may help to untangle the relationship between vitamin D, obesity and cardiometabolic health. Methods: Serum metabolomics measurements were obtained from a nuclear magnetic resonance spectroscopy (NMR)- and a mass spectrometry (MS)-based platform. The discovery was conducted in 1726 participants of the population-based KORA-F4 study, in which the associations of the concentrations of 415 metabolites with 25(OH)D levels were assessed in linear models. The results were replicated in 6759 participants (NMR) and 609 (MS) participants, respectively, of the population-based FINRISK 1997 study. Results: Mean [standard deviation (SD)] 25(OH)D levels were 15.2 (7.5) ng/ml in KORA F4 and 13.8 (5.9) ng/ml in FINRISK 1997; 37 metabolites were associated with 25(OH) D in KORA F4 at P <0.05/415. Of these, 30 associations were replicated in FINRISK 1997 at P <0.05/37. Among these were constituents of (very) large very-low-density lipoprotein and small low-density lipoprotein subclasses and related measures like serum triglycerides as well as fatty acids and measures reflecting the degree of fatty acid saturation. The observed associations were independent of waist circumference and generally similar in abdominally obese and non-obese participants. Conclusions: Independently of abdominal obesity, higher 25(OH)D levels were associated with a metabolite profile characterized by lower concentrations of atherogenic lipids and a higher degree of fatty acid polyunsaturation. These results indicate that the relationship between vitamin D deficiency and cardiometabolic diseases is unlikely to merely reflect obesity-related pathomechanisms.Peer reviewe
- …