67 research outputs found

    Blinded Predictions and Post Hoc Analysis of the Second Solubility Challenge Data: Exploring Training Data and Feature Set Selection for Machine and Deep Learning Models

    Get PDF
    Accurate methods to predict solubility from molecular structure are highly sought after in the chemical sciences. To assess the state of the art, the American Chemical Society organized a "Second Solubility Challenge"in 2019, in which competitors were invited to submit blinded predictions of the solubilities of 132 drug-like molecules. In the first part of this article, we describe the development of two models that were submitted to the Blind Challenge in 2019 but which have not previously been reported. These models were based on computationally inexpensive molecular descriptors and traditional machine learning algorithms and were trained on a relatively small data set of 300 molecules. In the second part of the article, to test the hypothesis that predictions would improve with more advanced algorithms and higher volumes of training data, we compare these original predictions with those made after the deadline using deep learning models trained on larger solubility data sets consisting of 2999 and 5697 molecules. The results show that there are several algorithms that are able to obtain near state-of-the-art performance on the solubility challenge data sets, with the best model, a graph convolutional neural network, resulting in an RMSE of 0.86 log units. Critical analysis of the models reveals systematic differences between the performance of models using certain feature sets and training data sets. The results suggest that careful selection of high quality training data from relevant regions of chemical space is critical for prediction accuracy but that other methodological issues remain problematic for machine learning solubility models, such as the difficulty in modeling complex chemical spaces from sparse training data sets

    Young and Intermediate-age Distance Indicators

    Full text link
    Distance measurements beyond geometrical and semi-geometrical methods, rely mainly on standard candles. As the name suggests, these objects have known luminosities by virtue of their intrinsic proprieties and play a major role in our understanding of modern cosmology. The main caveats associated with standard candles are their absolute calibration, contamination of the sample from other sources and systematic uncertainties. The absolute calibration mainly depends on their chemical composition and age. To understand the impact of these effects on the distance scale, it is essential to develop methods based on different sample of standard candles. Here we review the fundamental properties of young and intermediate-age distance indicators such as Cepheids, Mira variables and Red Clump stars and the recent developments in their application as distance indicators.Comment: Review article, 63 pages (28 figures), Accepted for publication in Space Science Reviews (Chapter 3 of a special collection resulting from the May 2016 ISSI-BJ workshop on Astronomical Distance Determination in the Space Age

    Magnetic Field Amplification in Galaxy Clusters and its Simulation

    Get PDF
    We review the present theoretical and numerical understanding of magnetic field amplification in cosmic large-scale structure, on length scales of galaxy clusters and beyond. Structure formation drives compression and turbulence, which amplify tiny magnetic seed fields to the microGauss values that are observed in the intracluster medium. This process is intimately connected to the properties of turbulence and the microphysics of the intra-cluster medium. Additional roles are played by merger induced shocks that sweep through the intra-cluster medium and motions induced by sloshing cool cores. The accurate simulation of magnetic field amplification in clusters still poses a serious challenge for simulations of cosmological structure formation. We review the current literature on cosmological simulations that include magnetic fields and outline theoretical as well as numerical challenges.Comment: 60 pages, 19 Figure

    Physical Processes in Star Formation

    Get PDF
    © 2020 Springer-Verlag. The final publication is available at Springer via https://doi.org/10.1007/s11214-020-00693-8.Star formation is a complex multi-scale phenomenon that is of significant importance for astrophysics in general. Stars and star formation are key pillars in observational astronomy from local star forming regions in the Milky Way up to high-redshift galaxies. From a theoretical perspective, star formation and feedback processes (radiation, winds, and supernovae) play a pivotal role in advancing our understanding of the physical processes at work, both individually and of their interactions. In this review we will give an overview of the main processes that are important for the understanding of star formation. We start with an observationally motivated view on star formation from a global perspective and outline the general paradigm of the life-cycle of molecular clouds, in which star formation is the key process to close the cycle. After that we focus on the thermal and chemical aspects in star forming regions, discuss turbulence and magnetic fields as well as gravitational forces. Finally, we review the most important stellar feedback mechanisms.Peer reviewedFinal Accepted Versio

    Liver transplantation for patients with human immunodeficiency virus and hepatitis C virus coinfection with special reference to hemophiliac recipients in Japan.

    Get PDF
    Liver transplantation for patients with hepatitis C virus (HCV) and human immunodeficiency virus (HIV) remains challenging. The advent of highly active antiretroviral therapy (HAART) for HIV has reduced mortality from opportunistic infection related to acquired immunodeficiency syndrome dramatically, while about 50% of patients die of end-stage liver cirrhosis resulting from HCV. In Japan, liver cirrhosis frequently develops after HCV-HIV coinfection resulting from previously transfused infected blood products for hemophilia. The problems of liver transplantation for those patients arise from the need to control calcineurin inhibitor with HAART drugs, the difficulty of using interferon after liver transplantation with HAART, and the need to control intraoperative coagulopathy associated with hemophilia. We review published reports of liver transplantation for these patients in the updated world literature

    Whole-genome sequencing reveals host factors underlying critical COVID-19

    Get PDF
    Critical COVID-19 is caused by immune-mediated inflammatory lung injury. Host genetic variation influences the development of illness requiring critical care1 or hospitalization2,3,4 after infection with SARS-CoV-2. The GenOMICC (Genetics of Mortality in Critical Care) study enables the comparison of genomes from individuals who are critically ill with those of population controls to find underlying disease mechanisms. Here we use whole-genome sequencing in 7,491 critically ill individuals compared with 48,400 controls to discover and replicate 23 independent variants that significantly predispose to critical COVID-19. We identify 16 new independent associations, including variants within genes that are involved in interferon signalling (IL10RB and PLSCR1), leucocyte differentiation (BCL11A) and blood-type antigen secretor status (FUT2). Using transcriptome-wide association and colocalization to infer the effect of gene expression on disease severity, we find evidence that implicates multiple genes—including reduced expression of a membrane flippase (ATP11A), and increased expression of a mucin (MUC1)—in critical disease. Mendelian randomization provides evidence in support of causal roles for myeloid cell adhesion molecules (SELE, ICAM5 and CD209) and the coagulation factor F8, all of which are potentially druggable targets. Our results are broadly consistent with a multi-component model of COVID-19 pathophysiology, in which at least two distinct mechanisms can predispose to life-threatening disease: failure to control viral replication; or an enhanced tendency towards pulmonary inflammation and intravascular coagulation. We show that comparison between cases of critical illness and population controls is highly efficient for the detection of therapeutically relevant mechanisms of disease

    Value of VKORC1 (−1639G>A) rs9923231 genotyping in predicting warfarin dose: A replication study in South Indian population

    No full text
    Objective: Warfarin is the most commonly prescribed oral anticoagulant, although having a narrow therapeutic index and wide interindividual variability. The aim of this study was to replicate the utility of VKORC1 (−1639G>A) rs9923231 genotyping in predicting the mean daily dose and to evaluate its ability to categorize warfarin-treated patients to high-, intermediate-, or low-dose categories in the South Indian population. Materials and methods: A cohort of 222 warfarin-treated patients was genotyped using restriction fragment length polymorphism method. The influence of the rs9923231 polymorphism on the variations in the mean daily dose was compared using one-way analysis of variance and linear regression analysis. Discriminatory ability of the rs9923231 polymorphism to group the patients into ordered dose categories was assessed by estimating the proportional odds ratios using the ordered logit regression analysis. Results: The frequency of AA genotype and A allele in the study sample was found to be 1.8% and 9.23%, respectively, which was similar to reports from other South Indian populations. The mean daily dose required to achieve the optimum international normalized ratio was significantly lower in AA homozygous genotype carriers (3.99 ± 1.67 mg/day) and GA heterozygous (4.26 ± 1.57 mg/day) compared to the GG genotype carriers (5.51 ± 2.13 mg/day), p = 0.003. The A allele carriers (GA+AA genotypes) had a 3.23 higher odds of being grouped as a low-dose requiring category compared to non-carriers (95% CI 1.49–6.98, p = 0.003). Conclusions: These preliminary results strongly support the use of VKORC1 (−1639G>A) rs9923231 polymorphism for genetically guided initial warfarin dosing in South Indian patients with heart valve replacements. Keywords: VKORC1, Polymorphism, Warfarin, Pharmacogenetics, Regressio
    • …
    corecore